Text Generation
Transformers
Safetensors
PyTorch
nemotron_h
nvidia
elastic
conversational
custom_code
jrd971000's picture
Add explicit note that elastic budget control is not yet in vLLM and is being worked on
4bcc432