NVIDIA releases Dynamo 1.0 software for AI inference scaling

NVIDIA Enters Production With Dynamo, the Broadly Adopted Inference Operating System for AI Factories

March 16, 2026 4:36 PM EDT

News Summary:

NVIDIA Dynamo 1.0 provides a production-grade, open source foundation for inference at scale.

Dynamo and NVIDIA TensorRT-LLM optimizations integrate natively into open source frameworks such as LangChain, llm-d, LMCache, SGLang and vLLM to boost inference performance.

Dynamo boosts inference performance of NVIDIA Blackwell GPUs by up to 7x, lowering token cost and increasing revenue opportunity for millions of GPUs with free, open source software.

NVIDIA inference platform integrated by cloud service providers, Amazon Web Services...