NVIDIA
NVIDIA
Dynamo vLLM Runtime
Container
NVIDIA
NVIDIA
Dynamo vLLM Runtime

The Dynamo vLLM runtime image is a containerized build of Dynamo + vLLM which serves as the base runtime environment for vLLM based inference with Dynamo's distributed inference framework.

LayerLabelCreated
sha256:a3ed95caeb02ffe68cdd9fd84406680ae93d633cb16422d00e8a7c22955b46d4CMD
04/21/2026 4:52 PM UTC
sha256:a3ed95caeb02ffe68cdd9fd84406680ae93d633cb16422d00e8a7c22955b46d4ENTRYPOINT
/opt/nvidia/nvidia_entrypoint.sh
04/21/2026 4:52 PM UTC
sha256:a3ed95caeb02ffe68cdd9fd84406680ae93d633cb16422d00e8a7c22955b46d4USER
dynamo
04/21/2026 4:52 PM UTC
sha256:a3ed95caeb02ffe68cdd9fd84406680ae93d633cb16422d00e8a7c22955b46d4ENV
EFA_VERSION=1.45.1
04/21/2026 4:52 PM UTC
sha256:32f112e3802cadcab3543160f4d2aa607b3cc1c62140d57b4f5441384f40e927RUN
ARCH_ALT=x86_64 PYTHON_VERSION=3.12 SITE_PACKAGES=/opt/dynamo/venv/lib/python3.12/site-packages ENABLE_KVBM=true ENABLE_GPU_MEMORY_SERVICE=true ENABLE_MODELEXPRESS_P2P=false MODELEXPRESS_REF=3d73992ce6c10e52ddc54f7f12af35d27e173f15 DYNAMO_COMMIT_SHA=5dc542097ab165ff000fd1b6059a7b2288cc5bc7 EFA_VERSION=1.45.1 /bin/bash -l -o pipefail -c mkdir -p /tmp/efa &&
  cd /tmp/efa &&
  curl --retry 3 --retry-delay 2 -fsSL -o aws-efa-installer-${EFA_VERSION}.tar.gz https://efa-installer.amazonaws.com/aws-efa-installer-${EFA_VERSION}.tar.gz &&
  tar -xf aws-efa-installer-${EFA_VERSION}.tar.gz &&
  cd aws-efa-installer &&
  apt-get update &&
  ./efa_installer.sh -y --skip-kmod --skip-limit-conf --no-verify &&
  rm -rf /tmp/efa &&
  rm -rf /opt/amazon/aws-ofi-nccl &&
  ldconfig
04/21/2026 4:52 PM UTC
sha256:a3ed95caeb02ffe68cdd9fd84406680ae93d633cb16422d00e8a7c22955b46d4USER
root
04/21/2026 4:50 PM UTC
sha256:a3ed95caeb02ffe68cdd9fd84406680ae93d633cb16422d00e8a7c22955b46d4ARG
EFA_VERSION=1.45.1
04/21/2026 4:50 PM UTC
sha256:a3ed95caeb02ffe68cdd9fd84406680ae93d633cb16422d00e8a7c22955b46d4CMD
04/21/2026 4:50 PM UTC
sha256:a3ed95caeb02ffe68cdd9fd84406680ae93d633cb16422d00e8a7c22955b46d4ENTRYPOINT
/opt/nvidia/nvidia_entrypoint.sh
04/21/2026 4:50 PM UTC
sha256:a3ed95caeb02ffe68cdd9fd84406680ae93d633cb16422d00e8a7c22955b46d4ENV
VLLM_USE_FLASHINFER_SAMPLER=1
04/21/2026 4:50 PM UTC
...

NVIDIA uses cookies to improve your experience on our web site. We and our third-party partners also use cookies and other tools to collect and record information you provide as well as information about your interactions with our websites for performance improvement, analytics, and to assist in marketing efforts. By clicking "Accept All", you consent to our use of cookies and other tools as described in our Cookie Policy. You can manage your cookie settings by clicking on "Manage Settings." By continuing to use this site or by clicking one of the buttons below, you agree to our Terms of Service (which contains important waivers). Please see our Privacy Policy for more information on our privacy practices.