NVIDIA Dynamo vLLM Runtime
Container

The Dynamo vLLM runtime image is a containerized build of Dynamo and vLLM. It serves as the base runtime environment for vLLM-based inference with Dynamo's distributed inference framework.

Layer | Label | Created
sha256:a3ed95caeb02ffe68cdd9fd84406680ae93d633cb16422d00e8a7c22955b46d4 | CMD | 03/16/2026 9:55 PM UTC
sha256:a3ed95caeb02ffe68cdd9fd84406680ae93d633cb16422d00e8a7c22955b46d4 | ENTRYPOINT /opt/nvidia/nvidia_entrypoint.sh | 03/16/2026 9:55 PM UTC
sha256:a3ed95caeb02ffe68cdd9fd84406680ae93d633cb16422d00e8a7c22955b46d4 | USER dynamo | 03/16/2026 9:55 PM UTC
sha256:a3ed95caeb02ffe68cdd9fd84406680ae93d633cb16422d00e8a7c22955b46d4 | ENV EFA_VERSION=1.45.1 | 03/16/2026 9:55 PM UTC
sha256:32f112e3802cadcab3543160f4d2aa607b3cc1c62140d57b4f5441384f40e927 | RUN (command shown below) | 03/16/2026 9:55 PM UTC
    DEVICE=cuda PYTHON_VERSION=3.12 SITE_PACKAGES=/opt/dynamo/venv/lib/python3.12/site-packages ENABLE_KVBM=true ENABLE_GPU_MEMORY_SERVICE=true ENABLE_MODELEXPRESS_P2P=false MODELEXPRESS_REF=3d73992ce6c10e52ddc54f7f12af35d27e173f15 DYNAMO_COMMIT_SHA=c758eb5b58872fa9b2880b9ea29d058b141bb8ec EFA_VERSION=1.45.1 /bin/bash -l -o pipefail -c
    mkdir -p /tmp/efa &&
    cd /tmp/efa &&
    curl --retry 3 --retry-delay 2 -fsSL -o aws-efa-installer-${EFA_VERSION}.tar.gz https://efa-installer.amazonaws.com/aws-efa-installer-${EFA_VERSION}.tar.gz &&
    tar -xf aws-efa-installer-${EFA_VERSION}.tar.gz &&
    cd aws-efa-installer &&
    apt-get update &&
    ./efa_installer.sh -y --skip-kmod --skip-limit-conf --no-verify &&
    rm -rf /tmp/efa &&
    rm -rf /opt/amazon/aws-ofi-nccl &&
    ldconfig
sha256:a3ed95caeb02ffe68cdd9fd84406680ae93d633cb16422d00e8a7c22955b46d4 | USER root | 03/16/2026 9:52 PM UTC
sha256:a3ed95caeb02ffe68cdd9fd84406680ae93d633cb16422d00e8a7c22955b46d4 | ARG EFA_VERSION=1.45.1 | 03/16/2026 9:52 PM UTC
sha256:a3ed95caeb02ffe68cdd9fd84406680ae93d633cb16422d00e8a7c22955b46d4 | CMD | 03/16/2026 9:52 PM UTC
sha256:a3ed95caeb02ffe68cdd9fd84406680ae93d633cb16422d00e8a7c22955b46d4 | ENTRYPOINT /opt/nvidia/nvidia_entrypoint.sh | 03/16/2026 9:52 PM UTC
sha256:a3ed95caeb02ffe68cdd9fd84406680ae93d633cb16422d00e8a7c22955b46d4 | ENV VLLM_USE_FLASHINFER_SAMPLER=1 | 03/16/2026 9:52 PM UTC
...
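
The layer history is listed newest first, so reading it bottom to top gives the approximate order of the build instructions. The fragment below is a hedged reconstruction of the visible entries only, not the image's actual Dockerfile. It assumes that the repeated `sha256:a3ed95ca...` digest is the empty placeholder layer recorded for metadata-only instructions, that the `KEY=value` pairs in front of `/bin/bash` in the RUN entry are build arguments declared in elided layers, and that the `/bin/bash -l -o pipefail -c` prefix comes from a `SHELL` instruction set earlier. The exec-form `ENTRYPOINT`, the empty `CMD []`, and the literal `ENV` value are also assumptions about how the recorded values were written.

```dockerfile
# Reconstructed fragment, oldest visible entry first. The base image and
# all earlier layers are elided in the listing, so this is not buildable
# on its own.

# Assumption: set in an elided layer; inferred from the recorded shell prefix.
SHELL ["/bin/bash", "-l", "-o", "pipefail", "-c"]

ENV VLLM_USE_FLASHINFER_SAMPLER=1
ENTRYPOINT ["/opt/nvidia/nvidia_entrypoint.sh"]
CMD []

# EFA (AWS Elastic Fabric Adapter) installer version used by the RUN step.
ARG EFA_VERSION=1.45.1

# Package installation needs root.
USER root

# Download and run the EFA installer, then clean up.
# The installer is asked to skip the kernel module, the limits
# configuration, and verification; the bundled aws-ofi-nccl directory
# is removed afterwards and the linker cache is refreshed.
RUN mkdir -p /tmp/efa && \
    cd /tmp/efa && \
    curl --retry 3 --retry-delay 2 -fsSL \
        -o aws-efa-installer-${EFA_VERSION}.tar.gz \
        https://efa-installer.amazonaws.com/aws-efa-installer-${EFA_VERSION}.tar.gz && \
    tar -xf aws-efa-installer-${EFA_VERSION}.tar.gz && \
    cd aws-efa-installer && \
    apt-get update && \
    ./efa_installer.sh -y --skip-kmod --skip-limit-conf --no-verify && \
    rm -rf /tmp/efa && \
    rm -rf /opt/amazon/aws-ofi-nccl && \
    ldconfig

# Persist the version in the runtime environment.
ENV EFA_VERSION=1.45.1

# Drop back to the unprivileged runtime user and restate the entrypoint.
USER dynamo
ENTRYPOINT ["/opt/nvidia/nvidia_entrypoint.sh"]
CMD []
```

Only the RUN entry carries a distinct digest (`sha256:32f112e3...`), which is consistent with it being the one instruction in this span that changes the filesystem.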