NVIDIA
Dynamo vLLM Runtime
Container

The Dynamo vLLM runtime image is a containerized build of Dynamo and vLLM that serves as the base runtime environment for vLLM-based inference with Dynamo's distributed inference framework.

Layer | Label | Created

445aa29374417d2c4f2f7d8e94970d24b71c07bf205cfc7d90494e6c328a0b3f | CONFIG | 11/26/2025 5:31 PM UTC
  Entrypoint /opt/nvidia/nvidia_entrypoint.sh; WorkingDir /workspace

8b3c92adb9fb2a52589242c66066475d8e0988fd098387ed322bced234b1ec20 | ENTRYPOINT | 11/26/2025 5:31 PM UTC
  /opt/nvidia/nvidia_entrypoint.sh

c955283df69e9cb08229c263593941acc7773ff15779ac55288c0322bf19ad39 | USER | 11/26/2025 5:31 PM UTC
  dynamo

737f6dda9a19a825c683abc2f389f4cbac1199c3288d029813006d0aea44a80a | RUN | 11/26/2025 5:31 PM UTC
  ARCH_ALT=x86_64 PYTHON_VERSION=3.12 ENABLE_KVBM=true DYNAMO_COMMIT_SHA=f49d6873e417ef82090ed492ef00b6939bd5a8d0 chmod 755 /opt/dynamo/.launch_screen &&
    echo 'source /opt/dynamo/venv/bin/activate' >> /etc/bash.bashrc &&
    echo 'cat /opt/dynamo/.launch_screen' >> /etc/bash.bashrc

b36dab52bfe91936b631a07fac2372259388a954a1c08723b23fd576d66e8021 | USER | 11/26/2025 5:31 PM UTC
  root

06b4745c575304ad2bf7c348345d6990461c15a854ab1856e6a1d8b43d738888 | RUN | 11/26/2025 5:31 PM UTC
  ARCH_ALT=x86_64 PYTHON_VERSION=3.12 ENABLE_KVBM=true DYNAMO_COMMIT_SHA=f49d6873e417ef82090ed492ef00b6939bd5a8d0 sed '/^#\s/d' /opt/dynamo/launch_message.txt > /opt/dynamo/.launch_screen

efc23c6fb9d9489c5f888efb882b102195dc5605f989a8bcec522221c3aa5408 | COPY | 11/26/2025 5:31 PM UTC
  --chown=dynamo: ATTRIBUTION* LICENSE /workspace/

225c49596df653141caf904c96e1370abeda768deefacf325b8c2bfc236de95f | COPY | 11/26/2025 5:31 PM UTC
  --chown=dynamo: . /workspace/

2aae9492cad9ebc1a87a0ea0091201e9a4bbb313b5845f51cccc5ccdfa5afb46 | RUN | 11/26/2025 5:31 PM UTC
  ARCH_ALT=x86_64 PYTHON_VERSION=3.12 ENABLE_KVBM=true DYNAMO_COMMIT_SHA=f49d6873e417ef82090ed492ef00b6939bd5a8d0 UV_GIT_LFS=1 uv pip install --no-cache --requirement /tmp/requirements.txt --requirement /tmp/requirements.test.txt

60403ff7524bb9aca47aa6cb7527743e1359d48a586daef0f2fd998e78cecce8 | RUN | 11/26/2025 5:31 PM UTC
  ARCH_ALT=x86_64 PYTHON_VERSION=3.12 ENABLE_KVBM=true DYNAMO_COMMIT_SHA=f49d6873e417ef82090ed492ef00b6939bd5a8d0 uv pip install /opt/dynamo/wheelhouse/ai_dynamo_runtime*.whl /opt/dynamo/wheelhouse/ai_dynamo*any.whl /opt/dynamo/wheelhouse/nixl/nixl*.whl &&
    if [ "${ENABLE_KVBM}" = "true" ]; then uv pip install /opt/dynamo/wheelhouse/kvbm*.whl; fi &&
    cd /opt/dynamo/benchmarks &&
    UV_GIT_LFS=1 uv pip install --no-cache . &&
    cd - &&
    rm -rf /opt/dynamo/benchmarks
...
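
Two of the layers listed above prepare the banner that an interactive shell prints. The `sed '/^#\s/d'` step copies launch_message.txt to .launch_screen while deleting every line that starts with `#` followed by whitespace, and a later step appends a `cat` of that file to /etc/bash.bashrc. The following is a minimal sketch of that filter, run against a made-up file in a temporary directory. The sample text and the scratch paths are assumptions for illustration and are not contents of the image; `\s` is a GNU sed extension, so the sketch assumes GNU sed.

```shell
#!/usr/bin/env bash
set -euo pipefail

# Hypothetical scratch directory; nothing here touches /opt/dynamo.
workdir="$(mktemp -d)"
trap 'rm -rf "$workdir"' EXIT

# Made-up launch message: two comment lines and two lines that should survive.
printf '%s\n' \
  '# SPDX header (comment, dropped)' \
  'Welcome to the runtime' \
  '#tag-without-space (kept)' \
  '# another comment (dropped)' > "$workdir/launch_message.txt"

# Same filter as the image layer: delete lines that begin with "#" plus whitespace.
sed '/^#\s/d' "$workdir/launch_message.txt" > "$workdir/.launch_screen"
chmod 755 "$workdir/.launch_screen"

# This is what the appended `cat` line in bash.bashrc would print.
banner="$(cat "$workdir/.launch_screen")"
printf '%s\n' "$banner"
```

Only lines where `#` is immediately followed by whitespace are removed, so a line such as `#tag-without-space` passes through unchanged.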

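The final RUN layer listed above installs the KVBM wheel only when `ENABLE_KVBM` is exactly the string `true`; the other wheels are installed unconditionally. The dry-run sketch below mirrors that gate without installing anything: it builds a throwaway wheelhouse of empty placeholder files and prints which file names the glob patterns would select. The wheel file names, the `plan_wheels` helper, and the temporary directory are hypothetical; only the glob patterns and the `= "true"` comparison come from the layer itself.

```shell
#!/usr/bin/env bash
set -euo pipefail

# Hypothetical wheelhouse with empty placeholder files; the names are invented.
wheelhouse="$(mktemp -d)"
trap 'rm -rf "$wheelhouse"' EXIT
mkdir -p "$wheelhouse/nixl"
touch "$wheelhouse/ai_dynamo_runtime-0.0.0-cp312-linux_x86_64.whl" \
      "$wheelhouse/ai_dynamo-0.0.0-py3-none-any.whl" \
      "$wheelhouse/nixl/nixl-0.0.0-cp312-linux_x86_64.whl" \
      "$wheelhouse/kvbm-0.0.0-cp312-linux_x86_64.whl"

# Print the wheel file names that would be handed to `uv pip install`.
plan_wheels() {
  local enable_kvbm="$1"
  local wheels=(
    "$wheelhouse"/ai_dynamo_runtime*.whl
    "$wheelhouse"/ai_dynamo*any.whl
    "$wheelhouse"/nixl/nixl*.whl
  )
  # Same gate as the image layer: only the literal string "true" adds KVBM.
  if [ "$enable_kvbm" = "true" ]; then
    wheels+=("$wheelhouse"/kvbm*.whl)
  fi
  # Strip the directory part so only file names are printed.
  printf '%s\n' "${wheels[@]##*/}"
}

with_kvbm="$(plan_wheels true)"
without_kvbm="$(plan_wheels false)"
printf 'ENABLE_KVBM=true:\n%s\n\nENABLE_KVBM=false:\n%s\n' "$with_kvbm" "$without_kvbm"
```

Because the comparison is a plain string test, values such as `1` or `True` would skip the KVBM wheel in this sketch, just as they would in the original `if` statement.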