NVIDIA
NVIDIA
Dynamo vLLM Runtime
Container
NVIDIA
NVIDIA
Dynamo vLLM Runtime

The Dynamo vLLM runtime image is a containerized build of Dynamo + vLLM which serves as the base runtime environment for vLLM based inference with Dynamo's distributed inference framework.

LayerLabelCreated
d9e085c1fc4f9bed710a327d70464d4474f1c89fd4bc699090f76a116767085dCONFIG
Entrypoint /opt/nvidia/nvidia_entrypoint.sh; WorkingDir /workspace
12/05/2025 8:01 PM UTC
719d4bb483eacf80f6c63903049176b232581fd2bd623ff93f5c95c4cca13e0cENTRYPOINT
/opt/nvidia/nvidia_entrypoint.sh
12/05/2025 8:01 PM UTC
469fd81f29e81e4795813004943dca2ea22a144a3e4a5bf4890409a05a239bf9USER
dynamo
12/05/2025 8:01 PM UTC
44d06921caada81c30b81596aac9a13eafeaa64c45f5eec305e8de969681b70eRUN
ARCH_ALT=x86_64 PYTHON_VERSION=3.12 ENABLE_KVBM=true DYNAMO_COMMIT_SHA=41a72ab36698ce3eba0fba9af1dfad124d9e71ea chmod 755 /opt/dynamo/.launch_screen &&
  echo 'source /opt/dynamo/venv/bin/activate' >> /etc/bash.bashrc &&
  echo 'cat /opt/dynamo/.launch_screen' >> /etc/bash.bashrc
12/05/2025 8:01 PM UTC
8a5cfb8bd347c0553cf0a389d07432d11b8b69930412d2f4f7bbc6b0ce0c59a2USER
root
12/05/2025 8:01 PM UTC
b221282402247ed6ed6c5a3426bc20bc31bc7c1d8b27c4fe5944e65f3512203fRUN
ARCH_ALT=x86_64 PYTHON_VERSION=3.12 ENABLE_KVBM=true DYNAMO_COMMIT_SHA=41a72ab36698ce3eba0fba9af1dfad124d9e71ea sed '/^#\s/d' /opt/dynamo/launch_message.txt > /opt/dynamo/.launch_screen
12/05/2025 8:01 PM UTC
c40de57361f67eaa2e0552ef9c10d8c75d61726d488eb373b1e85a5913123bd3COPY
--chown=dynamo: ATTRIBUTION* LICENSE /workspace/
12/05/2025 8:01 PM UTC
d2cc5a42bca7177ad33fb9e1077b1f8c3c9cddff1d8aaa900d4742d8518cf591COPY
--chown=dynamo: . /workspace/
12/05/2025 8:01 PM UTC
aff4ad127789c6487b8cccb6b7c2f6eae7bb651a11240aae5db8241ddef8eb50RUN
ARCH_ALT=x86_64 PYTHON_VERSION=3.12 ENABLE_KVBM=true DYNAMO_COMMIT_SHA=41a72ab36698ce3eba0fba9af1dfad124d9e71ea UV_GIT_LFS=1 uv pip install --no-cache --requirement /tmp/requirements.txt --requirement /tmp/requirements.test.txt
12/05/2025 8:01 PM UTC
70b0b4d9a925b5854f7d07065232b14688de3c91982063d0654a6abffe950ecaRUN
ARCH_ALT=x86_64 PYTHON_VERSION=3.12 ENABLE_KVBM=true DYNAMO_COMMIT_SHA=41a72ab36698ce3eba0fba9af1dfad124d9e71ea uv pip install /opt/dynamo/wheelhouse/ai_dynamo_runtime*.whl /opt/dynamo/wheelhouse/ai_dynamo*any.whl /opt/dynamo/wheelhouse/nixl/nixl*.whl &&
  if [ "${ENABLE_KVBM}" = "true" ]; then uv pip install /opt/dynamo/wheelhouse/kvbm*.whl; fi &&
  cd /opt/dynamo/benchmarks &&
  UV_GIT_LFS=1 uv pip install --no-cache . &&
  cd - &&
  rm -rf /opt/dynamo/benchmarks
12/05/2025 8:01 PM UTC
...

NVIDIA uses cookies to improve your experience on our web site. We and our third-party partners also use cookies and other tools to collect and record information you provide as well as information about your interactions with our websites for performance improvement, analytics, and to assist in marketing efforts. By clicking "Accept All", you consent to our use of cookies and other tools as described in our Cookie Policy. You can manage your cookie settings by clicking on "Manage Settings." By continuing to use this site or by clicking one of the buttons below, you agree to our Terms of Service (which contains important waivers). Please see our Privacy Policy for more information on our privacy practices.