NVIDIA
NVIDIA
Triton Inference Server
Container
NVIDIA
NVIDIA
Triton Inference Server

Triton Inference Server is an open source software that lets teams deploy trained AI models from any framework, from local or cloud storage and on any GPU- or CPU-based infrastructure in the cloud, data center, or embedded devices.

LayerLabelCreated
9ff5cfd4abce8e7cd17d1e34aee1c3f1c3a031a4b0d5ebc1c7e1944381207b9fCONFIG
Entrypoint /opt/nvidia/nvidia_entrypoint.sh; WorkingDir /opt/tritonserver
05/19/2026 12:47 AM UTC
f7ae772038c5d46c52a7cc6092ece337a1e2f6352384c7cb15c707238a5c48a9LABEL
com.amazonaws.sagemaker.capabilities.multi-models=true
05/19/2026 12:47 AM UTC
b76f2fdaccf59c763c835c2fb5438452778949bbda07f9f5e41c5c4df14f9907LABEL
com.amazonaws.sagemaker.capabilities.accept-bind-to-port=true
05/19/2026 12:47 AM UTC
5351c020310a7ea094f9e035e2d25d69124d1d10cb70d878e5bb89b75b6e6a25RUN
TRITON_VERSION=2.69.0 TRITON_CONTAINER_VERSION=26.05 pip3 install -r python/openai/requirements.txt
05/19/2026 12:47 AM UTC
62b96ef0674a25f9948e2f3b01208a822089c12d314d37e4d212e18b67e6c6c7RUN
TRITON_VERSION=2.69.0 TRITON_CONTAINER_VERSION=26.05 find /opt/tritonserver/python -maxdepth 1 -type f -name "tritonserver-*.whl" | xargs -I {} pip install --upgrade {}[all] &&
  find /opt/tritonserver/python -maxdepth 1 -type f -name "tritonfrontend-*.whl" | xargs -I {} pip install --upgrade {}[all]
05/19/2026 12:47 AM UTC
bd4c5ddc8a7855e484c6ea388f54bbfd1b85dc17962a4bcb5803625853d3ffd0COPY
--chown=1000:1000 NVIDIA_Deep_Learning_Container_License.pdf .
05/19/2026 12:47 AM UTC
c03b8d4a9f78e817c27bde77a8ba0f7d8adde7cd2d30800d813f7892082399aeWORKDIR
/opt/tritonserver
05/19/2026 12:47 AM UTC
3b10f1f5dbc05a206e5d400b9696587ff5aaf3699b7df5cc06508cafd9bb3be1COPY
--chown=1000:1000 build/install tritonserver
05/19/2026 12:47 AM UTC
f3b1c17d8e9360c36a355ad1d620d201756690542c3b241d68010e2161de3528WORKDIR
/opt
05/19/2026 12:47 AM UTC
c2b6e3e00b4ac5c66b56c4bcdca8da4f7443b014657067aec3e473c8c5d8072fLABEL
com.nvidia.build.ref=84b0640223ddce28a0887ac64fb53a0d1d436c72
05/19/2026 12:47 AM UTC
...