NVIDIA
NVIDIA
Triton Inference Server
Container
NVIDIA
NVIDIA
Triton Inference Server

Triton Inference Server is an open source software that lets teams deploy trained AI models from any framework, from local or cloud storage and on any GPU- or CPU-based infrastructure in the cloud, data center, or embedded devices.

LayerLabelCreated
97f064dd42858a7dddb8c520fcf4ce0417c5f1f0d457eb70a03979b41636e826CONFIG
Entrypoint /opt/nvidia/nvidia_entrypoint.sh; WorkingDir /opt/tritonserver
01/27/2026 10:18 PM UTC
fe44531ab6802840d1d4c937432b300a882ad73c861174e68bd53e33fe9720d3LABEL
com.amazonaws.sagemaker.capabilities.multi-models=true
01/27/2026 10:18 PM UTC
9ef4436bfa2d698c72a66ad32fa7fc8079500cf2c4a9d1b47cfddaf6d607fd5eLABEL
com.amazonaws.sagemaker.capabilities.accept-bind-to-port=true
01/27/2026 10:18 PM UTC
a829813a7fec8124b5b4defd874d0466b5b8f917de392ff378b928052528564aRUN
TRITON_VERSION=2.65.0 TRITON_CONTAINER_VERSION=26.01 BUILD_PUBLIC_VLLM=false PYVER=3.12 pip3 install -r python/openai/requirements.txt
01/27/2026 10:18 PM UTC
93ade7f10cd5568a5a22654d91eec807f1b133d4946447b4ed85818d1b788460RUN
TRITON_VERSION=2.65.0 TRITON_CONTAINER_VERSION=26.01 BUILD_PUBLIC_VLLM=false PYVER=3.12 find /opt/tritonserver/python -maxdepth 1 -type f -name "tritonserver-*.whl" | xargs -I {} pip install --upgrade {}[all] &&
  find /opt/tritonserver/python -maxdepth 1 -type f -name "tritonfrontend-*.whl" | xargs -I {} pip install --upgrade {}[all]
01/27/2026 10:17 PM UTC
5084329da4e567419521e16018f72948e960f006a5dfb4a880c715390ff7bb38COPY
--chown=1000:1000 NVIDIA_Deep_Learning_Container_License.pdf .
01/27/2026 10:17 PM UTC
3fa59d04e409d458c2ab33c6d14a19a4af4de2a2b560b7a195219171b7180365WORKDIR
/opt/tritonserver
01/27/2026 10:17 PM UTC
a5ed9202594f2061f3639d3a74955769244f06a0072f1cc47e11a5b5f9127bc3COPY
--chown=1000:1000 build/install tritonserver
01/27/2026 10:17 PM UTC
5eee6fb757dbd61f330241aa810edfea9d964457acafe713ace54bdd4d4e4ba4WORKDIR
/opt
01/27/2026 10:17 PM UTC
7f99a1ab2694d7ceaa5171736268a08107f1efeee2b3e0000a351e789837e78fLABEL
com.nvidia.build.ref=c2536fda2e5c6fa09b84b08768f450a39a608761
01/27/2026 10:17 PM UTC
...