NVIDIA
NVIDIA
Triton Inference Server
Container
NVIDIA
NVIDIA
Triton Inference Server

Triton Inference Server is an open source software that lets teams deploy trained AI models from any framework, from local or cloud storage and on any GPU- or CPU-based infrastructure in the cloud, data center, or embedded devices.

LayerLabelCreated
9df4626ee0ddf265e8706ac74247589cdee8cee9dcf218cbab68054a25199310CONFIG
Entrypoint /opt/nvidia/nvidia_entrypoint.sh; WorkingDir /opt/tritonserver
10/29/2025 7:01 PM UTC
2d1888d6af91952b04f38fc5bab555b4cb63d31139603eca8e4c964667e825b6LABEL
com.amazonaws.sagemaker.capabilities.multi-models=true
10/29/2025 7:01 PM UTC
79143d50b4f896b62f410a83c896c2bc8c7e5145f3595eaf55e222f292aaa96fLABEL
com.amazonaws.sagemaker.capabilities.accept-bind-to-port=true
10/29/2025 7:01 PM UTC
1e97cf7fa26ea9744d8dbeeabb02cd6c1f34f77e6923393d81bf62ada1e3ddd5RUN
TRITON_VERSION=2.62.0 TRITON_CONTAINER_VERSION=25.10 pip3 install -r python/openai/requirements.txt
10/29/2025 7:01 PM UTC
607ec92f185326f536a5e9c2741b85846907a701313b12cdcd1caceb10ebb7a2RUN
TRITON_VERSION=2.62.0 TRITON_CONTAINER_VERSION=25.10 find /opt/tritonserver/python -maxdepth 1 -type f -name "tritonserver-*.whl" | xargs -I {} pip install --upgrade {}[all] &&
  find /opt/tritonserver/python -maxdepth 1 -type f -name "tritonfrontend-*.whl" | xargs -I {} pip install --upgrade {}[all]
10/29/2025 7:01 PM UTC
4c1785d556b38a4277e86356333394692e37e7be4d785679eba907b7e7c52138COPY
--chown=1000:1000 NVIDIA_Deep_Learning_Container_License.pdf .
10/29/2025 7:01 PM UTC
6a03e0a18e24171ba75a1f4dbde2214ea1c190ab58277077e84564143ff5166bWORKDIR
/opt/tritonserver
10/29/2025 7:01 PM UTC
b3894fff53df43aae52c4a743c04768fa57d049117de7e059e73423877a25de4COPY
--chown=1000:1000 build/install tritonserver
10/29/2025 7:01 PM UTC
39541ca0dc21e55f1044e6aab38293f52802cc9f645cab85505dba77ffe43972WORKDIR
/opt
10/29/2025 7:01 PM UTC
fbe2b3bfe7a3bc8c15b0c2e67cf3bea286447f7f2518bc46f31646c6e7eb494eLABEL
com.nvidia.build.ref=374194f9f494599a028053e9d3f6771b7e37ef76
10/29/2025 7:01 PM UTC
...