NVIDIA
Triton Inference Server
Container

Triton Inference Server is open-source software that lets teams deploy trained AI models from any framework, loaded from local or cloud storage, on any GPU- or CPU-based infrastructure in the cloud, in the data center, or on embedded devices.

Layer: 7c91ea22b8d85781779467729e9b17638bfc9b411f8060dc20c395df09417ef7 (CONFIG)
Label: Entrypoint /opt/nvidia/nvidia_entrypoint.sh; WorkingDir /opt/tritonserver
Created: 08/22/2025 1:14 AM UTC

Layer: b789d67b2a670b67e86d9581bd8f1f0dedda724b3a30dd67d5e34e4ceca3b2b8 (LABEL)
Label: com.amazonaws.sagemaker.capabilities.multi-models=true
Created: 08/22/2025 1:14 AM UTC

Layer: 82fafa386ad8183e16a921e815e9920fa2e6a0f011834d6fb512060219d4b533 (LABEL)
Label: com.amazonaws.sagemaker.capabilities.accept-bind-to-port=true
Created: 08/22/2025 1:14 AM UTC

Layer: bb3aa6eb1c4cfc155abcdafe70f80565d19eef01582f47e4ff7330af2fffbc9a (RUN)
Label: TRITON_VERSION=2.60.0 TRITON_CONTAINER_VERSION=25.08 BUILD_PUBLIC_VLLM=false PYVER=3.12 pip3 install -r python/openai/requirements.txt
Created: 08/22/2025 1:14 AM UTC

Layer: 1dc8072bbcaa2d86a7a65eadbb09d028f6c277b1399345effe9bda435ad4717e (RUN)
Label: TRITON_VERSION=2.60.0 TRITON_CONTAINER_VERSION=25.08 BUILD_PUBLIC_VLLM=false PYVER=3.12 find /opt/tritonserver/python -maxdepth 1 -type f -name "tritonserver-*.whl" | xargs -I {} pip install --upgrade {}[all] &&
  find /opt/tritonserver/python -maxdepth 1 -type f -name "tritonfrontend-*.whl" | xargs -I {} pip install --upgrade {}[all]
Created: 08/22/2025 1:14 AM UTC

Layer: 2d328b58a9b3424d50397cb7d2a9a61c6bac366589b97e69044b515cc69d3efe (COPY)
Label: --chown=1000:1000 NVIDIA_Deep_Learning_Container_License.pdf .
Created: 08/22/2025 1:14 AM UTC

Layer: 98901c76e09c960071c6664d9374fd98a36caec38493c213f39d389b353eb0e5 (WORKDIR)
Label: /opt/tritonserver
Created: 08/22/2025 1:14 AM UTC

Layer: 3c3b13ca746603ac052d29f2336500c2aafa7bb5ebf2e50f6602b35324824ebd (COPY)
Label: --chown=1000:1000 build/install tritonserver
Created: 08/22/2025 1:14 AM UTC

Layer: 0e6365c816b096861d61ae463c90d0af5c1342d55b8a414dec536707f4c8a8d2 (WORKDIR)
Label: /opt
Created: 08/22/2025 1:14 AM UTC

Layer: 70b0ecd260eed09707b3a9d9e9b63831233abe86f02431a0d910cb2608a50fab (LABEL)
Label: com.nvidia.build.ref=8ced3b40794a0aed058b85333c4c4bb638de5476
Created: 08/22/2025 1:14 AM UTC
...
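
The RUN entries above begin with a run of KEY=VALUE tokens (TRITON_VERSION, TRITON_CONTAINER_VERSION, BUILD_PUBLIC_VLLM, PYVER) followed by the actual shell command. These appear to be build arguments recorded with the layer, though the listing does not say so explicitly. The following is a minimal Python sketch for separating the two parts when reading such entries. The function name `split_build_args` is hypothetical, and the sketch assumes that values contain no whitespace, which holds for the entries shown here but is not guaranteed in general.

```python
import re

# One leading KEY=VALUE token followed by whitespace.
# Assumption: values never contain spaces or quotes.
_ASSIGNMENT = re.compile(r"([A-Za-z_][A-Za-z0-9_]*)=(\S*)\s+")


def split_build_args(layer_command):
    """Return (build_args, command) for a RUN entry from the layer listing."""
    build_args = {}
    pos = 0
    while True:
        match = _ASSIGNMENT.match(layer_command, pos)
        if match is None:
            break  # first token that is not KEY=VALUE starts the command
        build_args[match.group(1)] = match.group(2)
        pos = match.end()
    return build_args, layer_command[pos:]


# RUN entry copied from the listing above.
run_layer = (
    "TRITON_VERSION=2.60.0 TRITON_CONTAINER_VERSION=25.08 "
    "BUILD_PUBLIC_VLLM=false PYVER=3.12 "
    "pip3 install -r python/openai/requirements.txt"
)

args, command = split_build_args(run_layer)
print(args)
print(command)
```

Only the leading tokens are treated as assignments, so an `=` that appears later in the command (for example inside a flag) is left untouched.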
