NVIDIA
NVIDIA
Triton Inference Server
Container
NVIDIA
NVIDIA
Triton Inference Server

Triton Inference Server is an open source software that lets teams deploy trained AI models from any framework, from local or cloud storage and on any GPU- or CPU-based infrastructure in the cloud, data center, or embedded devices.

LayerLabelCreated
ae418f852dceb3a9bb5d9b2aba4e66bab7e80668577813e335c937429828d308CONFIG
Entrypoint /opt/nvidia/nvidia_entrypoint.sh; WorkingDir /opt/tritonserver
12/19/2025 1:12 AM UTC
1332e13549669b4010d7755a63d10f2740ffb6043440f1523dabdc54aad479e0LABEL
com.amazonaws.sagemaker.capabilities.multi-models=true
12/19/2025 1:12 AM UTC
51e7e6ba72bf86125fbf4ea5631960338e664b5cd3cfe1e116b4643005732d7dLABEL
com.amazonaws.sagemaker.capabilities.accept-bind-to-port=true
12/19/2025 1:12 AM UTC
f73d3f9e4c007374fe1e76635071ef19f8acc126f5c099830b50d1f959e5a4b7RUN
TRITON_VERSION=2.64.0 TRITON_CONTAINER_VERSION=25.12 pip3 install -r python/openai/requirements.txt
12/19/2025 1:12 AM UTC
cc9c1df469024179fae3f7bf3ac51569886924d48ee3ae98b13283a82a96b92eRUN
TRITON_VERSION=2.64.0 TRITON_CONTAINER_VERSION=25.12 find /opt/tritonserver/python -maxdepth 1 -type f -name "tritonserver-*.whl" | xargs -I {} pip install --upgrade {}[all] &&
  find /opt/tritonserver/python -maxdepth 1 -type f -name "tritonfrontend-*.whl" | xargs -I {} pip install --upgrade {}[all]
12/19/2025 1:12 AM UTC
87da3f470a85cf8d519b7473b5d4a2cd6e5e4d39ed21141239a73559618ad8e1COPY
--chown=1000:1000 NVIDIA_Deep_Learning_Container_License.pdf .
12/19/2025 1:11 AM UTC
70a1ed4b6f82f88df8fdca2a817cb3e20ae662b67cd41b06ba408735bdfae10aWORKDIR
/opt/tritonserver
12/19/2025 1:11 AM UTC
3bf66fee4e2384e8881e4b45fe97a28064401370a53e4ea616eab4795990a91aCOPY
--chown=1000:1000 build/install tritonserver
12/19/2025 1:11 AM UTC
4e5d21c069a6c79433e288944e46ab9b74207fed1b64879fca6da62c23ca8d21WORKDIR
/opt
12/19/2025 1:11 AM UTC
f41173d359f345901e848917dfe467a6af8a6d492eebffa8f7fc71e6fb556d0cLABEL
com.nvidia.build.ref=9bf730f9c9c3d98fa3d2fd5ef82f0aa5bd7c5d45
12/19/2025 1:11 AM UTC
...

NVIDIA uses cookies to improve your experience on our web site. We and our third-party partners also use cookies and other tools to collect and record information you provide as well as information about your interactions with our websites for performance improvement, analytics, and to assist in marketing efforts. By clicking "Accept All", you consent to our use of cookies and other tools as described in our Cookie Policy. You can manage your cookie settings by clicking on "Manage Settings." By continuing to use this site or by clicking one of the buttons below, you agree to our Terms of Service (which contains important waivers). Please see our Privacy Policy for more information on our privacy practices.