NVIDIA
NVIDIA
Triton Inference Server
Container
NVIDIA
NVIDIA
Triton Inference Server

Triton Inference Server is an open source software that lets teams deploy trained AI models from any framework, from local or cloud storage and on any GPU- or CPU-based infrastructure in the cloud, data center, or embedded devices.

LayerLabelCreated
8a33a2034a53442ff717bf64ce4a5752ccaa3d6db7978f7a1d712308106988dcCONFIG
Entrypoint /opt/nvidia/nvidia_entrypoint.sh; WorkingDir /opt/tritonserver
11/20/2025 9:36 PM UTC
ddf323155a21a0c77e0cea99376c97955d7ac2c2335345871e7292357381130bLABEL
com.amazonaws.sagemaker.capabilities.multi-models=true
11/20/2025 9:36 PM UTC
119df85807db09186fc5bd70113a40958bb01b9bd830c40bedf2bfce90614b24LABEL
com.amazonaws.sagemaker.capabilities.accept-bind-to-port=true
11/20/2025 9:36 PM UTC
2fc3e342e35596e12e6c718f44275b603d05eff2701b6bc33d3bc1e570c275daRUN
TRITON_VERSION=2.63.0 TRITON_CONTAINER_VERSION=25.11 pip3 install -r python/openai/requirements.txt
11/20/2025 9:36 PM UTC
0715b254c770054cf013c93b0db9e2b005783563646047977abec5aaf446d529RUN
TRITON_VERSION=2.63.0 TRITON_CONTAINER_VERSION=25.11 find /opt/tritonserver/python -maxdepth 1 -type f -name "tritonserver-*.whl" | xargs -I {} pip install --upgrade {}[all] &&
  find /opt/tritonserver/python -maxdepth 1 -type f -name "tritonfrontend-*.whl" | xargs -I {} pip install --upgrade {}[all]
11/20/2025 9:36 PM UTC
cf85614bcc459c7931d87404ad971767d065fd36999bafc49feb337051f27b29COPY
--chown=1000:1000 NVIDIA_Deep_Learning_Container_License.pdf .
11/20/2025 9:36 PM UTC
ef0ad7ed712d3e804c7dafa99b9b58a7acbffc299f340a516640f2a2a49410fbWORKDIR
/opt/tritonserver
11/20/2025 9:36 PM UTC
661f0a8d2c5e2beab921eed4fa3cd7e4916bbf20e13a50c7879a704fa630fdfcCOPY
--chown=1000:1000 build/install tritonserver
11/20/2025 9:36 PM UTC
4fb2da5c525018ba4da6bf26c9b88dae24f0531a7240e8e920482b9a4477899bWORKDIR
/opt
11/20/2025 9:36 PM UTC
8b3b2ea73bd95213b35dcc8a4c5f0ac1621ba7a68f58df030f4dccbcca201ff3LABEL
com.nvidia.build.ref=cc221d499acd606668317ca89e5a056a30ec4c90
11/20/2025 9:36 PM UTC
...

NVIDIA uses cookies to improve your experience on our web site. We and our third-party partners also use cookies and other tools to collect and record information you provide as well as information about your interactions with our websites for performance improvement, analytics, and to assist in marketing efforts. By clicking "Accept All", you consent to our use of cookies and other tools as described in our Cookie Policy. You can manage your cookie settings by clicking on "Manage Settings." By continuing to use this site or by clicking one of the buttons below, you agree to our Terms of Service (which contains important waivers). Please see our Privacy Policy for more information on our privacy practices.