NVIDIA
NVIDIA
Triton Inference Server
Container
NVIDIA
NVIDIA
Triton Inference Server

Triton Inference Server is an open source software that lets teams deploy trained AI models from any framework, from local or cloud storage and on any GPU- or CPU-based infrastructure in the cloud, data center, or embedded devices.

LayerLabelCreated
14c1b8fc874ead5bc1bbed52d5007f9b6333151dfbb7e7bfb07d9e5111b76523CONFIG
Entrypoint /opt/nvidia/nvidia_entrypoint.sh; WorkingDir /opt/tritonserver
12/19/2025 12:58 AM UTC
489bfc24eb87df991d8255085e9be4d30c5114468974e453247dbb89747ad019LABEL
com.amazonaws.sagemaker.capabilities.multi-models=true
12/19/2025 12:58 AM UTC
ba4502c57bba2a8ed8c56f5c6f8d034a10d6604f0b18267ce5aa366147dd4234LABEL
com.amazonaws.sagemaker.capabilities.accept-bind-to-port=true
12/19/2025 12:58 AM UTC
58b61a4bf06d591c9902762780053c7ab60b226620f295214b9892e7153f5863RUN
TRITON_VERSION=2.64.0 TRITON_CONTAINER_VERSION=25.12 BUILD_PUBLIC_VLLM=false PYVER=3.12 pip3 install -r python/openai/requirements.txt
12/19/2025 12:58 AM UTC
0a592cbbd7446a7a4f77045dd7ce0a351a793c6fc74e8450713e5c962f5b846aRUN
TRITON_VERSION=2.64.0 TRITON_CONTAINER_VERSION=25.12 BUILD_PUBLIC_VLLM=false PYVER=3.12 find /opt/tritonserver/python -maxdepth 1 -type f -name "tritonserver-*.whl" | xargs -I {} pip install --upgrade {}[all] &&
  find /opt/tritonserver/python -maxdepth 1 -type f -name "tritonfrontend-*.whl" | xargs -I {} pip install --upgrade {}[all]
12/19/2025 12:58 AM UTC
55d7f504f5e627b41c8c3895628e1a9bab0b43933b253e294d9f6f1fbf599975COPY
--chown=1000:1000 NVIDIA_Deep_Learning_Container_License.pdf .
12/19/2025 12:58 AM UTC
e260b6efd9d8a214408eff3361c297025817b7eaabff6ae1d5031e6d7b288cd2WORKDIR
/opt/tritonserver
12/19/2025 12:58 AM UTC
616271856a723179c973e0a4f235bc585ee9d4c5b0ac3db573a48c3ee36934eeCOPY
--chown=1000:1000 build/install tritonserver
12/19/2025 12:58 AM UTC
5fdf88ae0e1ae5d2ce560bf0932d43bd1afb792219a66b830f1b645147dd45faWORKDIR
/opt
12/19/2025 12:58 AM UTC
3a8d427c8d63967ea517483fec394b22812330648c379c41e894c6f349653c1dLABEL
com.nvidia.build.ref=9bf730f9c9c3d98fa3d2fd5ef82f0aa5bd7c5d45
12/19/2025 12:58 AM UTC
...

NVIDIA uses cookies to improve your experience on our web site. We and our third-party partners also use cookies and other tools to collect and record information you provide as well as information about your interactions with our websites for performance improvement, analytics, and to assist in marketing efforts. By clicking "Accept All", you consent to our use of cookies and other tools as described in our Cookie Policy. You can manage your cookie settings by clicking on "Manage Settings." By continuing to use this site or by clicking one of the buttons below, you agree to our Terms of Service (which contains important waivers). Please see our Privacy Policy for more information on our privacy practices.