NVIDIA
NVIDIA
Triton Inference Server
Container
NVIDIA
NVIDIA
Triton Inference Server

Triton Inference Server is an open source software that lets teams deploy trained AI models from any framework, from local or cloud storage and on any GPU- or CPU-based infrastructure in the cloud, data center, or embedded devices.

LayerLabelCreated
427c7fe5f028c8f20b813b4aef1651728fe7f26582671e672622653b5643765cCONFIG
Entrypoint /opt/nvidia/nvidia_entrypoint.sh; WorkingDir /opt/tritonserver
02/22/2026 12:56 AM UTC
3b04d218cfc9050b49c4d592e45d99b6f5a045a6bf943747a44d6c3c10e66cb0LABEL
com.amazonaws.sagemaker.capabilities.multi-models=true
02/22/2026 12:56 AM UTC
28b3a29f8c305bfe1afa242fcfb7d37e67c16987000d0ec5c3d851a35541e671LABEL
com.amazonaws.sagemaker.capabilities.accept-bind-to-port=true
02/22/2026 12:56 AM UTC
ef7e2dda8d9394db2deac3cd6eece369607d41f804562a860737f2b2065b0f1aRUN
TRITON_VERSION=2.66.0 TRITON_CONTAINER_VERSION=26.02 BUILD_PUBLIC_VLLM=false PYVER=3.12 pip3 install -r python/openai/requirements.txt
02/22/2026 12:56 AM UTC
61a7c8cd6eb993841adbad908ea79cb6f645b192ec327fa4ee2fff538239c8d2RUN
TRITON_VERSION=2.66.0 TRITON_CONTAINER_VERSION=26.02 BUILD_PUBLIC_VLLM=false PYVER=3.12 find /opt/tritonserver/python -maxdepth 1 -type f -name "tritonserver-*.whl" | xargs -I {} pip install --upgrade {}[all] &&
  find /opt/tritonserver/python -maxdepth 1 -type f -name "tritonfrontend-*.whl" | xargs -I {} pip install --upgrade {}[all]
02/22/2026 12:56 AM UTC
428a39337fa049c1a563f284c2d08b4145f2a7b6aba4cb8a3ff63263efcf21cfCOPY
--chown=1000:1000 NVIDIA_Deep_Learning_Container_License.pdf .
02/22/2026 12:55 AM UTC
66f095ea819b45b1c05455a4948ee534cf72860b246befae923ad16f83ff8ac4WORKDIR
/opt/tritonserver
02/22/2026 12:55 AM UTC
3931c58341724ffeab09c25baa216ea4f4ebaa4a4e113cc3fa214f573d564427COPY
--chown=1000:1000 build/install tritonserver
02/22/2026 12:55 AM UTC
905274211baa44f57da86801e51a2885c89dd48e37644c1be0e1aa3cf13285bcWORKDIR
/opt
02/22/2026 12:55 AM UTC
345cb0c8f7213761bbee58a027ac9aa90a34d1f0a3e33d1416f7ac082b9187e2LABEL
com.nvidia.build.ref=cb8966de0709711b5a5bec629eddda13e61287c8
02/22/2026 12:55 AM UTC
...

NVIDIA uses cookies to improve your experience on our web site. We and our third-party partners also use cookies and other tools to collect and record information you provide as well as information about your interactions with our websites for performance improvement, analytics, and to assist in marketing efforts. By clicking "Accept All", you consent to our use of cookies and other tools as described in our Cookie Policy. You can manage your cookie settings by clicking on "Manage Settings." By continuing to use this site or by clicking one of the buttons below, you agree to our Terms of Service (which contains important waivers). Please see our Privacy Policy for more information on our privacy practices.