NVIDIA
Triton Inference Server
Container

Triton Inference Server is open-source software that lets teams deploy trained AI models from any framework, loaded from local or cloud storage, on any GPU- or CPU-based infrastructure in the cloud, in the data center, or on embedded devices.

| Layer | Label | Created |
| --- | --- | --- |
| d60636f7b43adc7d1ac9fc488a570a8bb8c1470dddd9bcc64cdf150be956e34f | CONFIG: Entrypoint /opt/nvidia/nvidia_entrypoint.sh; WorkingDir /opt/tritonserver | 05/19/2026 1:19 AM UTC |
| 2e31448355cbc021498e3f90d700676043ed09eadc91a63f0ab192212cc4dbe6 | LABEL: com.amazonaws.sagemaker.capabilities.accept-bind-to-port=true | 05/19/2026 1:19 AM UTC |
| 4f1735e1590d5e299602a589aaae9cc26fc4569a0108c6e7e6d5c3d427e6fbdf | RUN: TRITON_VERSION=2.69.0 TRITON_CONTAINER_VERSION=26.05 chown triton-server:triton-server /opt/tritonserver/caches | 05/19/2026 1:19 AM UTC |
| a03604749802c7bea2b26799c943658ba9a7f9e4f8a285225a1051a2a6887bfc | COPY: --chown=1000:1000 /opt/tritonserver/caches/local /opt/tritonserver/caches/local | 05/19/2026 1:19 AM UTC |
| db1a77e75d08ea25c80767a4fffecaff4880ab8dc7840ce23baacbdbe1638e23 | RUN: TRITON_VERSION=2.69.0 TRITON_CONTAINER_VERSION=26.05 chown triton-server:triton-server /opt/tritonserver/repoagents | 05/19/2026 1:19 AM UTC |
| 7ce92731117cb4d29b2dbb2e78900903e9f7acf41682c5dd802857eadc44781a | COPY: --chown=1000:1000 /opt/tritonserver/repoagents/checksum /opt/tritonserver/repoagents/checksum | 05/19/2026 1:19 AM UTC |
| b0c87a717f26a26c7aee5ca0e3e016f14f2c7c6a3ebe0c516ba2cb45a2a9e30b | RUN: TRITON_VERSION=2.69.0 TRITON_CONTAINER_VERSION=26.05 chown triton-server:triton-server /opt/tritonserver/backends | 05/19/2026 1:19 AM UTC |
| 363d288c23796fb05382f3dcdb02cde0d9e3d441558ab5951612af773aed5fcb | COPY: --chown=1000:1000 /opt/tritonserver/backends/python /opt/tritonserver/backends/python | 05/19/2026 1:19 AM UTC |
| 5488808b6306f3dd4467021c3331d7c8624751baaf4b1d022905d7da5ff1e0c1 | COPY: --chown=1000:1000 /opt/tritonserver/backends/identity /opt/tritonserver/backends/identity | 05/19/2026 1:19 AM UTC |
| 31aa7449fd60562513443ad91c4addd73ba926b93a7dec8f560a6022b31e4e8c | COPY: --chown=1000:1000 /opt/tritonserver/backends/pytorch /opt/tritonserver/backends/pytorch | 05/19/2026 1:19 AM UTC |
...
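
As a rough aid for reading the layer listing, the sketch below groups the COPY destinations by the directory they sit in under /opt/tritonserver. The names `COPY_DESTINATIONS` and `group_components` are hypothetical and introduced only for illustration. The paths are transcribed from the visible rows, and the listing is truncated, so the result is partial and may not reflect everything the image contains.

```python
from collections import defaultdict
from pathlib import PurePosixPath

# COPY destinations transcribed from the visible layer rows.
# The listing is truncated, so this list is incomplete by construction.
COPY_DESTINATIONS = [
    "/opt/tritonserver/caches/local",
    "/opt/tritonserver/repoagents/checksum",
    "/opt/tritonserver/backends/python",
    "/opt/tritonserver/backends/identity",
    "/opt/tritonserver/backends/pytorch",
]


def group_components(paths, root="/opt/tritonserver"):
    """Group component names by the directory directly under `root`."""
    groups = defaultdict(list)
    for raw in paths:
        # e.g. "backends/pytorch" -> kind="backends", name="pytorch"
        rel = PurePosixPath(raw).relative_to(root)
        kind, name = rel.parts[0], rel.parts[1]
        groups[kind].append(name)
    return dict(groups)


components = group_components(COPY_DESTINATIONS)
for kind, names in components.items():
    print(f"{kind}: {', '.join(names)}")
```

Read this way, the visible layers add one cache implementation, one repository agent, and three backends, each preceded by a RUN layer that sets ownership of the parent directory to triton-server.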
