NVIDIA Triton Inference Server
Container

Triton Inference Server is open-source software that lets teams deploy trained AI models from any framework, loaded from local or cloud storage, on any GPU- or CPU-based infrastructure: in the cloud, in the data center, or on embedded devices.

| Layer | Type | Label | Created |
| --- | --- | --- | --- |
| 7e42145557cd3af8e06d325f6efa0f2e1e5509d017437dd7b129c3a963f9a057 | CONFIG | Entrypoint /opt/nvidia/nvidia_entrypoint.sh; WorkingDir /opt/tritonserver | 08/22/2025 1:39 AM UTC |
| 380855ed977d143474ce30f92123c15bea02585405516d6e122ad2e6698a2f34 | LABEL | com.amazonaws.sagemaker.capabilities.accept-bind-to-port=true | 08/22/2025 1:39 AM UTC |
| 3bdfc116d2592e16691e4df7016d66e8ffdfee2bd64f220ae8020d738d7e4fb9 | RUN | TRITON_VERSION=2.60.0 TRITON_CONTAINER_VERSION=25.08 chown triton-server:triton-server /opt/tritonserver/caches | 08/22/2025 1:39 AM UTC |
| 29c6f12675605a9254ba1a37358bf097ed195b74e138a9459da52df99aac8799 | COPY | --chown=1000:1000 /opt/tritonserver/caches/local /opt/tritonserver/caches/local | 08/22/2025 1:39 AM UTC |
| 96d3f9fa7f3a7532cf46f967c0139ab39d2b7a29c8b3467a94ed4925876dfdbd | RUN | TRITON_VERSION=2.60.0 TRITON_CONTAINER_VERSION=25.08 chown triton-server:triton-server /opt/tritonserver/repoagents | 08/22/2025 1:39 AM UTC |
| fa1882485f06bc5a102b89eb675bf9e0c84a0ad455df8437b3f5ae4739b55417 | COPY | --chown=1000:1000 /opt/tritonserver/repoagents/checksum /opt/tritonserver/repoagents/checksum | 08/22/2025 1:39 AM UTC |
| 6bbff6210b856a289c5e73b024d5a6c3ec43240b0631a6232b78f78dbf69c97c | RUN | TRITON_VERSION=2.60.0 TRITON_CONTAINER_VERSION=25.08 chown triton-server:triton-server /opt/tritonserver/backends | 08/22/2025 1:39 AM UTC |
| 57c38d082d5f0a11bdce611b812952fb4679e7ef479f5ce8fa145cac74dc4208 | COPY | --chown=1000:1000 /opt/tritonserver/backends/python /opt/tritonserver/backends/python | 08/22/2025 1:39 AM UTC |
| 85cc10af61217334ee552c7189397c5c81f1ded32a9f41f75e6426a42855e3e6 | COPY | --chown=1000:1000 /opt/tritonserver/backends/identity /opt/tritonserver/backends/identity | 08/22/2025 1:39 AM UTC |
| 41dad37cd6255ed8b91e24e41d534bd5019e962b0c794d6033d97f3e252ee56c | COPY | --chown=1000:1000 /opt/tritonserver/backends/pytorch /opt/tritonserver/backends/pytorch | 08/22/2025 1:39 AM UTC |
...
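
Layer listings like this one are sometimes copied with the instruction type fused directly onto the digest (for example `<digest>CONFIG`), and the RUN entries carry build variables such as `TRITON_VERSION` ahead of the actual command. The following is a minimal Python sketch for pulling those pieces apart. It assumes each layer ID is a 64-character lowercase hex digest followed by an uppercase type, which matches the rows shown here but is not a documented format. The names `LayerId`, `split_layer_id`, and `parse_build_vars` are hypothetical helpers introduced for illustration only.

```python
import re
from dataclasses import dataclass
from typing import Dict

# Assumption: a layer ID is a 64-char lowercase hex digest, optionally
# followed by an uppercase instruction type (CONFIG, LABEL, RUN, COPY).
_LAYER_RE = re.compile(r"^([0-9a-f]{64})([A-Z]+)$")
_VAR_NAME_RE = re.compile(r"^[A-Z_][A-Z0-9_]*$")


@dataclass(frozen=True)
class LayerId:
    digest: str  # 64-character hex digest
    kind: str    # instruction type, e.g. "RUN"


def split_layer_id(fused: str) -> LayerId:
    """Split a fused '<digest><TYPE>' string into its two parts."""
    match = _LAYER_RE.match(fused.strip())
    if match is None:
        raise ValueError(f"not a fused layer ID: {fused!r}")
    return LayerId(digest=match.group(1), kind=match.group(2))


def parse_build_vars(label: str) -> Dict[str, str]:
    """Collect the leading KEY=value tokens of a RUN label.

    Stops at the first token that is not an uppercase KEY=value pair,
    so the command itself (e.g. 'chown ...') is never scanned.
    """
    found: Dict[str, str] = {}
    for token in label.split():
        key, sep, value = token.partition("=")
        if not sep or not _VAR_NAME_RE.match(key):
            break
        found[key] = value
    return found


if __name__ == "__main__":
    layer = split_layer_id(
        "3bdfc116d2592e16691e4df7016d66e8ffdfee2bd64f220ae8020d738d7e4fb9RUN"
    )
    print(layer.kind, layer.digest[:12])
    print(parse_build_vars(
        "TRITON_VERSION=2.60.0 TRITON_CONTAINER_VERSION=25.08 "
        "chown triton-server:triton-server /opt/tritonserver/caches"
    ))
```

The parser stops at the first non-variable token on purpose, so arguments such as `--chown=1000:1000` in COPY rows are not mistaken for build variables.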
