NVIDIA
NVIDIA
Triton Inference Server
Container
NVIDIA
NVIDIA
Triton Inference Server

Triton Inference Server is an open source software that lets teams deploy trained AI models from any framework, from local or cloud storage and on any GPU- or CPU-based infrastructure in the cloud, data center, or embedded devices.

LayerLabelCreated
07300861967c9651741986ec494fd36300fdfe92968948f57593073b9cea662eCONFIG
Entrypoint /opt/nvidia/nvidia_entrypoint.sh; WorkingDir /opt/tritonserver
10/29/2025 7:21 PM UTC
746e97cbeff0c2b7f3b7329f4028ed32135167c00486704f9f2de130a30a8af4LABEL
com.amazonaws.sagemaker.capabilities.accept-bind-to-port=true
10/29/2025 7:21 PM UTC
ae89f83be379b9c8249abba95d2a68bb74824033b7af1228c8b7011b578d41c7RUN
TRITON_VERSION=2.62.0 TRITON_CONTAINER_VERSION=25.10 chown triton-server:triton-server /opt/tritonserver/caches
10/29/2025 7:21 PM UTC
a12474383c317fbf9121151f1d12bb8e610a00db3167ed94ecfe2e0509700403COPY
--chown=1000:1000 /opt/tritonserver/caches/local /opt/tritonserver/caches/local
10/29/2025 7:20 PM UTC
77626d26d18d5709645985bd1b5bd2efecfbaca9678f8ea3f7ad76a22c200679RUN
TRITON_VERSION=2.62.0 TRITON_CONTAINER_VERSION=25.10 chown triton-server:triton-server /opt/tritonserver/repoagents
10/29/2025 7:20 PM UTC
9b07862b745af17f49dfc3093fc400ef1a0acb35fcf23031b99af45fd68b9aafCOPY
--chown=1000:1000 /opt/tritonserver/repoagents/checksum /opt/tritonserver/repoagents/checksum
10/29/2025 7:20 PM UTC
431583b442f01ef2bf7626639b52da8a02261c7eee551ce34b28b8535d6585f5RUN
TRITON_VERSION=2.62.0 TRITON_CONTAINER_VERSION=25.10 chown triton-server:triton-server /opt/tritonserver/backends
10/29/2025 7:20 PM UTC
95632112f0e8d45217ba74c6a1ce55ed897ddfd7162fd9a565c224a6849b9c78COPY
--chown=1000:1000 /opt/tritonserver/backends/python /opt/tritonserver/backends/python
10/29/2025 7:20 PM UTC
21c3f53b133c62854bafa125df9810ec38e56c7e77fd344290a174bf4df338e1COPY
--chown=1000:1000 /opt/tritonserver/backends/identity /opt/tritonserver/backends/identity
10/29/2025 7:20 PM UTC
eac22844cdfd07551a83f259a9fa27d0663f435dd0aab1698d8639d7f772fdd8COPY
--chown=1000:1000 /opt/tritonserver/backends/pytorch /opt/tritonserver/backends/pytorch
10/29/2025 7:20 PM UTC
...

NVIDIA uses cookies to improve your experience on our web site. We and our third-party partners also use cookies and other tools to collect and record information you provide as well as information about your interactions with our websites for performance improvement, analytics, and to assist in marketing efforts. By clicking "Accept All", you consent to our use of cookies and other tools as described in our Cookie Policy. You can manage your cookie settings by clicking on "Manage Settings." By continuing to use this site or by clicking one of the buttons below, you agree to our Terms of Service (which contains important waivers). Please see our Privacy Policy for more information on our privacy practices.