NVIDIA
NVIDIA
Triton Inference Server
Container
NVIDIA
NVIDIA
Triton Inference Server

Triton Inference Server is an open source software that lets teams deploy trained AI models from any framework, from local or cloud storage and on any GPU- or CPU-based infrastructure in the cloud, data center, or embedded devices.

LayerLabelCreated
4008f727f16725ab433eaa90eb5aa033f384f1a918b421b4539ff59f8c3fb00fCONFIG
Entrypoint /opt/nvidia/nvidia_entrypoint.sh; WorkingDir /opt/tritonserver
10/03/2025 6:01 PM UTC
3bd2f8e2a0305b9bb23536c86428c91a20a7d0d07c17778ab5b27b597950c122LABEL
com.amazonaws.sagemaker.capabilities.accept-bind-to-port=true
10/03/2025 6:01 PM UTC
76199d9400eaac10837138fe2975e3df2293b3a6ccf4fbb6f9a445a5d39b0b5eRUN
TRITON_VERSION=2.61.0 TRITON_CONTAINER_VERSION=25.09 chown triton-server:triton-server /opt/tritonserver/caches
10/03/2025 6:01 PM UTC
01685d50b117ebe4cc423edc92e7efe9d309bb18ea9b76ca33c233388c49b256COPY
--chown=1000:1000 /opt/tritonserver/caches/local /opt/tritonserver/caches/local
10/03/2025 6:01 PM UTC
08a921224a160cef86c3633d0e749020e4a18241ad1ebb05bfeb44a22199fb23RUN
TRITON_VERSION=2.61.0 TRITON_CONTAINER_VERSION=25.09 chown triton-server:triton-server /opt/tritonserver/repoagents
10/03/2025 6:01 PM UTC
8a8e8170cc3960e0128ac52923cebf5a315a1d88d28beb308e571b7835e81d83COPY
--chown=1000:1000 /opt/tritonserver/repoagents/checksum /opt/tritonserver/repoagents/checksum
10/03/2025 6:01 PM UTC
a6e6597fe37c22e151715abedb64420a7f7d38eeee11c3040008a8b04d06b2f4RUN
TRITON_VERSION=2.61.0 TRITON_CONTAINER_VERSION=25.09 chown triton-server:triton-server /opt/tritonserver/backends
10/03/2025 6:01 PM UTC
f777309e11b63917d51112357946bf82feef41db565e17e025d883471b1f55ecCOPY
--chown=1000:1000 /opt/tritonserver/backends/python /opt/tritonserver/backends/python
10/03/2025 6:01 PM UTC
8e4c9d988dc8e45ec026f8a6d26eda1e6c534538b6bb288fc9f6aee7d83212a5COPY
--chown=1000:1000 /opt/tritonserver/backends/identity /opt/tritonserver/backends/identity
10/03/2025 6:01 PM UTC
44066d384d27fb317a9db05d597d44d4f5db25282261864e06bb48ceccff44c5COPY
--chown=1000:1000 /opt/tritonserver/backends/pytorch /opt/tritonserver/backends/pytorch
10/03/2025 6:01 PM UTC
...

NVIDIA uses cookies to improve your experience on our web site. We and our third-party partners also use cookies and other tools to collect and record information you provide as well as information about your interactions with our websites for performance improvement, analytics, and to assist in marketing efforts. By clicking "Accept All", you consent to our use of cookies and other tools as described in our Cookie Policy. You can manage your cookie settings by clicking on "Manage Settings." By continuing to use this site or by clicking one of the buttons below, you agree to our Terms of Service (which contains important waivers). Please see our Privacy Policy for more information on our privacy practices.