NVIDIA
NVIDIA
Triton Inference Server
Container
NVIDIA
NVIDIA
Triton Inference Server

Triton Inference Server is an open source software that lets teams deploy trained AI models from any framework, from local or cloud storage and on any GPU- or CPU-based infrastructure in the cloud, data center, or embedded devices.

LayerLabelCreated
6b7f66be6020315633c3222e9f90c38ddd7ab96cdf83b96836837adfd30a18e2CONFIG
Entrypoint /opt/nvidia/nvidia_entrypoint.sh; WorkingDir /opt/tritonserver
12/19/2025 1:30 AM UTC
d74ab9978fd11e309676112cf6d02902469836cfefb4f201f7a326135ce35649LABEL
com.amazonaws.sagemaker.capabilities.accept-bind-to-port=true
12/19/2025 1:30 AM UTC
bfa7ce186765f755544bc969f4d80e1245a3740c569276628a590280ba65c6aeRUN
TRITON_VERSION=2.64.0 TRITON_CONTAINER_VERSION=25.12 chown triton-server:triton-server /opt/tritonserver/caches
12/19/2025 1:30 AM UTC
fa0f9f60f9392aff3d5a20e47ce8cf460259f4fc391fe729bca7b58c647ceaeaCOPY
--chown=1000:1000 /opt/tritonserver/caches/local /opt/tritonserver/caches/local
12/19/2025 1:30 AM UTC
02d398f231efa7959fb0946d2b15fcc4e0ba8707b68bb9f5d76bf2b07b74fbc2RUN
TRITON_VERSION=2.64.0 TRITON_CONTAINER_VERSION=25.12 chown triton-server:triton-server /opt/tritonserver/repoagents
12/19/2025 1:30 AM UTC
56d8bbb131ee4c902095d55bdf1ad991add9e29450948f70d5539b5d9ec461d7COPY
--chown=1000:1000 /opt/tritonserver/repoagents/checksum /opt/tritonserver/repoagents/checksum
12/19/2025 1:30 AM UTC
43ea51f556f7c10bd8d010b2113cc30167b4dad3a382420bfdb767e6d4a45bfdRUN
TRITON_VERSION=2.64.0 TRITON_CONTAINER_VERSION=25.12 chown triton-server:triton-server /opt/tritonserver/backends
12/19/2025 1:30 AM UTC
6c68f191a9500121b36d5e3317234bf2920b6fe74ace55ba1f9eb9663e442faeCOPY
--chown=1000:1000 /opt/tritonserver/backends/python /opt/tritonserver/backends/python
12/19/2025 1:30 AM UTC
2de271b3a111756f75b852bf10d728a39f8ae7fbce67d8b519b3fcb03051c66cCOPY
--chown=1000:1000 /opt/tritonserver/backends/identity /opt/tritonserver/backends/identity
12/19/2025 1:30 AM UTC
8b368a5513bc5992d87d28993e015bd1ea4f3fcddc257be823cbc06db36bbed0COPY
--chown=1000:1000 /opt/tritonserver/backends/pytorch /opt/tritonserver/backends/pytorch
12/19/2025 1:30 AM UTC
...

NVIDIA uses cookies to improve your experience on our web site. We and our third-party partners also use cookies and other tools to collect and record information you provide as well as information about your interactions with our websites for performance improvement, analytics, and to assist in marketing efforts. By clicking "Accept All", you consent to our use of cookies and other tools as described in our Cookie Policy. You can manage your cookie settings by clicking on "Manage Settings." By continuing to use this site or by clicking one of the buttons below, you agree to our Terms of Service (which contains important waivers). Please see our Privacy Policy for more information on our privacy practices.