NVIDIA
NVIDIA
Triton Inference Server
Container
NVIDIA
NVIDIA
Triton Inference Server

Triton Inference Server is an open source software that lets teams deploy trained AI models from any framework, from local or cloud storage and on any GPU- or CPU-based infrastructure in the cloud, data center, or embedded devices.

LayerLabelCreated
c19ffb21c079d5331e775e02ba2917359abad16e82e7c3bbedf0e5b683c3d90aCONFIG
Entrypoint /opt/nvidia/nvidia_entrypoint.sh; WorkingDir /opt/tritonserver
02/14/2026 12:28 AM UTC
74a6187b394f294c52bc0ab0e73867d358833db09cf0fdb660a0759ac9e713f0LABEL
com.amazonaws.sagemaker.capabilities.accept-bind-to-port=true
02/14/2026 12:28 AM UTC
0c2a8077b5a1b8072f69701675ed609d7c1cd7471cc4a932a60000a4a20b9240RUN
TRITON_VERSION=2.66.0 TRITON_CONTAINER_VERSION=26.02 chown triton-server:triton-server /opt/tritonserver/caches
02/14/2026 12:28 AM UTC
bb4d4d4da548e2c750f5b81c5bcf24eefa2cbf7f71593a9c5198c724ad5ce238COPY
--chown=1000:1000 /opt/tritonserver/caches/local /opt/tritonserver/caches/local
02/14/2026 12:28 AM UTC
3bd264a7e046f4f68e781474f7d2205ab068a309271e9f1db0ba41e3a59f0074RUN
TRITON_VERSION=2.66.0 TRITON_CONTAINER_VERSION=26.02 chown triton-server:triton-server /opt/tritonserver/repoagents
02/14/2026 12:28 AM UTC
500a51b814549d7a6dbbf2509753ab82eedf627362d5ca6ea482b81efe3a5110COPY
--chown=1000:1000 /opt/tritonserver/repoagents/checksum /opt/tritonserver/repoagents/checksum
02/14/2026 12:28 AM UTC
4906c2b777654bb7edbf58bb7ce55970748645d8b0a69a15185af0ec1a1e85fdRUN
TRITON_VERSION=2.66.0 TRITON_CONTAINER_VERSION=26.02 chown triton-server:triton-server /opt/tritonserver/backends
02/14/2026 12:28 AM UTC
84be032ea95fc08a0dc9cca905d372880205ceeacd5764c2db98bfd5ee736534COPY
--chown=1000:1000 /opt/tritonserver/backends/python /opt/tritonserver/backends/python
02/14/2026 12:28 AM UTC
fb78407478c9577609da1eddb236b6fc77221f85bc517d4ac8f647f36b79e2a2COPY
--chown=1000:1000 /opt/tritonserver/backends/identity /opt/tritonserver/backends/identity
02/14/2026 12:28 AM UTC
dda97274be0d0325eb7874ae3db2fcd03225c6d08a38893814a3f26fade5f0beCOPY
--chown=1000:1000 /opt/tritonserver/backends/pytorch /opt/tritonserver/backends/pytorch
02/14/2026 12:28 AM UTC
...