NVIDIA
NVIDIA
Triton Inference Server
Container
NVIDIA
NVIDIA
Triton Inference Server

Triton Inference Server is an open source software that lets teams deploy trained AI models from any framework, from local or cloud storage and on any GPU- or CPU-based infrastructure in the cloud, data center, or embedded devices.

LayerLabelCreated
33f19b04a9ef38c177450ec8374d3b6d7a7330f56c40b7a52b64a00f5c2f9700CONFIG
Entrypoint /opt/nvidia/nvidia_entrypoint.sh; WorkingDir /opt/tritonserver
03/18/2026 9:54 PM UTC
c7ebec6382cba2785b9898e9fc5ec1c007be62be7f829f299423c7d71cb168c2LABEL
com.amazonaws.sagemaker.capabilities.accept-bind-to-port=true
03/18/2026 9:54 PM UTC
5d9e389266c451a86d7a0be46029db80371b2e309ebfb6ce7fcfb321f2e80c2dRUN
TRITON_VERSION=2.67.0 TRITON_CONTAINER_VERSION=26.03 chown triton-server:triton-server /opt/tritonserver/caches
03/18/2026 9:54 PM UTC
57025dcaa2947d683a977678db920cee3b0175a4dc8c9082bc9d3a42718f5051COPY
--chown=1000:1000 /opt/tritonserver/caches/local /opt/tritonserver/caches/local
03/18/2026 9:54 PM UTC
157684ecee01307b2d563bcf4a19a72981e442da06e3518f61f75d4f71072b8dRUN
TRITON_VERSION=2.67.0 TRITON_CONTAINER_VERSION=26.03 chown triton-server:triton-server /opt/tritonserver/repoagents
03/18/2026 9:54 PM UTC
456e1991436de5451bdfd2109045d4fcc4e9f712490c85e3002553c4d3afe030COPY
--chown=1000:1000 /opt/tritonserver/repoagents/checksum /opt/tritonserver/repoagents/checksum
03/18/2026 9:54 PM UTC
b9af1b9d5bd10e13aa939a13c2f12196b95c16839b3b89dd284de719ca00aa42RUN
TRITON_VERSION=2.67.0 TRITON_CONTAINER_VERSION=26.03 chown triton-server:triton-server /opt/tritonserver/backends
03/18/2026 9:54 PM UTC
72a069aa1968e040cb9342c33c9a88c02f70128df8885fae5d669dd0ada3d0f9COPY
--chown=1000:1000 /opt/tritonserver/backends/python /opt/tritonserver/backends/python
03/18/2026 9:53 PM UTC
a23ca55074a85c1cb0bd613287ccd6ac1921b58463c1f2dfbce589989a25cd2bCOPY
--chown=1000:1000 /opt/tritonserver/backends/identity /opt/tritonserver/backends/identity
03/18/2026 9:53 PM UTC
fd3a1913bdfb902b0c6a7406941ea339aeb11f5da5fed471319e609150daa8d1COPY
--chown=1000:1000 /opt/tritonserver/backends/pytorch /opt/tritonserver/backends/pytorch
03/18/2026 9:53 PM UTC
...