Triton Inference Server

NVIDIA

Container

Triton Inference Server is open-source software that lets teams deploy trained AI models from any framework, loaded from local or cloud storage, on any GPU- or CPU-based infrastructure, whether in the cloud, in the data center, or on embedded devices.

NVIDIA AI Enterprise Supported

Layer	Instruction	Command	Created
sha256:303dfb1c79f786ae536f1ea1aa4dc770f037b650faca744e5ae60b0cd0b06945	COPY	`--chown=1000:1000 docker/sagemaker/serve /usr/bin/.`	12/19/2025 1:12 AM UTC
sha256:a3ed95caeb02ffe68cdd9fd84406680ae93d633cb16422d00e8a7c22955b46d4	LABEL	`com.amazonaws.sagemaker.capabilities.multi-models=true`	12/19/2025 1:12 AM UTC
sha256:a3ed95caeb02ffe68cdd9fd84406680ae93d633cb16422d00e8a7c22955b46d4	LABEL	`com.amazonaws.sagemaker.capabilities.accept-bind-to-port=true`	12/19/2025 1:12 AM UTC
sha256:30f8add1672ab579ec71dce1e1bc47576b182c738c4cd626415e201db4b6b25c	RUN	`TRITON_VERSION=2.64.0 TRITON_CONTAINER_VERSION=25.12 pip3 install -r python/openai/requirements.txt`	12/19/2025 1:12 AM UTC
sha256:971050ee7a228bcd3784045ea481db524f91a308c18a50ff1873603d98c6210c	RUN	`TRITON_VERSION=2.64.0 TRITON_CONTAINER_VERSION=25.12 find /opt/tritonserver/python -maxdepth 1 -type f -name "tritonserver-*.whl" | xargs -I {} pip install --upgrade {}[all] && find /opt/tritonserver/python -maxdepth 1 -type f -name "tritonfrontend-*.whl" | xargs -I {} pip install --upgrade {}[all]`	12/19/2025 1:12 AM UTC
sha256:65fb5cebd59633528fb79debde087cb3fe47ae5a5be650c774fca8287704031e	COPY	`--chown=1000:1000 NVIDIA_Deep_Learning_Container_License.pdf .`	12/19/2025 1:11 AM UTC
sha256:4f4fb700ef54461cfa02571ae0db9a0dc1e0cdb5577484a6d75e68dc38e8acc1	WORKDIR	`/opt/tritonserver`	12/19/2025 1:11 AM UTC
sha256:42605169928f564c76c21ec571cc85804ac29bcd5463bf32b579fad50badd820	COPY	`--chown=1000:1000 build/install tritonserver`	12/19/2025 1:11 AM UTC
sha256:4f4fb700ef54461cfa02571ae0db9a0dc1e0cdb5577484a6d75e68dc38e8acc1	WORKDIR	`/opt`	12/19/2025 1:11 AM UTC
sha256:a3ed95caeb02ffe68cdd9fd84406680ae93d633cb16422d00e8a7c22955b46d4	LABEL	`com.nvidia.build.ref=9bf730f9c9c3d98fa3d2fd5ef82f0aa5bd7c5d45`	12/19/2025 1:11 AM UTC