Triton Inference Server

NVIDIA

Container

NVIDIA

Triton Inference Server

Triton Inference Server is an open source software that lets teams deploy trained AI models from any framework, from local or cloud storage and on any GPU- or CPU-based infrastructure in the cloud, data center, or embedded devices.

NVIDIA AI Enterprise Supported

Layer	Label		Created
sha256:715500bda3c00ec2068e2a405b275742ae319d72aaa00c2e874952eeb5bd3cef	COPY	`--chown=1000:1000 docker/sagemaker/serve /usr/bin/.`	08/22/2025 1:14 AM UTC
sha256:a3ed95caeb02ffe68cdd9fd84406680ae93d633cb16422d00e8a7c22955b46d4	LABEL	`com.amazonaws.sagemaker.capabilities.multi-models=true`	08/22/2025 1:14 AM UTC
sha256:a3ed95caeb02ffe68cdd9fd84406680ae93d633cb16422d00e8a7c22955b46d4	LABEL	`com.amazonaws.sagemaker.capabilities.accept-bind-to-port=true`	08/22/2025 1:14 AM UTC
sha256:85fa5b02971a8121a543e676896b287c42999473690752a6a801be93f9a71b1a	RUN	`TRITON_VERSION=2.60.0 TRITON_CONTAINER_VERSION=25.08 BUILD_PUBLIC_VLLM=false PYVER=3.12 pip3 install -r python/openai/requirements.txt`	08/22/2025 1:14 AM UTC
sha256:f350978836ca81ee0d739a198dd686f8ab217fb7229b9cf79b5b0ce8dd004f3c	RUN	`TRITON_VERSION=2.60.0 TRITON_CONTAINER_VERSION=25.08 BUILD_PUBLIC_VLLM=false PYVER=3.12 find /opt/tritonserver/python -maxdepth 1 -type f -name "tritonserver-.whl" \| xargs -I {} pip install --upgrade {}[all] && find /opt/tritonserver/python -maxdepth 1 -type f -name "tritonfrontend-.whl" \| xargs -I {} pip install --upgrade {}[all]`	08/22/2025 1:14 AM UTC
sha256:cbb23730bc2bfb8830756d272d80ec777946f8b00c4af7f2627965aeaa0b061a	COPY	`--chown=1000:1000 NVIDIA_Deep_Learning_Container_License.pdf .`	08/22/2025 1:14 AM UTC
sha256:4f4fb700ef54461cfa02571ae0db9a0dc1e0cdb5577484a6d75e68dc38e8acc1	WORKDIR	`/opt/tritonserver`	08/22/2025 1:14 AM UTC
sha256:d23963b4ad5839f97a9541b971277522d14125288e53dc7b2c9664394da523d7	COPY	`--chown=1000:1000 build/install tritonserver`	08/22/2025 1:14 AM UTC
sha256:4f4fb700ef54461cfa02571ae0db9a0dc1e0cdb5577484a6d75e68dc38e8acc1	WORKDIR	`/opt`	08/22/2025 1:14 AM UTC
sha256:a3ed95caeb02ffe68cdd9fd84406680ae93d633cb16422d00e8a7c22955b46d4	LABEL	`com.nvidia.build.ref=8ced3b40794a0aed058b85333c4c4bb638de5476`	08/22/2025 1:14 AM UTC