Triton Inference Server

NVIDIA

Container

Triton Inference Server is open-source software that lets teams deploy trained AI models from any framework, loaded from local or cloud storage, on any GPU- or CPU-based infrastructure: in the cloud, in the data center, or on embedded devices.

NVIDIA AI Enterprise Supported

Layer	Instruction	Arguments	Created
sha256:b4b010599856348851a0ad81a0662d752008f3300351c130ad1a868230e42bd8	COPY	`--chown=1000:1000 docker/sagemaker/serve /usr/bin/.`	06/18/2025 9:55 PM UTC
sha256:a3ed95caeb02ffe68cdd9fd84406680ae93d633cb16422d00e8a7c22955b46d4	LABEL	`com.amazonaws.sagemaker.capabilities.multi-models=true`	06/18/2025 9:55 PM UTC
sha256:a3ed95caeb02ffe68cdd9fd84406680ae93d633cb16422d00e8a7c22955b46d4	LABEL	`com.amazonaws.sagemaker.capabilities.accept-bind-to-port=true`	06/18/2025 9:55 PM UTC
sha256:3caedfc7f03883300763d1f6a6a7490787f66fcf5f55dd684843b2205ace847f	RUN	`TRITON_VERSION=2.59.0 TRITON_CONTAINER_VERSION=25.06 pip3 install -r python/openai/requirements.txt`	06/18/2025 9:55 PM UTC
sha256:5b7c394f237c75d63238a00fcad06c75a87f2a32832e3c657d769c7ee2e20946	RUN	`TRITON_VERSION=2.59.0 TRITON_CONTAINER_VERSION=25.06 find /opt/tritonserver/python -maxdepth 1 -type f -name "tritonserver-*.whl" | xargs -I {} pip install --upgrade {}[all] && find /opt/tritonserver/python -maxdepth 1 -type f -name "tritonfrontend-*.whl" | xargs -I {} pip install --upgrade {}[all]`	06/18/2025 9:55 PM UTC
sha256:6eceedf655ecc1baaaa7a8d918b4afef9485656bf7ac82c6235b60fb1269e62e	COPY	`--chown=1000:1000 NVIDIA_Deep_Learning_Container_License.pdf .`	06/18/2025 9:55 PM UTC
sha256:4f4fb700ef54461cfa02571ae0db9a0dc1e0cdb5577484a6d75e68dc38e8acc1	WORKDIR	`/opt/tritonserver`	06/18/2025 9:55 PM UTC
sha256:a594d8c87d27cc42206c7c6a3aeab2ae0a693f7cb9fd2542ebfc6fdbbf434934	COPY	`--chown=1000:1000 build/install tritonserver`	06/18/2025 9:55 PM UTC
sha256:4f4fb700ef54461cfa02571ae0db9a0dc1e0cdb5577484a6d75e68dc38e8acc1	WORKDIR	`/opt`	06/18/2025 9:55 PM UTC
sha256:a3ed95caeb02ffe68cdd9fd84406680ae93d633cb16422d00e8a7c22955b46d4	LABEL	`com.nvidia.build.ref=7a14b7925e0195aab82a319261751f73c3e2369b`	06/18/2025 9:55 PM UTC
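
The rows above can also be read as data. The following is a minimal sketch in Python, assuming a handful of rows hand-copied from the table rather than fetched from a registry; the helper names `labels_from` and `shared_digests` are hypothetical and not part of any NVIDIA or Docker tooling. It collects the `LABEL` key/value pairs and reports which digests appear on more than one row. In this listing the repeated digests belong to `LABEL` and `WORKDIR` rows, which would be consistent with those instructions adding no filesystem content, although the listing itself does not state that.

```python
from collections import defaultdict

# Digests copied verbatim from the table above.
COPY_SERVE = "sha256:b4b010599856348851a0ad81a0662d752008f3300351c130ad1a868230e42bd8"
LABEL_DIGEST = "sha256:a3ed95caeb02ffe68cdd9fd84406680ae93d633cb16422d00e8a7c22955b46d4"
WORKDIR_DIGEST = "sha256:4f4fb700ef54461cfa02571ae0db9a0dc1e0cdb5577484a6d75e68dc38e8acc1"

# (digest, second column, third column) for a subset of the rows, in table order.
ROWS = [
    (COPY_SERVE, "COPY", "--chown=1000:1000 docker/sagemaker/serve /usr/bin/."),
    (LABEL_DIGEST, "LABEL", "com.amazonaws.sagemaker.capabilities.multi-models=true"),
    (LABEL_DIGEST, "LABEL", "com.amazonaws.sagemaker.capabilities.accept-bind-to-port=true"),
    (WORKDIR_DIGEST, "WORKDIR", "/opt/tritonserver"),
    (WORKDIR_DIGEST, "WORKDIR", "/opt"),
    (LABEL_DIGEST, "LABEL", "com.nvidia.build.ref=7a14b7925e0195aab82a319261751f73c3e2369b"),
]


def labels_from(rows):
    """Return a dict of the key=value pairs found on LABEL rows."""
    labels = {}
    for _digest, instruction, arguments in rows:
        if instruction == "LABEL":
            key, _, value = arguments.partition("=")
            labels[key] = value
    return labels


def shared_digests(rows):
    """Return {digest: [instructions]} for digests listed on more than one row."""
    by_digest = defaultdict(list)
    for digest, instruction, _arguments in rows:
        by_digest[digest].append(instruction)
    return {d: ins for d, ins in by_digest.items() if len(ins) > 1}


if __name__ == "__main__":
    for key, value in labels_from(ROWS).items():
        print(f"{key} = {value}")
    for digest, instructions in shared_digests(ROWS).items():
        print(digest, instructions)
```

Keeping the rows as plain tuples avoids any dependency on a registry client; the same two helpers would work unchanged on rows parsed from the full tab-separated table.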