Triton Inference Server

NVIDIA

Container

Triton Inference Server is open-source software that lets teams deploy trained AI models from any framework, from local or cloud storage, on any GPU- or CPU-based infrastructure in the cloud, in the data center, or on embedded devices.

NVIDIA AI Enterprise Supported

Layer	Instruction	Arguments	Created
sha256:a3ed95caeb02ffe68cdd9fd84406680ae93d633cb16422d00e8a7c22955b46d4	ENV	`LD_LIBRARY_PATH=/usr/local/tensorrt/lib/:/opt/tritonserver/backends/tensorrtllm:/usr/local/tensorrt/lib:/usr/local/cuda/compat/lib:/usr/local/nvidia/lib:/usr/local/nvidia/lib64`	12/19/2025 1:08 AM UTC
sha256:bee52362e9699e9ad3d05d0b70cedcccaf76212d2b8f648ee9b71283f815fa80	RUN	`TRITON_VERSION=2.64.0 TRITON_CONTAINER_VERSION=25.12 ldconfig && ARCH="$(uname -i)" && rm -fr ${TRT_ROOT}/bin ${TRT_ROOT}/targets/${ARCH}-linux-gnu/bin ${TRT_ROOT}/data && rm -fr ${TRT_ROOT}/doc ${TRT_ROOT}/onnx_graphsurgeon ${TRT_ROOT}/python && rm -fr ${TRT_ROOT}/samples ${TRT_ROOT}/targets/${ARCH}-linux-gnu/samples && pip3 install --no-cache-dir transformers && find /usr -name libtensorrt_llm.so -exec dirname {} \; > /etc/ld.so.conf.d/tensorrt-llm.conf && find /opt/tritonserver -name libtritonserver.so -exec dirname {} \; > /etc/ld.so.conf.d/triton-tensorrtllm-worker.conf && pip3 install --no-cache-dir grpcio-tools==1.64.0 && pip3 uninstall -y setuptools`	12/19/2025 1:08 AM UTC
sha256:4f671e530abc2d8d741e6d09336c5f9ed4134a05963adbc9c51af2eefdef53f2	COPY	`--chown=1000:1000 docker/sagemaker/serve /usr/bin/.`	12/19/2025 1:08 AM UTC
sha256:a3ed95caeb02ffe68cdd9fd84406680ae93d633cb16422d00e8a7c22955b46d4	LABEL	`com.amazonaws.sagemaker.capabilities.multi-models=true`	12/19/2025 1:08 AM UTC
sha256:a3ed95caeb02ffe68cdd9fd84406680ae93d633cb16422d00e8a7c22955b46d4	LABEL	`com.amazonaws.sagemaker.capabilities.accept-bind-to-port=true`	12/19/2025 1:08 AM UTC
sha256:2de0b52fb0aef964631be49c7a4ef35722618efc79bf8dc62e97b0f4b41a741b	RUN	`TRITON_VERSION=2.64.0 TRITON_CONTAINER_VERSION=25.12 pip3 install -r python/openai/requirements.txt`	12/19/2025 1:08 AM UTC
sha256:8ba9c570e37909e1c733ca63bbdfaed2b3c3ad15a53890acccecfa2a5896a58b	RUN	`TRITON_VERSION=2.64.0 TRITON_CONTAINER_VERSION=25.12 find /opt/tritonserver/python -maxdepth 1 -type f -name "tritonserver-*.whl" | xargs -I {} pip install --upgrade {}[all] && find /opt/tritonserver/python -maxdepth 1 -type f -name "tritonfrontend-*.whl" | xargs -I {} pip install --upgrade {}[all]`	12/19/2025 1:08 AM UTC
sha256:483b7f0367af47c9ce954a903539963f5765c936dee666a2b533dd0d5cd23a6f	COPY	`--chown=1000:1000 NVIDIA_Deep_Learning_Container_License.pdf .`	12/19/2025 1:08 AM UTC
sha256:4f4fb700ef54461cfa02571ae0db9a0dc1e0cdb5577484a6d75e68dc38e8acc1	WORKDIR	`/opt/tritonserver`	12/19/2025 1:08 AM UTC
sha256:cadbd2fb737021675811f90bd45316d10d228837d3e339520efb427aea61bdb1	COPY	`--chown=1000:1000 build/install tritonserver`	12/19/2025 1:08 AM UTC
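
The wheel-installation layer in the table locates wheel files by name with `find` and hands each path to `pip` through `xargs -I {}`, appending the `[all]` extra. The following is a minimal sketch of that pattern, not the image's actual build step. It assumes a temporary directory and a made-up wheel filename (both hypothetical), and it substitutes `echo` for the real `pip` call so that nothing is installed.

```shell
#!/bin/sh
# Sketch only: the directory and filenames below are hypothetical stand-ins.
set -eu

demo_dir="$(mktemp -d)"
touch "$demo_dir/tritonserver-0.0.0-py3-none-any.whl"   # hypothetical wheel name
touch "$demo_dir/README.txt"                            # skipped by the -name filter

# Same shape as the layer command: find selects matching files in the top-level
# directory only, and xargs substitutes each path for {} in the command line.
# echo prints the pip command instead of running it.
result="$(find "$demo_dir" -maxdepth 1 -type f -name "tritonserver-*.whl" \
  | xargs -I {} echo pip install --upgrade "{}[all]")"

echo "$result"
rm -rf "$demo_dir"
```

Quoting `"{}[all]"` keeps the shell from treating the brackets as a glob pattern; `xargs` still replaces the `{}` inside the quoted argument.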