NVIDIA
Triton Inference Server
Container

Triton Inference Server is open-source software that lets teams deploy trained AI models from any framework, loaded from local or cloud storage, on any GPU- or CPU-based infrastructure, whether in the cloud, in the data center, or on embedded devices.

Layer | Label | Created

cc0a727828e5772afea9c4f344f24f58d2e5ae01601254bdc61fcaad1d7a933e | CONFIG | 05/19/2026 1:17 AM UTC
    Entrypoint /opt/nvidia/nvidia_entrypoint.sh

7b8d9b9ac1ed13d22ad7aa41eb48c37a091d182b57d11d968314cb4e591947fa | COPY | 05/19/2026 1:17 AM UTC
    docker/entrypoint.d/ /opt/nvidia/entrypoint.d/

ffada9700043fbef763b62fa8ec1b87fcf4aed02363bf81459408021ac201c51 | ENV | 05/19/2026 1:17 AM UTC
    NVIDIA_PRODUCT_NAME=Triton Server Base

054efa7f953b53898207ea07956bbeedd451a831275712d89b4f3be662aa6d68 | ENV | 05/19/2026 1:17 AM UTC
    NVIDIA_BUILD_ID=321060090

9eb52ad342ee8fbb68894bc438b7f1814b4bed6339d69937bd41fb6b8895f1d0 | ENV | 05/19/2026 1:17 AM UTC
    NVIDIA_TRITON_SERVER_BASE_VERSION=26.05

9ade7e7f43422553394128274b898f736a95af12ce4e6563a402dfa819b27c24 | ARG | 05/19/2026 1:17 AM UTC
    NVIDIA_BUILD_ID=321060090

21bfbd8512ad6c3406c331c065bd0b4bfa51b2e618110782af57336c7c469264 | ARG | 05/19/2026 1:17 AM UTC
    NVIDIA_TRITON_SERVER_BASE_VERSION=26.05

2e1919a175b706b50b6a6316b6fdb10f132f50327e2d32326acf8b8f1e2ad553 | RUN | 05/07/2026 9:14 PM UTC
    TARGETARCH=amd64 ENABLE_FIPS=0 ENABLE_MITMPROXY=0 /tmp/manage_cert.sh uninstall

7f10e70f9799b238a87894400371d62f9672bd16fe9e6ca4c94776bbb9679a33 | ENV | 05/07/2026 9:14 PM UTC
    LIBRARY_PATH=/usr/local/cuda/lib64/stubs:/usr/local/cuda/lib64/stubs:

eb2e56aa011f6bf3eb6e5a466c85b784a7b7c19f30c20965c920eedb8069955a | RUN | 05/07/2026 9:14 PM UTC
    RUN |3 TARGETARCH=amd64 ENABLE_FIPS=0 ENABLE_MITMPROXY=0 /bin/sh -c set -exo pipefail export ARTIFACTORY_USER=$(cat /run/secrets/ARTIFACTORY_USER) export ARTIFACTORY_TOKEN=$(cat /run/secrets/ARTIFACTORY_TOKEN) export DEVEL=1 BASE=0 /nvidia/build-scripts/installNCU.sh /nvidia/build-scripts/installCUDA.sh /nvidia/build-scripts/installLIBS.sh /nvidia/build-scripts/installNCCL.sh # https://jirasw.nvidia.com/browse/DLR-4957 to get the headers and static files with symlinks to the common location DPKG_DIVERT=1 STATIC=1 /nvidia/build-scripts/installNVSHMEM.sh /nvidia/build-scripts/installCUDNN.sh /nvidia/build-scripts/installTRT.sh ARTIFACTORY_CLOUD=1 /nvidia/build-scripts/installNSYS.sh /nvidia/build-scripts/installCUSPARSELT.sh if [ -f "/tmp/cuda-${_CUDA_VERSION_MAJMIN}.patch" ]; then patch -p0 < /tmp/cuda-${_CUDA_VERSION_MAJMIN}.patch; fi rm -f /tmp/cuda-*.patch # buildkit
...
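
The ENV layers listed above set variables that should be visible to any process started inside the container. As a minimal sketch, assuming those values are not overridden by layers elided from this listing or by flags passed at run time, the snippet below reads them back. The helper name `read_image_metadata` and the tuple `TRITON_BASE_ENV_VARS` are hypothetical and introduced only for illustration; only the variable names come from the listing.

```python
import os
from typing import Dict, Mapping, Optional

# Variable names copied from the ENV layers in the listing above.
# The tuple itself is a hypothetical convenience, not part of the image.
TRITON_BASE_ENV_VARS = (
    "NVIDIA_PRODUCT_NAME",
    "NVIDIA_BUILD_ID",
    "NVIDIA_TRITON_SERVER_BASE_VERSION",
    "LIBRARY_PATH",
)


def read_image_metadata(
    environ: Optional[Mapping[str, str]] = None,
) -> Dict[str, Optional[str]]:
    """Return the listed ENV values, with None for any variable that is unset."""
    env = os.environ if environ is None else environ
    return {name: env.get(name) for name in TRITON_BASE_ENV_VARS}


if __name__ == "__main__":
    # Inside the container this prints whatever the image actually sets;
    # outside it, most entries will show as unset.
    for name, value in read_image_metadata().items():
        print(f"{name}={value if value is not None else '<unset>'}")
```

Passing an explicit mapping instead of relying on `os.environ` makes it possible to check the helper outside the container, for example against values copied by hand from the layer listing.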