NVIDIA
NVIDIA
Triton Inference Server
Container
NVIDIA
NVIDIA
Triton Inference Server

Triton Inference Server is an open source software that lets teams deploy trained AI models from any framework, from local or cloud storage and on any GPU- or CPU-based infrastructure in the cloud, data center, or embedded devices.

LayerLabelCreated
ed7d0772141ec0f03a07fb2824b2167a17db6d8d353fc57d0757154a515c04d6CONFIG
Entrypoint /opt/nvidia/nvidia_entrypoint.sh
06/24/2026 11:22 PM UTC
4a23752840e30c223c693286b38889c60087b7778e5f230b9966d3851e29f86cCOPY
docker/entrypoint.d/ /opt/nvidia/entrypoint.d/
06/24/2026 8:52 PM UTC
19fd18c31685468a030036ab82e0a18326e1fe70f02cb586cb647b81865256ccENV
NVIDIA_PRODUCT_NAME=Triton Server Base
06/24/2026 8:52 PM UTC
45f6d5f37c528a884a9435a1acfdd28deffd66ff3c74549262732e13d91b8671ENV
NVIDIA_BUILD_ID=347409133
06/24/2026 8:52 PM UTC
900992ccc09b03791a5f5ae6a2bca1dcff1ddc292e448046f1cab7395b1ebcecENV
NVIDIA_TRITON_SERVER_BASE_VERSION=26.06
06/24/2026 8:52 PM UTC
f270eca33e696d6747b34e819e76a9ec93a7a8430309a4b3e576e4859e33be4fARG
NVIDIA_BUILD_ID=347409133
06/24/2026 8:52 PM UTC
3cefa4e2f9c1cff054fdacf7c2ed02fcf7fbf5e1a537b90d9bface9129c83487ARG
NVIDIA_TRITON_SERVER_BASE_VERSION=26.06
06/24/2026 8:52 PM UTC
8bbf6d334cc69fd6a02a611213724fcc86477f1dff3734fdd83905e9eccb533dRUN
TARGETARCH=amd64 ENABLE_FIPS=0 ENABLE_MITMPROXY=0 /tmp/manage_cert.sh uninstall
06/16/2026 9:26 PM UTC
9e8685614a2e221bebc38d9d57ebb0696ace96827b606db07ca4114c121a3ea2ENV
LIBRARY_PATH=/usr/local/cuda/lib64/stubs:/usr/local/cuda/lib64/stubs:
06/16/2026 9:26 PM UTC
e97a8e18cd477d81164a0712c412d96e24a9506ba43d9923499993a822013be7RUN
RUN |3 TARGETARCH=amd64 ENABLE_FIPS=0 ENABLE_MITMPROXY=0 /bin/sh -c set -exo pipefail export ARTIFACTORY_USER=$(cat /run/secrets/ARTIFACTORY_USER) export ARTIFACTORY_TOKEN=$(cat /run/secrets/ARTIFACTORY_TOKEN) export DEVEL=1 BASE=0 /nvidia/build-scripts/installNCU.sh /nvidia/build-scripts/installCUDA.sh /nvidia/build-scripts/installLIBS.sh /nvidia/build-scripts/installNCCL.sh # https://jirasw.nvidia.com/browse/DLR-4957 to get the headers and static files with symlinks to the common location DPKG_DIVERT=1 STATIC=1 /nvidia/build-scripts/installNVSHMEM.sh /nvidia/build-scripts/installCUDNN.sh /nvidia/build-scripts/installTRT.sh ARTIFACTORY_CLOUD=1 /nvidia/build-scripts/installNSYS.sh /nvidia/build-scripts/installCUSPARSELT.sh if [ -f "/tmp/cuda-${_CUDA_VERSION_MAJMIN}.patch" ]; then patch -p0 < /tmp/cuda-${_CUDA_VERSION_MAJMIN}.patch; fi rm -f /tmp/cuda-*.patch # buildkit
06/16/2026 9:26 PM UTC
...

NVIDIA uses cookies to improve your experience on our web site. We and our third-party partners also use cookies and other tools to collect and record information you provide as well as information about your interactions with our websites for performance improvement, analytics, and to assist in marketing efforts. By clicking "Accept All", you consent to our use of cookies and other tools as described in our Cookie Policy. You can manage your cookie settings by clicking on "Manage Settings." By continuing to use this site or by clicking one of the buttons below, you agree to our Terms of Service (which contains important waivers). Please see our Privacy Policy for more information on our privacy practices.