NVIDIA
NVIDIA
Triton Inference Server
Container
NVIDIA
NVIDIA
Triton Inference Server

Triton Inference Server is an open source software that lets teams deploy trained AI models from any framework, from local or cloud storage and on any GPU- or CPU-based infrastructure in the cloud, data center, or embedded devices.

LayerLabelCreated
8e415109108108552b0d563f7ecda54226a725be3178b592e0892113f7162c7eCONFIG
Entrypoint /opt/nvidia/nvidia_entrypoint.sh
03/18/2026 7:02 PM UTC
a0d7e70563661b4acbda02cd4d74b0fa3e2f973339d9d071c75f4ea8a301e4d2COPY
docker/entrypoint.d/ /opt/nvidia/entrypoint.d/
03/18/2026 7:02 PM UTC
bfd0e08b89a91e752369fdc34cc5b832efa4b78303110183ae7d8bcd7cb952c7ENV
NVIDIA_PRODUCT_NAME=Triton Server Base
03/18/2026 7:02 PM UTC
e54f8b7a7385fee4d1349c15424d5880812d2b6911511511f21ac2a7d4f6721dENV
NVIDIA_BUILD_ID=283962264
03/18/2026 7:02 PM UTC
9ba6423aa01c2df3bfacbfc9884d470f9aada5f41e909f473b122c9b04cfc6e2ENV
NVIDIA_TRITON_SERVER_BASE_VERSION=26.03
03/18/2026 7:02 PM UTC
a941b44e670e77c2ba03f851b87c4f95ea0da5bcf4d2a03a308538c9f642ea56ARG
NVIDIA_BUILD_ID=283962264
03/18/2026 7:02 PM UTC
58ac2110992c24ad208dfe690233d0302426679d911f6c22e3995ac381a4e8c9ARG
NVIDIA_TRITON_SERVER_BASE_VERSION=26.03
03/18/2026 7:02 PM UTC
2e7152c1a22cc888bce5c9e6209db4e9b054bb7a9857e5fa82c47326b4118bb4RUN
TARGETARCH=amd64 ENABLE_FIPS=0 ENABLE_MITMPROXY=0 /tmp/manage_cert.sh uninstall
03/10/2026 7:31 PM UTC
3439e60a2adb361992f5cb1011cfdef7625b30566e5616a1e96ff7f46b8749c7ENV
LIBRARY_PATH=/usr/local/cuda/lib64/stubs:/usr/local/cuda/lib64/stubs:
03/10/2026 7:31 PM UTC
f5e46e63c401f7ae6bde6025f818129ce37f9aa8486a19bb86116220dd94fcbbRUN
RUN |3 TARGETARCH=amd64 ENABLE_FIPS=0 ENABLE_MITMPROXY=0 /bin/sh -c set -exo pipefail export ARTIFACTORY_USER=$(cat /run/secrets/ARTIFACTORY_USER) export ARTIFACTORY_TOKEN=$(cat /run/secrets/ARTIFACTORY_TOKEN) export DEVEL=1 BASE=0 /nvidia/build-scripts/installNCU.sh /nvidia/build-scripts/installCUDA.sh /nvidia/build-scripts/installLIBS.sh /nvidia/build-scripts/installNCCL.sh /nvidia/build-scripts/installNVSHMEM.sh export CUDA_VERSION_MAJOR=$(echo "${CUDA_VERSION}" | cut -d. -f1) # Link nvshmem libs to /usr/local/cuda/lib64 find /usr/lib/*-linux-gnu/nvshmem/${CUDA_VERSION_MAJOR}/ -maxdepth 1 -type f -exec ln -sf {} /usr/local/cuda/lib64/ \; find /usr/lib/*-linux-gnu/nvshmem/${CUDA_VERSION_MAJOR}/ -maxdepth 1 -type l -exec ln -sf {} /usr/local/cuda/lib64/ \; /nvidia/build-scripts/installCUDNN.sh /nvidia/build-scripts/installTRT.sh ARTIFACTORY_CLOUD=1 /nvidia/build-scripts/installNSYS.sh /nvidia/build-scripts/installCUSPARSELT.sh if [ -f "/tmp/cuda-${_CUDA_VERSION_MAJMIN}.patch" ]; then patch -p0 < /tmp/cuda-${_CUDA_VERSION_MAJMIN}.patch; fi rm -f /tmp/cuda-*.patch # buildkit
03/10/2026 7:31 PM UTC
...

NVIDIA uses cookies to improve your experience on our web site. We and our third-party partners also use cookies and other tools to collect and record information you provide as well as information about your interactions with our websites for performance improvement, analytics, and to assist in marketing efforts. By clicking "Accept All", you consent to our use of cookies and other tools as described in our Cookie Policy. You can manage your cookie settings by clicking on "Manage Settings." By continuing to use this site or by clicking one of the buttons below, you agree to our Terms of Service (which contains important waivers). Please see our Privacy Policy for more information on our privacy practices.