NVIDIA
NVIDIA
Triton Inference Server
Container
NVIDIA
NVIDIA
Triton Inference Server

Triton Inference Server is an open source software that lets teams deploy trained AI models from any framework, from local or cloud storage and on any GPU- or CPU-based infrastructure in the cloud, data center, or embedded devices.

LayerLabelCreated
4813e75b152252935a9791e91f110a26d8cbfb8b83571011d2f29dc963533b2fCONFIG
Entrypoint /opt/nvidia/nvidia_entrypoint.sh; WorkingDir /opt/tritonserver
06/18/2025 9:17 PM UTC
d82c52d1941962f84597cd6a660ad5fab48f592d270680596ce286966ec001b8LABEL
com.amazonaws.sagemaker.capabilities.multi-models=true
06/18/2025 9:17 PM UTC
2531c9ad87fb8b40521976ae39e54ff75b16b779ce814504409f084c613d6b24LABEL
com.amazonaws.sagemaker.capabilities.accept-bind-to-port=true
06/18/2025 9:17 PM UTC
097392740ea897d1e23d8451f1408fab7ab1c101d33af13a8480caeb9c8fccbbRUN
TRITON_VERSION=2.59.0 TRITON_CONTAINER_VERSION=25.06 BUILD_PUBLIC_VLLM=false PYVER=3.12 pip3 install -r python/openai/requirements.txt
06/18/2025 9:17 PM UTC
7ccbf91cf0afd39d3512e6ed9115e4927cd8edfae1bcd50a9e105dcb4a9c6c46RUN
TRITON_VERSION=2.59.0 TRITON_CONTAINER_VERSION=25.06 BUILD_PUBLIC_VLLM=false PYVER=3.12 find /opt/tritonserver/python -maxdepth 1 -type f -name "tritonserver-*.whl" | xargs -I {} pip install --upgrade {}[all] &&
  find /opt/tritonserver/python -maxdepth 1 -type f -name "tritonfrontend-*.whl" | xargs -I {} pip install --upgrade {}[all]
06/18/2025 9:17 PM UTC
b5470dc478874345717a0237096f2f408957f229f82ab283dcc4f07a2ab57354COPY
--chown=1000:1000 NVIDIA_Deep_Learning_Container_License.pdf .
06/18/2025 9:17 PM UTC
52f9c97d91285f9a6470f0954f20371683fa80edce79748fbc84bf8f60445767WORKDIR
/opt/tritonserver
06/18/2025 9:17 PM UTC
5176a479c1d92724e471daee7dd86196371bf77feeaa8d143743a7de97ef9fe9COPY
--chown=1000:1000 build/install tritonserver
06/18/2025 9:17 PM UTC
97efb3b19fec45edc9d7b3794d9ae02e6852f29a93bf70334cf52e87d1b2a466WORKDIR
/opt
06/18/2025 9:17 PM UTC
2f03af524f0158d0dba88d80e9835243a1243f493c7a2035955e6016c217bf2fLABEL
com.nvidia.build.ref=7a14b7925e0195aab82a319261751f73c3e2369b
06/18/2025 9:17 PM UTC
...