NVIDIA
NVIDIA
Triton Inference Server
Container
NVIDIA
NVIDIA
Triton Inference Server

Triton Inference Server is an open source software that lets teams deploy trained AI models from any framework, from local or cloud storage and on any GPU- or CPU-based infrastructure in the cloud, data center, or embedded devices.

LayerLabelCreated
bb8445afc494d94fa51a105648fe9ec46991f786e999f4509048c80ebfd0de0fCONFIG
Entrypoint /opt/nvidia/nvidia_entrypoint.sh; WorkingDir /opt/tritonserver
08/22/2025 1:26 AM UTC
fb77d8cf1e268b0d250894847d46193ee9d70b697be6db9afc524565ab2d5874LABEL
com.amazonaws.sagemaker.capabilities.multi-models=true
08/22/2025 1:26 AM UTC
582f836aa3ac268e89087e7df7fd7021168a2795cc5c114c620a4cfb8bd212feLABEL
com.amazonaws.sagemaker.capabilities.accept-bind-to-port=true
08/22/2025 1:26 AM UTC
1a5ff05c37ccfb1b24e8515f49c41f8dd0c2c12dc57ec0ed232aebee26d67feaRUN
TRITON_VERSION=2.60.0 TRITON_CONTAINER_VERSION=25.08 pip3 install -r python/openai/requirements.txt
08/22/2025 1:26 AM UTC
366bc6796b303dcd9ec95e8e17cf36d1e2862f06d6566c466dd0ae81950a51f5RUN
TRITON_VERSION=2.60.0 TRITON_CONTAINER_VERSION=25.08 find /opt/tritonserver/python -maxdepth 1 -type f -name "tritonserver-*.whl" | xargs -I {} pip install --upgrade {}[all] &&
  find /opt/tritonserver/python -maxdepth 1 -type f -name "tritonfrontend-*.whl" | xargs -I {} pip install --upgrade {}[all]
08/22/2025 1:26 AM UTC
65ab83325b0ad27a790ed9b4d0a574fe86880f4e0d47fff33dfcea6221ba62c2COPY
--chown=1000:1000 NVIDIA_Deep_Learning_Container_License.pdf .
08/22/2025 1:26 AM UTC
25bc87b7e5ae8cfc59db0c081ec6b978e4c90260014be46353bab4fcce06b1f5WORKDIR
/opt/tritonserver
08/22/2025 1:26 AM UTC
0cbcc2f7f59e1689ccb5f3b882c39cea656467857631158556aa12bc06b8a298COPY
--chown=1000:1000 build/install tritonserver
08/22/2025 1:26 AM UTC
59894b706393deee2ad0e7aeae1a898bed4ecb7b71f820ca6589df1a1b94176bWORKDIR
/opt
08/22/2025 1:26 AM UTC
0f1c7ef61c8d63e5f4b3f1a3ff349428dd4e9d5512882cfa3b0acb9d041a577bLABEL
com.nvidia.build.ref=8ced3b40794a0aed058b85333c4c4bb638de5476
08/22/2025 1:26 AM UTC
...

NVIDIA uses cookies to improve your experience on our web site. We and our third-party partners also use cookies and other tools to collect and record information you provide as well as information about your interactions with our websites for performance improvement, analytics, and to assist in marketing efforts. By clicking "Accept All", you consent to our use of cookies and other tools as described in our Cookie Policy. You can manage your cookie settings by clicking on "Manage Settings." By continuing to use this site or by clicking one of the buttons below, you agree to our Terms of Service (which contains important waivers). Please see our Privacy Policy for more information on our privacy practices.