NVIDIA
Triton Inference Server
Container

Triton Inference Server is open-source software that lets teams deploy trained AI models from any framework, loaded from local or cloud storage, on any GPU- or CPU-based infrastructure, whether in the cloud, in the data center, or on embedded devices.

Layer:   968230f1a836fbf2357ca654029647b7f06ef9a83a692eb99af0f9f368af0635
Label:   CONFIG Entrypoint /opt/nvidia/nvidia_entrypoint.sh; WorkingDir /opt/tritonserver
Created: 06/18/2025 9:55 PM UTC

Layer:   17af644e5b0051360b8111cd1fbcb5361097f563843f68fc02722916eaa177bb
Label:   LABEL com.amazonaws.sagemaker.capabilities.multi-models=true
Created: 06/18/2025 9:55 PM UTC

Layer:   27e971b900714759398f79f899e4cb2269637f2de06207299842173bee7309e9
Label:   LABEL com.amazonaws.sagemaker.capabilities.accept-bind-to-port=true
Created: 06/18/2025 9:55 PM UTC

Layer:   4c21f1022274b4a45015a2d88eaa0bda8ded43e27b08747683bca9b8598ac292
Label:   RUN TRITON_VERSION=2.59.0 TRITON_CONTAINER_VERSION=25.06 pip3 install -r python/openai/requirements.txt
Created: 06/18/2025 9:55 PM UTC

Layer:   00b18478d2dd1dd28b219361821e89cdbeacbe46d49d8ca9a3d2def7ecb427fc
Label:   RUN TRITON_VERSION=2.59.0 TRITON_CONTAINER_VERSION=25.06 find /opt/tritonserver/python -maxdepth 1 -type f -name "tritonserver-*.whl" | xargs -I {} pip install --upgrade {}[all] &&
           find /opt/tritonserver/python -maxdepth 1 -type f -name "tritonfrontend-*.whl" | xargs -I {} pip install --upgrade {}[all]
Created: 06/18/2025 9:55 PM UTC

Layer:   08a5e4a55ecf207790e5dee1265cc716fd0826a0ae85efdd6b86cbe53a43b725
Label:   COPY --chown=1000:1000 NVIDIA_Deep_Learning_Container_License.pdf .
Created: 06/18/2025 9:55 PM UTC

Layer:   0ed435ec5fb80141e6f87d39c1fb10a88bb9d04c50d66f01b68bbeb3ab749f41
Label:   WORKDIR /opt/tritonserver
Created: 06/18/2025 9:55 PM UTC

Layer:   0aa6da994e89e408abab18ab8fe7367fc32edcfbcf36447ab9608bc75f0508ad
Label:   COPY --chown=1000:1000 build/install tritonserver
Created: 06/18/2025 9:55 PM UTC

Layer:   4a81f9169d9d7e97446880e4236681fcd64005a721d2ac98d1f582395f80de30
Label:   WORKDIR /opt
Created: 06/18/2025 9:55 PM UTC

Layer:   e12f0dc0d204fe908e172a49f22dd32501dfba417da8eba6557531588abad391
Label:   LABEL com.nvidia.build.ref=7a14b7925e0195aab82a319261751f73c3e2369b
Created: 06/18/2025 9:55 PM UTC
...
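
The RUN layer above that installs the tritonserver and tritonfrontend wheels pipes the output of find into xargs, so that each matching wheel in /opt/tritonserver/python is upgraded with its [all] extras. As a rough illustration of that selection logic only, the following Python sketch builds the equivalent pip argument lists without running them. The function names wheel_install_specs and pip_commands are hypothetical helpers introduced here; they are not part of Triton or of the container image.

```python
from pathlib import Path


def wheel_install_specs(directory, prefix):
    """Return pip requirement specs for wheels located directly in `directory`.

    Approximates: find <directory> -maxdepth 1 -type f -name "<prefix>-*.whl"
    and then appends the "[all]" extras marker, as the xargs step does.
    """
    root = Path(directory)
    # A glob pattern without "**" does not recurse, which matches -maxdepth 1.
    # Symlinks are skipped because `find -type f` matches regular files only.
    wheels = sorted(
        p for p in root.glob(f"{prefix}-*.whl")
        if p.is_file() and not p.is_symlink()
    )
    return [f"{p}[all]" for p in wheels]


def pip_commands(directory, prefixes=("tritonserver", "tritonfrontend")):
    """Build one `pip install --upgrade` argument list per matching wheel."""
    return [
        ["pip", "install", "--upgrade", spec]
        for prefix in prefixes
        for spec in wheel_install_specs(directory, prefix)
    ]
```

Calling pip_commands("/opt/tritonserver/python") inside the container would list the commands the layer effectively runs; passing each list to subprocess.run would execute them. Returning argument lists rather than a shell string avoids quoting problems with the square brackets in the [all] suffix.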
