NVIDIA
NVIDIA
Triton Inference Server
Container
NVIDIA
NVIDIA
Triton Inference Server

Triton Inference Server is an open source software that lets teams deploy trained AI models from any framework, from local or cloud storage and on any GPU- or CPU-based infrastructure in the cloud, data center, or embedded devices.

LayerLabelCreated
73915b0cd659c0ce2a501352bf064e2f15484e607c9d39b5b80a009ef1a66122CONFIG
Entrypoint /opt/nvidia/nvidia_entrypoint.sh; WorkingDir /opt/tritonserver
10/03/2025 5:51 PM UTC
b81a6435653925b8bd2f5ecc2a1b40ca721cbcfcf553eb3e947955c75475fe1bLABEL
com.amazonaws.sagemaker.capabilities.multi-models=true
10/03/2025 5:51 PM UTC
e67a1572a947332e6a7bc1cce089bb967776e6330e0c052afade819e79021530LABEL
com.amazonaws.sagemaker.capabilities.accept-bind-to-port=true
10/03/2025 5:51 PM UTC
bfef3a8b589b2b76b458339677cc9e0bad926cabbe7c3fac6fc61174173937abRUN
TRITON_VERSION=2.61.0 TRITON_CONTAINER_VERSION=25.09 pip3 install -r python/openai/requirements.txt
10/03/2025 5:51 PM UTC
3357e7336754cbb6361ade3c610e3452d6a7ee09299da254e6aaa58b114239deRUN
TRITON_VERSION=2.61.0 TRITON_CONTAINER_VERSION=25.09 find /opt/tritonserver/python -maxdepth 1 -type f -name "tritonserver-*.whl" | xargs -I {} pip install --upgrade {}[all] &&
  find /opt/tritonserver/python -maxdepth 1 -type f -name "tritonfrontend-*.whl" | xargs -I {} pip install --upgrade {}[all]
10/03/2025 5:51 PM UTC
7a1c74454967129de53f0bd16b6116979b6eb745bfde8b92f68a6756628aaea5COPY
--chown=1000:1000 NVIDIA_Deep_Learning_Container_License.pdf .
10/03/2025 5:51 PM UTC
9cc7cfb9fd187efc55d595a23e4bbf42f25dd90ff5f806afd61acdae2d694e84WORKDIR
/opt/tritonserver
10/03/2025 5:51 PM UTC
fd748adfbc6a38cf4587d7ea1edcf9e0484efeedf4fe4bb9495e06ca899acf7bCOPY
--chown=1000:1000 build/install tritonserver
10/03/2025 5:51 PM UTC
8f053c8f8c12598fe3a8d4702e968791ad69cb954737ca5bc669a6e3a9084077WORKDIR
/opt
10/03/2025 5:51 PM UTC
aa5ccc435f465849f946a9ed1391db5b7acc3f002d79db9d37ce85ff7955feb4LABEL
com.nvidia.build.ref=e083729f123ef30ce5afc235aec51c3bcacb5c1a
10/03/2025 5:51 PM UTC
...

NVIDIA uses cookies to improve your experience on our web site. We and our third-party partners also use cookies and other tools to collect and record information you provide as well as information about your interactions with our websites for performance improvement, analytics, and to assist in marketing efforts. By clicking "Accept All", you consent to our use of cookies and other tools as described in our Cookie Policy. You can manage your cookie settings by clicking on "Manage Settings." By continuing to use this site or by clicking one of the buttons below, you agree to our Terms of Service (which contains important waivers). Please see our Privacy Policy for more information on our privacy practices.