NVIDIA
NVIDIA
Triton Inference Server
Container
NVIDIA
NVIDIA
Triton Inference Server

Triton Inference Server is an open source software that lets teams deploy trained AI models from any framework, from local or cloud storage and on any GPU- or CPU-based infrastructure in the cloud, data center, or embedded devices.

LayerLabelCreated
79125e827cff3694ce7c02aedc886e0d0a28c2bf84aedbe06558fdfae7e6d41aCONFIG
Entrypoint /opt/nvidia/nvidia_entrypoint.sh; WorkingDir /opt/tritonserver
10/03/2025 5:34 PM UTC
bd61452eb2c0f4983bcaabe980a6a9ff0bd88faabb62db1956185d2c9162095aLABEL
com.amazonaws.sagemaker.capabilities.multi-models=true
10/03/2025 5:34 PM UTC
be508d776ae5f0471d9a6a4eb1981aa1a3c2ef7880784e1ca92fcf4a72a02780LABEL
com.amazonaws.sagemaker.capabilities.accept-bind-to-port=true
10/03/2025 5:34 PM UTC
2a892da1ff3b523d8433d31dd1343f2c91d2fb9ffb045755dd8b322f80ae1443RUN
TRITON_VERSION=2.61.0 TRITON_CONTAINER_VERSION=25.09 BUILD_PUBLIC_VLLM=false PYVER=3.12 pip3 install -r python/openai/requirements.txt
10/03/2025 5:34 PM UTC
c380ac5912b337db3ea8aaba0d8bf8b8a1bdf12c2ccc67ea2c1a768d2b7da772RUN
TRITON_VERSION=2.61.0 TRITON_CONTAINER_VERSION=25.09 BUILD_PUBLIC_VLLM=false PYVER=3.12 find /opt/tritonserver/python -maxdepth 1 -type f -name "tritonserver-*.whl" | xargs -I {} pip install --upgrade {}[all] &&
  find /opt/tritonserver/python -maxdepth 1 -type f -name "tritonfrontend-*.whl" | xargs -I {} pip install --upgrade {}[all]
10/03/2025 5:34 PM UTC
2d9119516a77a54394a627c763d6d2bd4d7e9c9d1cc978adb0aa1949f616cd27COPY
--chown=1000:1000 NVIDIA_Deep_Learning_Container_License.pdf .
10/03/2025 5:34 PM UTC
4ce6cbec2b716092caa48642bf381796ec524b021d1f4a9912b8912509ec0c12WORKDIR
/opt/tritonserver
10/03/2025 5:34 PM UTC
a313f5a9ab9c6a4145cc05fb63942e170eaeab5484adbd7c68285029d30cb662COPY
--chown=1000:1000 build/install tritonserver
10/03/2025 5:34 PM UTC
3eaad0543fee06b0595f83c4ae07726a4969c584f4418786ee89f6a190a5cd69WORKDIR
/opt
10/03/2025 5:34 PM UTC
93f106b4fd9082626169e8effaa8cf9d1fb0f9c51af1ac667804bce10e60a646LABEL
com.nvidia.build.ref=e083729f123ef30ce5afc235aec51c3bcacb5c1a
10/03/2025 5:34 PM UTC
...

NVIDIA uses cookies to improve your experience on our web site. We and our third-party partners also use cookies and other tools to collect and record information you provide as well as information about your interactions with our websites for performance improvement, analytics, and to assist in marketing efforts. By clicking "Accept All", you consent to our use of cookies and other tools as described in our Cookie Policy. You can manage your cookie settings by clicking on "Manage Settings." By continuing to use this site or by clicking one of the buttons below, you agree to our Terms of Service (which contains important waivers). Please see our Privacy Policy for more information on our privacy practices.