NVIDIA
NVIDIA
Triton Inference Server
Container
NVIDIA
NVIDIA
Triton Inference Server

Triton Inference Server is an open source software that lets teams deploy trained AI models from any framework, from local or cloud storage and on any GPU- or CPU-based infrastructure in the cloud, data center, or embedded devices.

LayerLabelCreated
4f7f17da5fb978c691024d076ccbbc19e2f19f660ebda69921d0a12821e103d4CONFIG
Entrypoint /opt/nvidia/nvidia_entrypoint.sh; WorkingDir /opt/tritonserver
04/24/2026 1:06 AM UTC
dc92e1726d6de8a7a31cdf5a56c80d5286cb8f2b6264e29c3468795ed457b4f7LABEL
com.amazonaws.sagemaker.capabilities.multi-models=true
04/24/2026 1:06 AM UTC
1919ceee09db8a3b05f212fcd6795401d88fc730b4bfacb6a01252cececaa92bLABEL
com.amazonaws.sagemaker.capabilities.accept-bind-to-port=true
04/24/2026 1:06 AM UTC
b99651815e68e1732e823e6d489126c78a3e9b3cb1c159e5ee26509df682c86aRUN
TRITON_VERSION=2.68.0 TRITON_CONTAINER_VERSION=26.04 pip3 install -r python/openai/requirements.txt
04/24/2026 1:06 AM UTC
d4883b6e5a1a10808f4a425c4f311d07b0431572df58a0adef022d2887f53f3bRUN
TRITON_VERSION=2.68.0 TRITON_CONTAINER_VERSION=26.04 find /opt/tritonserver/python -maxdepth 1 -type f -name "tritonserver-*.whl" | xargs -I {} pip install --upgrade {}[all] &&
  find /opt/tritonserver/python -maxdepth 1 -type f -name "tritonfrontend-*.whl" | xargs -I {} pip install --upgrade {}[all]
04/24/2026 1:06 AM UTC
0c5248df08a56264d28c9237b1d054f825c4b661b0f18d4b71340ced84956f80COPY
--chown=1000:1000 NVIDIA_Deep_Learning_Container_License.pdf .
04/24/2026 1:06 AM UTC
6e42671286ac35b9ba6140f349d09b35153411a3073aa218cb5753e641310418WORKDIR
/opt/tritonserver
04/24/2026 1:06 AM UTC
b951b527f0e4e660885cd21d912966a9eda7405831c9969a06848501bdc564efCOPY
--chown=1000:1000 build/install tritonserver
04/24/2026 1:06 AM UTC
eff556be1e9e754a2f357193a09863ca112fb6a237d3321adddd6ff2275391e5WORKDIR
/opt
04/24/2026 1:06 AM UTC
f55016189d05b9b36626be70983c353ffd97870e3f450d44380291e16079e253LABEL
com.nvidia.build.ref=e17fe849530a72c6bc2a5a78890a26341617affd
04/24/2026 1:06 AM UTC
...

NVIDIA uses cookies to improve your experience on our web site. We and our third-party partners also use cookies and other tools to collect and record information you provide as well as information about your interactions with our websites for performance improvement, analytics, and to assist in marketing efforts. By clicking "Accept All", you consent to our use of cookies and other tools as described in our Cookie Policy. You can manage your cookie settings by clicking on "Manage Settings." By continuing to use this site or by clicking one of the buttons below, you agree to our Terms of Service (which contains important waivers). Please see our Privacy Policy for more information on our privacy practices.