NVIDIA
NVIDIA
Triton Inference Server
Container
NVIDIA
NVIDIA
Triton Inference Server

Triton Inference Server is an open source software that lets teams deploy trained AI models from any framework, from local or cloud storage and on any GPU- or CPU-based infrastructure in the cloud, data center, or embedded devices.

LayerLabelCreated
b4e2bfe2273b03602cc7b8875ed91fcaa1fa9333ae6a1882b1fa9810ad949941CONFIG
Entrypoint /opt/nvidia/nvidia_entrypoint.sh; WorkingDir /opt/tritonserver
03/18/2026 6:57 PM UTC
fae4332f06c61640463de1b4ca5c964b6720ce23bcd29ce145b0559b40d058caLABEL
com.amazonaws.sagemaker.capabilities.multi-models=true
03/18/2026 6:57 PM UTC
64fe52c5726c6f4f7a9c221cae4e688f828b7b8225f998a44808f36c77815e15LABEL
com.amazonaws.sagemaker.capabilities.accept-bind-to-port=true
03/18/2026 6:57 PM UTC
d5cf8048e6fe887eb2eda63d09fe8f1f38423f3895c4b0c777584a9c43f43ae6RUN
TRITON_VERSION=2.67.0 TRITON_CONTAINER_VERSION=26.03 pip3 install -r python/openai/requirements.txt
03/18/2026 6:57 PM UTC
6ef0477f12d2db22d0fcb15753756e03ab9919dad4c2ebef2895174af111b2dfRUN
TRITON_VERSION=2.67.0 TRITON_CONTAINER_VERSION=26.03 find /opt/tritonserver/python -maxdepth 1 -type f -name "tritonserver-*.whl" | xargs -I {} pip install --upgrade {}[all] &&
  find /opt/tritonserver/python -maxdepth 1 -type f -name "tritonfrontend-*.whl" | xargs -I {} pip install --upgrade {}[all]
03/18/2026 6:57 PM UTC
f135c4dc8e15d9ad01e664860bc57068ba33fc6bb4096b4defa35a55308d1181COPY
--chown=1000:1000 NVIDIA_Deep_Learning_Container_License.pdf .
03/18/2026 6:57 PM UTC
085ba064af2fb3dac05a1a7cac47949e4f6771cb4c23371beead1e2e29a1c07eWORKDIR
/opt/tritonserver
03/18/2026 6:57 PM UTC
b733826eb981999b126cc43b2e5022764415c9909538e61c482387546f139e63COPY
--chown=1000:1000 build/install tritonserver
03/18/2026 6:57 PM UTC
83fd3eb0a9014582f99773c9ccd6a8ae49e51febe05dbd620aa9f464d545821aWORKDIR
/opt
03/18/2026 6:56 PM UTC
f43e5e9eb70ecd207ffc6a020b722be4ad3dced3c3d0367ca95f08c84e8a502fLABEL
com.nvidia.build.ref=9ea64b3c1d6b4bb2e5f683a07d90ba4362524124
03/18/2026 6:56 PM UTC
...