NVIDIA
Triton Inference Server
Container

Triton Inference Server is open-source software that lets teams deploy trained AI models from any framework, from local or cloud storage, on any GPU- or CPU-based infrastructure: in the cloud, in the data center, or on embedded devices.

Layer: bf6945d8c4005babc7472c9ddca6ca1c73477fa64b136982a5fd65e27de9dc24 CONFIG
Label: Entrypoint /opt/nvidia/nvidia_entrypoint.sh; WorkingDir /opt/tritonserver
Created: 04/24/2026 12:58 AM UTC

Layer: c2a98b6b12c32ea1677f84ae9869cd14287dc817e324639601ca8af492336bc9 LABEL
Label: com.amazonaws.sagemaker.capabilities.multi-models=true
Created: 04/24/2026 12:58 AM UTC

Layer: e5c2498e2eaca633900acfa55b1fbdda900e9232a68cd78c2555aff928299779 LABEL
Label: com.amazonaws.sagemaker.capabilities.accept-bind-to-port=true
Created: 04/24/2026 12:58 AM UTC

Layer: db7153f59c5fd66ccdeeeae2837a74cd2cde0358f7292931cf93cc334d1ac045 RUN
Label: TRITON_VERSION=2.68.0 TRITON_CONTAINER_VERSION=26.04 pip3 install -r python/openai/requirements.txt
Created: 04/24/2026 12:58 AM UTC

Layer: 4e67dfb62e0dc694903757ee677e4f47bfb1a2e7930387c4f71c53c55b4b05f6 RUN
Label: TRITON_VERSION=2.68.0 TRITON_CONTAINER_VERSION=26.04 find /opt/tritonserver/python -maxdepth 1 -type f -name "tritonserver-*.whl" | xargs -I {} pip install --upgrade {}[all] &&
  find /opt/tritonserver/python -maxdepth 1 -type f -name "tritonfrontend-*.whl" | xargs -I {} pip install --upgrade {}[all]
Created: 04/24/2026 12:58 AM UTC

Layer: a31a77e44f0ed517988d433b81ca214314125946452cb4f9fdec2b1e676a972d COPY
Label: --chown=1000:1000 NVIDIA_Deep_Learning_Container_License.pdf .
Created: 04/24/2026 12:58 AM UTC

Layer: 370c4fa3a001b7401c0d36783aca6c7ce9b5bd6858ce5aea68eb5072bf2a38ea WORKDIR
Label: /opt/tritonserver
Created: 04/24/2026 12:58 AM UTC

Layer: fc1f8bae9ada9f2ed359eb6251af4d92ded7451d3de9ee0806cdf6dbd4c14e96 COPY
Label: --chown=1000:1000 build/install tritonserver
Created: 04/24/2026 12:58 AM UTC

Layer: f96be117034e9ecf9c52df865199b2a7baffa9c4ba494935aa991d5dd311bf8c WORKDIR
Label: /opt
Created: 04/24/2026 12:58 AM UTC

Layer: 9270110521cb57f339238d83093692c616610805793506e2a304fc37ada31db6 LABEL
Label: com.nvidia.build.ref=e17fe849530a72c6bc2a5a78890a26341617affd
Created: 04/24/2026 12:58 AM UTC

...
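
The RUN layer that upgrades the tritonserver and tritonfrontend wheels packs a lot into one shell line, so a rough Python sketch of what it appears to do may help. The sketch only builds the pip command lines and does not run them. The function names (wheel_install_args, build_pip_commands) and the example wheel filenames in the usage section are hypothetical and do not come from the image; only the directory, the two name patterns, and the [all] extra are taken from the layer listing above.

```python
from pathlib import Path
import tempfile


def wheel_install_args(python_dir, prefix, extras="all"):
    """Return pip requirement strings for top-level wheels named <prefix>-*.whl.

    Hypothetical helper: roughly mirrors
    `find <dir> -maxdepth 1 -type f -name "<prefix>-*.whl"`.
    """
    root = Path(python_dir)
    # A glob pattern without "**" does not descend into subdirectories
    # (like -maxdepth 1); is_file() keeps only files (like -type f).
    wheels = sorted(p for p in root.glob(f"{prefix}-*.whl") if p.is_file())
    # Appending "[all]" asks pip to install the wheel with its "all" extra,
    # as the `{}[all]` placeholder does in the xargs call.
    return [f"{p}[{extras}]" for p in wheels]


def build_pip_commands(python_dir):
    """Build one `pip install --upgrade` command per matching wheel.

    tritonserver wheels come first, then tritonfrontend wheels, matching
    the order of the two `find | xargs` pipelines joined by `&&`.
    """
    commands = []
    for prefix in ("tritonserver", "tritonfrontend"):
        for requirement in wheel_install_args(python_dir, prefix):
            commands.append(["pip", "install", "--upgrade", requirement])
    return commands


if __name__ == "__main__":
    # Usage example with made-up wheel filenames in a temporary directory.
    with tempfile.TemporaryDirectory() as tmp:
        for name in ("tritonserver-0.0.0-py3-none-any.whl",
                     "tritonfrontend-0.0.0-py3-none-any.whl"):
            (Path(tmp) / name).touch()
        for command in build_pip_commands(tmp):
            print(" ".join(command))
```

Because the patterns are globs rather than fixed filenames, the same instruction keeps working when the wheel version changes between container releases. Inside the image the directory would be /opt/tritonserver/python, as the layer shows.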