NVIDIA
NVIDIA
Triton Inference Server
Container
NVIDIA
NVIDIA
Triton Inference Server

Triton Inference Server is an open source software that lets teams deploy trained AI models from any framework, from local or cloud storage and on any GPU- or CPU-based infrastructure in the cloud, data center, or embedded devices.

LayerLabelCreated
d78c2dabbdcf145252a36899226595d1353192b04a74783e1bb189bcd7a9bfecCONFIG
Entrypoint /opt/nvidia/nvidia_entrypoint.sh; WorkingDir /opt/tritonserver
03/18/2026 5:36 PM UTC
24b8a039c9162f2938675e7e8685ea1bb58da250d71a6361ab4571167aa3d45aCOPY
--chown=1000:1000 docker/sagemaker/serve /usr/bin/.
03/18/2026 5:36 PM UTC
30030123152fae9c5adab4f192dc26e0aeb2fe9f09118e728c384e8fc4af4e71LABEL
com.amazonaws.sagemaker.capabilities.multi-models=true
03/18/2026 5:36 PM UTC
9b3b3a484fbcf29d9f4869d05bcc08c018dba8c9169a76f4951e140f938cf10bLABEL
com.amazonaws.sagemaker.capabilities.accept-bind-to-port=true
03/18/2026 5:36 PM UTC
efc7b16285e16df88a2701cd6f0f7fa4ee1c5dbb62ae0030b275aac560e51558RUN
TRITON_VERSION=2.67.0 TRITON_CONTAINER_VERSION=26.03 pip3 install -r python/openai/requirements.txt
03/18/2026 5:36 PM UTC
2fe50df5f3a4810b081191cd94c0f67ea30f21c4350f6b1d7fc52e1e070c10ebRUN
TRITON_VERSION=2.67.0 TRITON_CONTAINER_VERSION=26.03 find /opt/tritonserver/python -maxdepth 1 -type f -name "tritonserver-*.whl" | xargs -I {} pip install --upgrade {}[all] &&
  find /opt/tritonserver/python -maxdepth 1 -type f -name "tritonfrontend-*.whl" | xargs -I {} pip install --upgrade {}[all]
03/18/2026 5:36 PM UTC
e84054b41c34ae69a82a8801c927a552af55ed8e8c5239a51604930c17d331d9COPY
--chown=1000:1000 NVIDIA_Deep_Learning_Container_License.pdf .
03/18/2026 5:36 PM UTC
fef947164638c62f58785809ee9ecdb2836a16883c0b325955bfafb985f646dfWORKDIR
/opt/tritonserver
03/18/2026 5:36 PM UTC
e569041a0ba35557ee2171bc94e4ea0b86da78e2580369a7eaf8f217c162b014COPY
--chown=1000:1000 build/install tritonserver
03/18/2026 5:36 PM UTC
1e253e81c2ffe166786cb77ab2a5bd841f64d607c9018b2f51db6d9b310dacb2WORKDIR
/opt
03/18/2026 5:36 PM UTC
...

NVIDIA uses cookies to improve your experience on our web site. We and our third-party partners also use cookies and other tools to collect and record information you provide as well as information about your interactions with our websites for performance improvement, analytics, and to assist in marketing efforts. By clicking "Accept All", you consent to our use of cookies and other tools as described in our Cookie Policy. You can manage your cookie settings by clicking on "Manage Settings." By continuing to use this site or by clicking one of the buttons below, you agree to our Terms of Service (which contains important waivers). Please see our Privacy Policy for more information on our privacy practices.