NVIDIA
Triton Inference Server
Container

Triton Inference Server is open-source software that lets teams deploy trained AI models from any framework, loaded from local or cloud storage, on any GPU- or CPU-based infrastructure in the cloud, in the data center, or on embedded devices.

Layer: 45f94ec06b1b458fbe9ea467a6404b9c9daf240b14a5c52cb26f234c496557f3
Label: CONFIG Entrypoint /opt/nvidia/nvidia_entrypoint.sh; WorkingDir /opt/tritonserver
Created: 05/19/2026 12:59 AM UTC

Layer: c371fea2dadb843b4daa5d40b4bb233f14dd3e1613df7c8d3ade4b78d7dd5130
Label: COPY --chown=1000:1000 docker/sagemaker/serve /usr/bin/.
Created: 05/19/2026 12:59 AM UTC

Layer: 4c6f61483cef2adffe3d12497feb09cad70d8aba44e980c7d7f15defe51ef688
Label: LABEL com.amazonaws.sagemaker.capabilities.multi-models=true
Created: 05/19/2026 12:59 AM UTC

Layer: 7b962b62b4b20e140edc056873481fbf3cdff53275c433f4e8a993b3008b761e
Label: LABEL com.amazonaws.sagemaker.capabilities.accept-bind-to-port=true
Created: 05/19/2026 12:59 AM UTC

Layer: a0ea46250add700e92cd9190a6c12ff48b7c0fd763560b747680616d93dab4b2
Label: RUN TRITON_VERSION=2.69.0 TRITON_CONTAINER_VERSION=26.05 pip3 install -r python/openai/requirements.txt
Created: 05/19/2026 12:59 AM UTC

Layer: ddf920621186938cc64f158ffa1cedaa2a400d16da4e8305139697bbffa56116
Label: RUN TRITON_VERSION=2.69.0 TRITON_CONTAINER_VERSION=26.05 find /opt/tritonserver/python -maxdepth 1 -type f -name "tritonserver-*.whl" | xargs -I {} pip install --upgrade {}[all] &&
  find /opt/tritonserver/python -maxdepth 1 -type f -name "tritonfrontend-*.whl" | xargs -I {} pip install --upgrade {}[all]
Created: 05/19/2026 12:59 AM UTC

Layer: a855f1a04fc7fcaa9b6003a6dab99ff28bb5ef831cb7a5b8a23a3f4ba88e5a63
Label: COPY --chown=1000:1000 NVIDIA_Deep_Learning_Container_License.pdf .
Created: 05/19/2026 12:59 AM UTC

Layer: 8ca2e1e56370343ec6c8f57910091887f61b07771e69ada7aab1e959a11ef3f6
Label: WORKDIR /opt/tritonserver
Created: 05/19/2026 12:59 AM UTC

Layer: cee526c93b7e2dcee94f251dfc72500c3b9fc3c936da9d318927fa0b9b973708
Label: COPY --chown=1000:1000 build/install tritonserver
Created: 05/19/2026 12:59 AM UTC

Layer: f618583197e39ac13f346d5cb177c5060ba222c065ded12860e73e13195c87b9
Label: WORKDIR /opt
Created: 05/19/2026 12:59 AM UTC
...
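
The entries in the listing above resemble Dockerfile instructions. As a rough illustration only, the Python sketch below renders three of them back into Dockerfile-like lines. It assumes the listing is ordered newest-first, which the page does not state; the `Layer` type and the `render_dockerfile` helper are hypothetical names introduced here, not part of any NVIDIA or Docker tooling.

```python
from typing import List, NamedTuple


class Layer(NamedTuple):
    digest: str       # 64-character hex identifier from the "Layer" column
    instruction: str  # keyword from the "Label" column (COPY, WORKDIR, ...)
    arguments: str    # remainder of the "Label" column


# Three entries copied from the listing, in the order the page shows them.
LAYERS = [
    Layer("8ca2e1e56370343ec6c8f57910091887f61b07771e69ada7aab1e959a11ef3f6",
          "WORKDIR", "/opt/tritonserver"),
    Layer("cee526c93b7e2dcee94f251dfc72500c3b9fc3c936da9d318927fa0b9b973708",
          "COPY", "--chown=1000:1000 build/install tritonserver"),
    Layer("f618583197e39ac13f346d5cb177c5060ba222c065ded12860e73e13195c87b9",
          "WORKDIR", "/opt"),
]


def render_dockerfile(layers: List[Layer], newest_first: bool = True) -> str:
    """Join layer entries into Dockerfile-like lines, oldest instruction first.

    newest_first=True is an assumption about how the page orders its rows.
    """
    ordered = list(reversed(layers)) if newest_first else list(layers)
    return "\n".join(f"{layer.instruction} {layer.arguments}" for layer in ordered)


if __name__ == "__main__":
    print(render_dockerfile(LAYERS))
```

Read in that order, the three entries would set the working directory to `/opt`, copy `build/install` into `tritonserver`, and then switch into `/opt/tritonserver`. This is a reading of the listing under the ordering assumption, not the actual build file.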
