NVIDIA
Triton Inference Server
Container

Triton Inference Server is open-source software that lets teams deploy trained AI models from any framework, from local or cloud storage, on any GPU- or CPU-based infrastructure in the cloud, in the data center, or on embedded devices.

Layer: 0b8643c0ccc2f224bc4785221ff27df3216aa559e69088e837a63433851a6d0f
Label: CONFIG Entrypoint /opt/nvidia/nvidia_entrypoint.sh; WorkingDir /opt/tritonserver
Created: 04/24/2026 1:06 AM UTC

Layer: 413865a987463456fad3287ff417ef0d6e133a751a1cf69a61b184461811f87b
Label: COPY --chown=1000:1000 docker/sagemaker/serve /usr/bin/.
Created: 04/24/2026 1:06 AM UTC

Layer: e3a6c33b866c8b78d95c7f88964347e45fc689cd3e305e2b33bab222131e5600
Label: LABEL com.amazonaws.sagemaker.capabilities.multi-models=true
Created: 04/24/2026 1:06 AM UTC

Layer: 567f8f37814d5ba7bf906b8d1bb281d4cf8bb969c4be67a2b479b3f1dfd50a48
Label: LABEL com.amazonaws.sagemaker.capabilities.accept-bind-to-port=true
Created: 04/24/2026 1:06 AM UTC

Layer: 7bf3b02c2ab7787862c0d5588c280b7b9ebd3f03d1459f71dcb3262173354355
Label: RUN TRITON_VERSION=2.68.0 TRITON_CONTAINER_VERSION=26.04 pip3 install -r python/openai/requirements.txt
Created: 04/24/2026 1:06 AM UTC

Layer: 1a5842b0208f0126f37763e4110a5cc81d09141ed95eea10cfb8d90796077200
Label: RUN TRITON_VERSION=2.68.0 TRITON_CONTAINER_VERSION=26.04 find /opt/tritonserver/python -maxdepth 1 -type f -name "tritonserver-*.whl" | xargs -I {} pip install --upgrade {}[all] &&
  find /opt/tritonserver/python -maxdepth 1 -type f -name "tritonfrontend-*.whl" | xargs -I {} pip install --upgrade {}[all]
Created: 04/24/2026 1:06 AM UTC

Layer: 0db6918d2faa1e0952af76787a807aa52748d5d0c93a6bc964566bfff2a033e5
Label: COPY --chown=1000:1000 NVIDIA_Deep_Learning_Container_License.pdf .
Created: 04/24/2026 1:06 AM UTC

Layer: 8d2d91730c1dba0484ca8682ee4b164dd304d78026a758659b31e214436ccea2
Label: WORKDIR /opt/tritonserver
Created: 04/24/2026 1:06 AM UTC

Layer: c222d3f3899ac1dd52448584b74212e7f267fbdf428fe685a68d4c99a7b7fd5c
Label: COPY --chown=1000:1000 build/install tritonserver
Created: 04/24/2026 1:06 AM UTC

Layer: c36e7d750d4d578b46bf762d9a8af882f5ba1fc6fd409f42e3702bbc3f701c51
Label: WORKDIR /opt
Created: 04/24/2026 1:06 AM UTC
...
