NVIDIA
NVIDIA
Triton Inference Server
Container
NVIDIA
NVIDIA
Triton Inference Server

Triton Inference Server is an open source software that lets teams deploy trained AI models from any framework, from local or cloud storage and on any GPU- or CPU-based infrastructure in the cloud, data center, or embedded devices.

LayerLabelCreated
60419c22f560ee8a847a58bb3fb7301cc4e5e7d835c63b8563aa9715384a65ccCONFIG
Entrypoint /opt/nvidia/nvidia_entrypoint.sh; WorkingDir /opt/tritonserver
03/18/2026 5:32 PM UTC
b35caadd0d545f31347dfec09b2bafcb5816009fa812af780f321ddc1e5cfe4dLABEL
com.amazonaws.sagemaker.capabilities.multi-models=true
03/18/2026 5:32 PM UTC
ab3d8153712ed1e10f9577f99864a5ca2c3c7e498ff713aa1a84089e81d0b11eLABEL
com.amazonaws.sagemaker.capabilities.accept-bind-to-port=true
03/18/2026 5:32 PM UTC
7f72b23e9a21ca9e437db44141b4876054d30ed3b54ac55c929858edfd7f6995RUN
TRITON_VERSION=2.67.0 TRITON_CONTAINER_VERSION=26.03 pip3 install -r python/openai/requirements.txt
03/18/2026 5:32 PM UTC
af2aabbea19eac9c2a48e3d50ec674e8fa07595f8590772eea55b5f250058519RUN
TRITON_VERSION=2.67.0 TRITON_CONTAINER_VERSION=26.03 find /opt/tritonserver/python -maxdepth 1 -type f -name "tritonserver-*.whl" | xargs -I {} pip install --upgrade {}[all] &&
  find /opt/tritonserver/python -maxdepth 1 -type f -name "tritonfrontend-*.whl" | xargs -I {} pip install --upgrade {}[all]
03/18/2026 5:32 PM UTC
a2d56bad2dee0ac95ef2c72d1417933462affb91f15bb8659c55be05f6fc0daaCOPY
--chown=1000:1000 NVIDIA_Deep_Learning_Container_License.pdf .
03/18/2026 5:32 PM UTC
7da40801bd4c100b80a0d24b716247e0361f5c77206d0552ce580f1357e64d2dWORKDIR
/opt/tritonserver
03/18/2026 5:32 PM UTC
4a8cb77d464694f345e36b9b2b02bd93c65b2429c4172492c4291b8f85d9b7e7COPY
--chown=1000:1000 build/install tritonserver
03/18/2026 5:32 PM UTC
55eabeb654b474b4928a73d66d202e491254ca0cc65ccfcd1015432375b097c1WORKDIR
/opt
03/18/2026 5:32 PM UTC
7f0b80aa316510588cc695e293a8b6c99707671cc69f6b403823546ac4832c6cLABEL
com.nvidia.build.ref=9ea64b3c1d6b4bb2e5f683a07d90ba4362524124
03/18/2026 5:32 PM UTC
...

NVIDIA uses cookies to improve your experience on our web site. We and our third-party partners also use cookies and other tools to collect and record information you provide as well as information about your interactions with our websites for performance improvement, analytics, and to assist in marketing efforts. By clicking "Accept All", you consent to our use of cookies and other tools as described in our Cookie Policy. You can manage your cookie settings by clicking on "Manage Settings." By continuing to use this site or by clicking one of the buttons below, you agree to our Terms of Service (which contains important waivers). Please see our Privacy Policy for more information on our privacy practices.