NVIDIA
NVIDIA
TensorRT LLM Release
Container
NVIDIA
NVIDIA
TensorRT LLM Release

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs.

LayerLabelCreated
643277d96b9b98ceb4efb17d30dbb44465dbf1a4c9ac2cbf40d9ac69be721b8aCONFIG
Entrypoint /opt/nvidia/nvidia_entrypoint.sh; WorkingDir /app/tensorrt_llm; ExposedPorts 6006/tcp, 8888/tcp
02/14/2026 5:34 AM UTC
4ed7f3b96022cd009a3f1b96de950a59b8630dde84c05c0bb589e0c2582af634COPY
scripts/generate_container_oss_attribution.sh /tmp/generate_container_oss_attribution.sh
02/14/2026 5:34 AM UTC
175bb8e05651813a284cac7013c2209ea3e23f055f5014edf54af72350317b0bENV
TRT_LLM_GIT_COMMIT=26901e4aa0a51a1863a796999e0476a4b72e5bd5 TRT_LLM_VERSION=1.3.0rc4
02/14/2026 5:34 AM UTC
5fd378ca01db8d8cc15f91e5b6fd33dcdfa440e3a01d79e1bbbc6e588f9a9fd4ARG
TARGETARCH=amd64
02/14/2026 5:34 AM UTC
4a7fa1638e2d55d2e5f45f4784bb93e33644b6b03a198b08c201b0443f7a1490ARG
TRT_LLM_VER=1.3.0rc4
02/14/2026 5:34 AM UTC
551e5afe0ff01240772d6ac4881ef5f1c1cfa986010af75bbd5d9e9e4eb9a1eeARG
GIT_COMMIT=26901e4aa0a51a1863a796999e0476a4b72e5bd5
02/14/2026 5:34 AM UTC
778ef071d25819c5952946dcbf3b8622c06b8ee9303b203fac5b690380936dd4RUN
SRC_DIR=/src/tensorrt_llm CPP_BUILD_DIR=/src/tensorrt_llm/cpp/build /bin/bash -c chmod -R a+w examples &&
  rm -v benchmarks/cpp/bertBenchmark.cpp benchmarks/cpp/gptManagerBenchmark.cpp benchmarks/cpp/disaggServerBenchmark.cpp benchmarks/cpp/CMakeLists.txt &&
  rm -rf /root/.cache/pip &&
  rm -rf /root/.cache/uv/archive-v0 &&
  rm -rf /usr/local/lib/python3.12/dist-packages/setuptools/_vendor/jaraco.context-5.3.0.dist-info &&
  rm -rf /usr/local/lib/python3.12/dist-packages/setuptools/_vendor/wheel-0.45.1.dist-info
02/14/2026 5:34 AM UTC
42836489135df72c06c945239f478a7ce2991fa0afff30ae93e0c21574f296f2COPY
examples examples
02/14/2026 5:34 AM UTC
62f0598fec47c84c306c10082ac17a4193d233fb3faa328f6be5668617812b17COPY
/src/tensorrt_llm/cpp/build/benchmarks/bertBenchmark /src/tensorrt_llm/cpp/build/benchmarks/gptManagerBenchmark /src/tensorrt_llm/cpp/build/benchmarks/disaggServerBenchmark benchmarks/cpp/
02/14/2026 5:34 AM UTC
bb24d5534d8260411f6c52b5f9d495250eee1351eefb22afe42e55967be07d85ARG
CPP_BUILD_DIR=/src/tensorrt_llm/cpp/build
02/14/2026 5:34 AM UTC
...

NVIDIA uses cookies to improve your experience on our web site. We and our third-party partners also use cookies and other tools to collect and record information you provide as well as information about your interactions with our websites for performance improvement, analytics, and to assist in marketing efforts. By clicking "Accept All", you consent to our use of cookies and other tools as described in our Cookie Policy. You can manage your cookie settings by clicking on "Manage Settings." By continuing to use this site or by clicking one of the buttons below, you agree to our Terms of Service (which contains important waivers). Please see our Privacy Policy for more information on our privacy practices.