NVIDIA
NVIDIA
TensorRT LLM Release
Container
NVIDIA
NVIDIA
TensorRT LLM Release

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs.

LayerLabelCreated
ee43fb97dface2beea6df17c4bcf850ba445cacfcae819de65f15c09b00cf6feCONFIG
Entrypoint /opt/nvidia/nvidia_entrypoint.sh; WorkingDir /app/tensorrt_llm; ExposedPorts 6006/tcp, 8888/tcp
04/17/2026 9:38 PM UTC
9a3ac671364400741521bcac6b5044bd64e8983507773efcf7605376b5eb1d5cCOPY
scripts/generate_container_oss_attribution.sh /tmp/generate_container_oss_attribution.sh
04/17/2026 9:38 PM UTC
c756ed7708cff8bbe74f8bce8a4e9fd61acccc5ddbc6303043b2637af8894199ENV
TRT_LLM_GIT_COMMIT=92292b897ff958c987bfe786927bb3271e3baed4 TRT_LLM_VERSION=1.3.0rc5.post2
04/17/2026 9:38 PM UTC
edb8ddbed280c4d290713b8105961b76d117c4c2a15ff39d56e763e87b6bee2cARG
TARGETARCH=amd64
04/17/2026 9:38 PM UTC
3b09b692d0678e1cdbcb4257a073566a76a822bf3762577515c78cd93f13d0fdARG
TRT_LLM_VER=1.3.0rc5.post2
04/17/2026 9:38 PM UTC
a66e6a3f127e34f0aced9411531d30e222b39d79847c29f6b7f35c6c5428e2e2ARG
GIT_COMMIT=92292b897ff958c987bfe786927bb3271e3baed4
04/17/2026 9:38 PM UTC
0c912639f2729a1b2f48362951898308e9c2eed4ee1063691fc99a53d9aafba7RUN
SRC_DIR=/src/tensorrt_llm CPP_BUILD_DIR=/src/tensorrt_llm/cpp/build /bin/bash -c chmod -R a+w examples &&
  rm -v benchmarks/cpp/bertBenchmark.cpp benchmarks/cpp/gptManagerBenchmark.cpp benchmarks/cpp/disaggServerBenchmark.cpp benchmarks/cpp/CMakeLists.txt &&
  rm -rf /root/.cache/pip &&
  rm -rf /root/.cache/uv/archive-v0 &&
  rm -rf /usr/local/lib/python3.12/dist-packages/setuptools/_vendor/jaraco.context-5.3.0.dist-info &&
  rm -rf /usr/local/lib/python3.12/dist-packages/setuptools/_vendor/wheel-0.45.1.dist-info
04/17/2026 9:38 PM UTC
7d6f128081cb412a08c13d5f7495ae207684fd6f4ba02fd9674f01f28054d341COPY
examples examples
04/17/2026 9:38 PM UTC
f7398eee59f08425e44f2199785345f927cb2659d1b0668ac276f64d021896c3COPY
/src/tensorrt_llm/cpp/build/benchmarks/bertBenchmark /src/tensorrt_llm/cpp/build/benchmarks/gptManagerBenchmark /src/tensorrt_llm/cpp/build/benchmarks/disaggServerBenchmark benchmarks/cpp/
04/17/2026 9:38 PM UTC
33043f600f6c67c353f8a18180a4750e409b60e7369dd993bbde7f99921c7a81ARG
CPP_BUILD_DIR=/src/tensorrt_llm/cpp/build
04/17/2026 9:38 PM UTC
...

NVIDIA uses cookies to improve your experience on our web site. We and our third-party partners also use cookies and other tools to collect and record information you provide as well as information about your interactions with our websites for performance improvement, analytics, and to assist in marketing efforts. By clicking "Accept All", you consent to our use of cookies and other tools as described in our Cookie Policy. You can manage your cookie settings by clicking on "Manage Settings." By continuing to use this site or by clicking one of the buttons below, you agree to our Terms of Service (which contains important waivers). Please see our Privacy Policy for more information on our privacy practices.