NVIDIA
TensorRT LLM Release
Container

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs.

Layer: 7b745e1fb9779553fd7c8539b4acb30c85a609671ea19d8570704cdc2e54b08a
Label: CONFIG Entrypoint /opt/nvidia/nvidia_entrypoint.sh; WorkingDir /app/tensorrt_llm; ExposedPorts 6006/tcp, 8888/tcp
Created: 03/09/2026 7:56 PM UTC

Layer: 37e5b4c0e377f3c14d9bdae10139942f35ccf88d13996275cb9612352e92baa2
Label: COPY scripts/generate_container_oss_attribution.sh /tmp/generate_container_oss_attribution.sh
Created: 03/09/2026 7:56 PM UTC

Layer: d21e2b7345fbaefdc9f2ae8b2bbd376008a8e42e5a55621c22a9ee6100e8dcd4
Label: ENV TRT_LLM_GIT_COMMIT=69de4a60e7db73177e30bb80f333e6a35091a3dd TRT_LLM_VERSION=1.3.0rc7
Created: 03/09/2026 7:56 PM UTC

Layer: 6b1c4d50cd1ec862601644fd2d7dc1c28664f180032a37f8b18896a8965fcb17
Label: ARG TARGETARCH=amd64
Created: 03/09/2026 7:56 PM UTC

Layer: d0031c648c4fea10f71df4d774f2497bb7ce1296256641266ff9299e9b73f6df
Label: ARG TRT_LLM_VER=1.3.0rc7
Created: 03/09/2026 7:56 PM UTC

Layer: b620e7cfd087bcf05a09b0f89acf8cce700e7457063e5d2db52725411127a766
Label: ARG GIT_COMMIT=69de4a60e7db73177e30bb80f333e6a35091a3dd
Created: 03/09/2026 7:56 PM UTC

Layer: 56e168c5264e66304fc2da5693a50f4ccf0ae41b3533ff6306c85670f1695157
Label: RUN SRC_DIR=/src/tensorrt_llm CPP_BUILD_DIR=/src/tensorrt_llm/cpp/build /bin/bash -c chmod -R a+w examples &&
  rm -v benchmarks/cpp/bertBenchmark.cpp benchmarks/cpp/gptManagerBenchmark.cpp benchmarks/cpp/disaggServerBenchmark.cpp benchmarks/cpp/CMakeLists.txt &&
  rm -rf /root/.cache/pip &&
  rm -rf /root/.cache/uv/archive-v0 &&
  rm -rf /usr/local/lib/python3.12/dist-packages/setuptools/_vendor/jaraco.context-5.3.0.dist-info &&
  rm -rf /usr/local/lib/python3.12/dist-packages/setuptools/_vendor/wheel-0.45.1.dist-info
Created: 03/09/2026 7:56 PM UTC

Layer: 02b0c9d85d28e81b15cdf1ea2ba91ead8895b655800fdcc10d6d5d3959bd7454
Label: COPY examples examples
Created: 03/09/2026 7:56 PM UTC

Layer: 96901131ec54d84788c72a6fea5fb91ebaa882057c5fe3280b366987d893bc4f
Label: COPY /src/tensorrt_llm/cpp/build/benchmarks/bertBenchmark /src/tensorrt_llm/cpp/build/benchmarks/gptManagerBenchmark /src/tensorrt_llm/cpp/build/benchmarks/disaggServerBenchmark benchmarks/cpp/
Created: 03/09/2026 7:56 PM UTC

Layer: 03452be73dab0ac8b6934707e4653e8cffb9f5c6e3f9ab349afd5cc3e3a61bc3
Label: ARG CPP_BUILD_DIR=/src/tensorrt_llm/cpp/build
Created: 03/09/2026 7:56 PM UTC
...
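
The ENV layer in the listing sets TRT_LLM_VERSION and TRT_LLM_GIT_COMMIT, so a process running inside the container should be able to read the build identity from its environment. The following is a minimal sketch of that check, not part of the image: the helper name `describe_build` and the 12-character commit abbreviation are illustrative assumptions, and only the two variable names come from the listing.

```python
import os


def describe_build(env=None):
    """Summarize the TensorRT LLM build from environment variables.

    `describe_build` is a hypothetical helper for illustration; only the
    variable names TRT_LLM_VERSION and TRT_LLM_GIT_COMMIT come from the
    ENV layer shown in the listing.
    """
    if env is None:
        env = os.environ
    # Fall back to "unknown" when run outside the container.
    version = env.get("TRT_LLM_VERSION", "unknown")
    commit = env.get("TRT_LLM_GIT_COMMIT", "unknown")
    # Abbreviate the commit hash to 12 characters (an arbitrary choice).
    return f"TensorRT LLM {version} (commit {commit[:12]})"


if __name__ == "__main__":
    print(describe_build())
```

Run inside the container, this would presumably report the version and commit recorded in the ENV layer; outside it, both fields fall back to "unknown".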