NVIDIA TensorRT LLM Release
Container

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs.

Layers (fields: Layer, Label, Created)

Layer: 5a81927069464d94ac95b699c373bdc53b9652e78dc6baeb9abd96c4f81084bc
Label: CONFIG
  Entrypoint /opt/nvidia/nvidia_entrypoint.sh; WorkingDir /app/tensorrt_llm; ExposedPorts 6006/tcp, 8888/tcp
Created: 03/22/2026 3:21 PM UTC

Layer: ebd9bb4f5a119abadc46d38cdd77e5ef4144f39e45b69d85e35dc9a054115f0a
Label: COPY
  scripts/generate_container_oss_attribution.sh /tmp/generate_container_oss_attribution.sh
Created: 03/22/2026 3:21 PM UTC

Layer: 9d9724aa08b487f92d6657e648d1b119b64616180738e2111a16889dfc4f8c06
Label: ENV
  TRT_LLM_GIT_COMMIT=33533348196f236dc291ad573710184858501f79 TRT_LLM_VERSION=1.3.0rc9
Created: 03/22/2026 3:21 PM UTC

Layer: 969c4d09f7f5e4021bd61b9c5206887435ec8eb9d3ef455e297881296dd5d9dc
Label: ARG
  TARGETARCH=amd64
Created: 03/22/2026 3:21 PM UTC

Layer: 01dd8ff88c0d98f61f5fa7caac84a19b05b9fbb3bf4c487c68604bb66aa8cfb6
Label: ARG
  TRT_LLM_VER=1.3.0rc9
Created: 03/22/2026 3:21 PM UTC

Layer: b8cf5bece8467d2978fc632bfbcbecaadedffa944fa605fc4a75098f46ede1eb
Label: ARG
  GIT_COMMIT=33533348196f236dc291ad573710184858501f79
Created: 03/22/2026 3:21 PM UTC

Layer: c6e9c27434699a2af00ab3d192be63a541b002314257561bcaa8a670a2bfe6ac
Label: RUN
  SRC_DIR=/src/tensorrt_llm CPP_BUILD_DIR=/src/tensorrt_llm/cpp/build /bin/bash -c chmod -R a+w examples &&
    rm -v benchmarks/cpp/bertBenchmark.cpp benchmarks/cpp/gptManagerBenchmark.cpp benchmarks/cpp/disaggServerBenchmark.cpp benchmarks/cpp/CMakeLists.txt &&
    rm -rf /root/.cache/uv/archive-v0 &&
    rm -rf /usr/local/lib/python3.12/dist-packages/setuptools/_vendor/jaraco.context-5.3.0.dist-info &&
    rm -rf /usr/local/lib/python3.12/dist-packages/setuptools/_vendor/wheel-0.45.1.dist-info
Created: 03/22/2026 3:21 PM UTC

Layer: 9ef34687032bca876caaf58f66694db98339570554c136029286e92ea42ee387
Label: COPY
  examples examples
Created: 03/22/2026 3:21 PM UTC

Layer: 53e368998e75e412afd2ad95af8d28ffd871bc56eb045dea1c7c3d2c79041071
Label: COPY
  /src/tensorrt_llm/cpp/build/benchmarks/bertBenchmark /src/tensorrt_llm/cpp/build/benchmarks/gptManagerBenchmark /src/tensorrt_llm/cpp/build/benchmarks/disaggServerBenchmark benchmarks/cpp/
Created: 03/22/2026 3:21 PM UTC

Layer: 7fb528c328f41e15e3e46d639a6dbbed00188bf081f7e2c5ac02b0ac2b2d62a5
Label: ARG
  CPP_BUILD_DIR=/src/tensorrt_llm/cpp/build
Created: 03/22/2026 3:21 PM UTC
...
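
The ENV and ARG layers listed above repeat the same release version and Git commit under different variable names. The following is a minimal sketch, not part of the image or of any NVIDIA tooling, of how one might cross-check those values after copying them out of the listing. The names `LAYERS`, `parse_assignments`, and `collect_variables` are hypothetical, and the sketch assumes each ENV or ARG entry is a space-separated run of KEY=VALUE tokens with no spaces inside the values.

```python
# Layer entries copied from the listing above as (label, content) pairs.
# Only the ENV and ARG layers are included; this list is an illustration,
# not something the image itself provides.
LAYERS = [
    ("ENV", "TRT_LLM_GIT_COMMIT=33533348196f236dc291ad573710184858501f79 "
            "TRT_LLM_VERSION=1.3.0rc9"),
    ("ARG", "TARGETARCH=amd64"),
    ("ARG", "TRT_LLM_VER=1.3.0rc9"),
    ("ARG", "GIT_COMMIT=33533348196f236dc291ad573710184858501f79"),
    ("ARG", "CPP_BUILD_DIR=/src/tensorrt_llm/cpp/build"),
]


def parse_assignments(content):
    """Split a space-separated run of KEY=VALUE tokens into a dict."""
    pairs = {}
    for token in content.split():
        key, sep, value = token.partition("=")
        if sep:  # skip tokens that are not assignments
            pairs[key] = value
    return pairs


def collect_variables(layers):
    """Merge the KEY=VALUE pairs of all ENV and ARG layers, grouped by label."""
    merged = {"ENV": {}, "ARG": {}}
    for label, content in layers:
        if label in merged:
            merged[label].update(parse_assignments(content))
    return merged


variables = collect_variables(LAYERS)
env, args = variables["ENV"], variables["ARG"]

# The runtime environment and the build arguments should agree.
version_matches = env["TRT_LLM_VERSION"] == args["TRT_LLM_VER"]
commit_matches = env["TRT_LLM_GIT_COMMIT"] == args["GIT_COMMIT"]
print(f"version consistent: {version_matches}, commit consistent: {commit_matches}")
```

A check like this is only as complete as the entries fed into it; the listing is truncated, so layers that are not shown here are not covered.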