NVIDIA
TensorRT LLM Release
Container

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs.
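
As a rough illustration of the Python API mentioned above, the following sketch generates text through the `LLM` and `SamplingParams` classes exported by the `tensorrt_llm` package. The helper name `generate_samples`, the example model identifier, and the sampling values are assumptions chosen for illustration, not taken from this page. Running it requires an NVIDIA GPU and an environment, such as this container, where `tensorrt_llm` is installed.

```python
def generate_samples(model, prompts, max_tokens=32):
    """Generate one completion per prompt with the TensorRT LLM Python API.

    `model` is a Hugging Face model ID or a local checkpoint path
    (hypothetical input; choose a model your GPU can hold).
    """
    # Imported lazily so this file can be loaded on machines without a GPU.
    from tensorrt_llm import LLM, SamplingParams

    # Sampling values are illustrative assumptions, not recommended defaults.
    sampling_params = SamplingParams(temperature=0.8, top_p=0.95,
                                     max_tokens=max_tokens)
    llm = LLM(model=model)
    outputs = llm.generate(prompts, sampling_params)
    # Each result carries the original prompt and a list of completions.
    return [(out.prompt, out.outputs[0].text) for out in outputs]


# Example call (assumed model name; run inside the container on a GPU host):
# generate_samples("TinyLlama/TinyLlama-1.1B-Chat-v1.0", ["Hello, my name is"])
```

The import sits inside the function so that the module can be loaded for inspection without the library present; only the call itself needs the full runtime.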

| Layer | Instruction | Label | Created |
| --- | --- | --- | --- |
| b640d0e0f4646d6a9a1e2f0f706a517875c6e600c3d5958a35ffc20b49f9ace9 | CONFIG | Entrypoint /opt/nvidia/nvidia_entrypoint.sh; WorkingDir /app/tensorrt_llm; ExposedPorts 6006/tcp, 8888/tcp | 01/07/2026 10:22 AM UTC |
| 046c23c89c672fac01b5bd8279ae2722a444acc0fdc92a7ac0a985c252fa4459 | COPY | scripts/generate_container_oss_attribution.sh /tmp/generate_container_oss_attribution.sh | 01/07/2026 10:22 AM UTC |
| f298f76974e9085a6b14c072e3279d813d62987e8a349a55ef0b00466f18cb8f | ENV | TRT_LLM_GIT_COMMIT=e4a6c9995dacf66bab4410475a6774152f95a0a6 TRT_LLM_VERSION=1.2.0rc6.post1 | 01/07/2026 10:22 AM UTC |
| 87d5dbd6ac3e932f82e5c2b0d0d8d2c95e0bb145ff030ecf36c3fef851590f34 | ARG | TARGETARCH=amd64 | 01/07/2026 10:22 AM UTC |
| d6729b12caf1a61bbb8de8aebf7866a289a74a4fc14ae86f930126792570ca3c | ARG | TRT_LLM_VER=1.2.0rc6.post1 | 01/07/2026 10:22 AM UTC |
| 6427d80ce126509643e79f00cfa30983180b1bd50ef69ae64d8ac311d86bfbd7 | ARG | GIT_COMMIT=e4a6c9995dacf66bab4410475a6774152f95a0a6 | 01/07/2026 10:22 AM UTC |
| 77c341e0aa40cfabff4592fe5924d66c80521d2ffea0397a409db5f42bd8fd50 | RUN | SRC_DIR=/src/tensorrt_llm CPP_BUILD_DIR=/src/tensorrt_llm/cpp/build /bin/bash -c chmod -R a+w examples && rm -v benchmarks/cpp/bertBenchmark.cpp benchmarks/cpp/gptManagerBenchmark.cpp benchmarks/cpp/disaggServerBenchmark.cpp benchmarks/cpp/CMakeLists.txt && rm -rf /root/.cache/pip | 01/07/2026 10:22 AM UTC |
| cfffcb6195a39b3ea10704322434f222002a9b5f8982cde513061987b8866bc3 | COPY | examples examples | 01/07/2026 10:22 AM UTC |
| b17be70f634d968e733d4b369c094add469e7b78fab86ce20bbd2b1ff8ea9b27 | COPY | /src/tensorrt_llm/cpp/build/benchmarks/bertBenchmark /src/tensorrt_llm/cpp/build/benchmarks/gptManagerBenchmark /src/tensorrt_llm/cpp/build/benchmarks/disaggServerBenchmark benchmarks/cpp/ | 01/07/2026 10:22 AM UTC |
| 0317718366773a28f68b871fdde74ee1c73d4825dd0fb1621915acd548226c02 | ARG | CPP_BUILD_DIR=/src/tensorrt_llm/cpp/build | 01/07/2026 10:22 AM UTC |
...
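
The ENV and ARG layers listed above record the same release version and Git commit under different variable names. The sketch below shows one way to cross-check them after copying the values out of the listing. The helper names `parse_assignments` and `build_metadata_is_consistent` are hypothetical and exist only for this illustration. The strings are transcribed from the listed layers and would need to be replaced for any other image.

```python
# Values transcribed from the ENV and ARG layers in the listing.
ENV_LAYER = (
    "TRT_LLM_GIT_COMMIT=e4a6c9995dacf66bab4410475a6774152f95a0a6 "
    "TRT_LLM_VERSION=1.2.0rc6.post1"
)
ARG_LAYERS = [
    "TARGETARCH=amd64",
    "TRT_LLM_VER=1.2.0rc6.post1",
    "GIT_COMMIT=e4a6c9995dacf66bab4410475a6774152f95a0a6",
    "CPP_BUILD_DIR=/src/tensorrt_llm/cpp/build",
]


def parse_assignments(text):
    """Split whitespace-separated KEY=VALUE pairs into a dict."""
    return dict(item.split("=", 1) for item in text.split())


def build_metadata_is_consistent(env, args):
    """Return True when the ENV and ARG layers agree on version and commit."""
    return (
        env["TRT_LLM_VERSION"] == args["TRT_LLM_VER"]
        and env["TRT_LLM_GIT_COMMIT"] == args["GIT_COMMIT"]
    )


env = parse_assignments(ENV_LAYER)
args = parse_assignments(" ".join(ARG_LAYERS))
print(build_metadata_is_consistent(env, args))  # prints True for these values
```

Because the listing is truncated, this check covers only the layers shown. Later layers could set further variables that are not visible here.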
