NVIDIA
NVIDIA
TensorRT LLM Release
Container
NVIDIA
NVIDIA
TensorRT LLM Release

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs.

LayerLabelCreated
e4a928adb1f32fe7d94c30a360dacc2543d4e8dcd79bfd77bbe56fe7f1a0dab2CONFIG
Entrypoint /opt/nvidia/nvidia_entrypoint.sh; WorkingDir /app/tensorrt_llm; ExposedPorts 6006/tcp, 8888/tcp
02/23/2026 1:09 AM UTC
ecd2120d7a3dd36bf930036fa90934722e3ededf7c1cf55598bbd5b5f9fe8e4fCOPY
scripts/generate_container_oss_attribution.sh /tmp/generate_container_oss_attribution.sh
02/23/2026 1:09 AM UTC
ff69b80cf5fef4fceae06d4c73d77891651008c18b05461a23e7de12725fc4a4ENV
TRT_LLM_GIT_COMMIT=630fccb3ca072a0560f5ea461d57bdab5faa434b TRT_LLM_VERSION=1.3.0rc5
02/23/2026 1:09 AM UTC
0ebb02187c2caa45b0026c00543b2e2e5954fa3ad8f66b9697111548cf7faa1dARG
TARGETARCH=amd64
02/23/2026 1:09 AM UTC
f8c35b9236b93d49deb2049e6c662160f7bd54a1f15c5d95937ffa1abb1c955cARG
TRT_LLM_VER=1.3.0rc5
02/23/2026 1:09 AM UTC
ce31c81bdd91cbb2a8cb1d2db743a6bae4bfa5661626bcfa491635865cc636d7ARG
GIT_COMMIT=630fccb3ca072a0560f5ea461d57bdab5faa434b
02/23/2026 1:09 AM UTC
128a1ea8b28ec23df5a734c725b50a1b6dfb0e7c330dee48355893d971406e90RUN
SRC_DIR=/src/tensorrt_llm CPP_BUILD_DIR=/src/tensorrt_llm/cpp/build /bin/bash -c chmod -R a+w examples &&
  rm -v benchmarks/cpp/bertBenchmark.cpp benchmarks/cpp/gptManagerBenchmark.cpp benchmarks/cpp/disaggServerBenchmark.cpp benchmarks/cpp/CMakeLists.txt &&
  rm -rf /root/.cache/pip &&
  rm -rf /root/.cache/uv/archive-v0 &&
  rm -rf /usr/local/lib/python3.12/dist-packages/setuptools/_vendor/jaraco.context-5.3.0.dist-info &&
  rm -rf /usr/local/lib/python3.12/dist-packages/setuptools/_vendor/wheel-0.45.1.dist-info
02/23/2026 1:09 AM UTC
d89d70f220d834f17ccc412fdcd6682ac373158f1150a1c5e5ec30e8ae4aad2bCOPY
examples examples
02/23/2026 1:09 AM UTC
796f1393a2c5ade86df5715c8b51c2cfd200ee7d268fe720f097c8600e22beaaCOPY
/src/tensorrt_llm/cpp/build/benchmarks/bertBenchmark /src/tensorrt_llm/cpp/build/benchmarks/gptManagerBenchmark /src/tensorrt_llm/cpp/build/benchmarks/disaggServerBenchmark benchmarks/cpp/
02/23/2026 1:09 AM UTC
5b679c7db4932794f0ca329ebbb667d2601cfb4326741f90cfed3821e6aa22b3ARG
CPP_BUILD_DIR=/src/tensorrt_llm/cpp/build
02/23/2026 1:09 AM UTC
...

NVIDIA uses cookies to improve your experience on our web site. We and our third-party partners also use cookies and other tools to collect and record information you provide as well as information about your interactions with our websites for performance improvement, analytics, and to assist in marketing efforts. By clicking "Accept All", you consent to our use of cookies and other tools as described in our Cookie Policy. You can manage your cookie settings by clicking on "Manage Settings." By continuing to use this site or by clicking one of the buttons below, you agree to our Terms of Service (which contains important waivers). Please see our Privacy Policy for more information on our privacy practices.