NVIDIA
NVIDIA
TensorRT LLM Release
Container
NVIDIA
NVIDIA
TensorRT LLM Release

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs.

LayerLabelCreated
8c2d2e39c46885d629518fcfbb706f294943147e2e072e6e0365d7098358dc28CONFIG
Entrypoint /opt/nvidia/nvidia_entrypoint.sh; WorkingDir /app/tensorrt_llm; ExposedPorts 6006/tcp, 8888/tcp
04/26/2026 4:13 AM UTC
862fcd29a27a3c618d4b281e1365765ab943e6c1a766ffc5258e1f2a58bbe90fCOPY
scripts/generate_container_oss_attribution.sh /tmp/generate_container_oss_attribution.sh
04/26/2026 4:13 AM UTC
1e77c0a8ca57b9a6b2b2fbb6bd352ddfee44bbd246ba00a1ea38db816cee2195ENV
TRT_LLM_GIT_COMMIT=b9ce4b69d12fe5ba65d13893111b1a2ea29413ee TRT_LLM_VERSION=1.3.0rc13
04/26/2026 4:13 AM UTC
4b87a43c0294204ebca492df0aa947cb0a3ce0973d3979d1ccb3fd7331396265ARG
TARGETARCH=amd64
04/26/2026 4:13 AM UTC
4014e4d51b7190949d8406f87932a745d8ffd2b1249871ea2db0768cc1a8f836ARG
TRT_LLM_VER=1.3.0rc13
04/26/2026 4:13 AM UTC
6155be58e76b04c4346d2af1beee063eb1666ebda375f83bddc53c6aa75ea48aARG
GIT_COMMIT=b9ce4b69d12fe5ba65d13893111b1a2ea29413ee
04/26/2026 4:13 AM UTC
eb78975ec846ab0a277463cb86c1c944454054884fc0a1de17fd0cedd8a90f2bRUN
SRC_DIR=/src/tensorrt_llm CPP_BUILD_DIR=/src/tensorrt_llm/cpp/build /bin/bash -c chmod -R a+w examples &&
  rm -v benchmarks/cpp/bertBenchmark.cpp benchmarks/cpp/gptManagerBenchmark.cpp benchmarks/cpp/disaggServerBenchmark.cpp benchmarks/cpp/CMakeLists.txt &&
  rm -rf /root/.cache/uv/archive-v0 &&
  rm -rf /usr/local/lib/python3.12/dist-packages/setuptools/_vendor/jaraco.context-5.3.0.dist-info &&
  rm -rf /usr/local/lib/python3.12/dist-packages/setuptools/_vendor/wheel-0.45.1.dist-info
04/26/2026 4:13 AM UTC
e82f3303fbd3d33015895468310282c57ce3ae4c6d82b1e1c2c14db9151f1869COPY
examples examples
04/26/2026 4:13 AM UTC
8a52e91a0d62dd292ad0989ac66ca38f52d20f119b66a2d13987e2f7cf5a6f81COPY
/src/tensorrt_llm/cpp/build/benchmarks/bertBenchmark /src/tensorrt_llm/cpp/build/benchmarks/gptManagerBenchmark /src/tensorrt_llm/cpp/build/benchmarks/disaggServerBenchmark benchmarks/cpp/
04/26/2026 4:13 AM UTC
11eb9f3fab752c7e7264c3a83166434bd17f15a7834164fc4620919e0c093af4ARG
CPP_BUILD_DIR=/src/tensorrt_llm/cpp/build
04/26/2026 4:13 AM UTC
...

NVIDIA uses cookies to improve your experience on our web site. We and our third-party partners also use cookies and other tools to collect and record information you provide as well as information about your interactions with our websites for performance improvement, analytics, and to assist in marketing efforts. By clicking "Accept All", you consent to our use of cookies and other tools as described in our Cookie Policy. You can manage your cookie settings by clicking on "Manage Settings." By continuing to use this site or by clicking one of the buttons below, you agree to our Terms of Service (which contains important waivers). Please see our Privacy Policy for more information on our privacy practices.