NVIDIA
NVIDIA
TensorRT LLM Release
Container
NVIDIA
NVIDIA
TensorRT LLM Release

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs.

LayerLabelCreated
21f9f9a23422e4e9078d4646eed0fdce59387990ccfd353521517c5c3b2477efCONFIG
Entrypoint /opt/nvidia/nvidia_entrypoint.sh; WorkingDir /app/tensorrt_llm; ExposedPorts 6006/tcp, 8888/tcp
04/08/2026 2:42 AM UTC
f69c5d69e2826ffcf94ed935c7bce7f2e14558b8a046c007149d2f4b00136ad5COPY
scripts/generate_container_oss_attribution.sh /tmp/generate_container_oss_attribution.sh
04/08/2026 2:42 AM UTC
3204c0adca2cd6c4028bbceeff78f77eaaf988a539bf1a9526cb090e1ee0160aENV
TRT_LLM_GIT_COMMIT=4e69c14f732a6e6afce4f71616db5b5cd2b10530 TRT_LLM_VERSION=1.3.0rc11
04/08/2026 2:42 AM UTC
cb94f5d887ab10e986e06fcb192c20b41b7edc3bb355ba4a7e6be6730d37e287ARG
TARGETARCH=amd64
04/08/2026 2:42 AM UTC
d768456a01ff06bc8a116c856871bb5cda8b5eafc56852c882a37611de9603bfARG
TRT_LLM_VER=1.3.0rc11
04/08/2026 2:42 AM UTC
4f8f551c19e8177ba874dd7c70eb07c442638bc6cc81679cf20a7db3e4db0917ARG
GIT_COMMIT=4e69c14f732a6e6afce4f71616db5b5cd2b10530
04/08/2026 2:42 AM UTC
e1d507c6bacf622a487e1e3ddc84ccbc5fde4a3ef7284fdf69c159a07d4f975cRUN
SRC_DIR=/src/tensorrt_llm CPP_BUILD_DIR=/src/tensorrt_llm/cpp/build /bin/bash -c chmod -R a+w examples &&
  rm -v benchmarks/cpp/bertBenchmark.cpp benchmarks/cpp/gptManagerBenchmark.cpp benchmarks/cpp/disaggServerBenchmark.cpp benchmarks/cpp/CMakeLists.txt &&
  rm -rf /root/.cache/uv/archive-v0 &&
  rm -rf /usr/local/lib/python3.12/dist-packages/setuptools/_vendor/jaraco.context-5.3.0.dist-info &&
  rm -rf /usr/local/lib/python3.12/dist-packages/setuptools/_vendor/wheel-0.45.1.dist-info
04/08/2026 2:42 AM UTC
1ccee22aafe5aa70816093b68560d52c5b2d80a09f8a541c760162bfc875c5d0COPY
examples examples
04/08/2026 2:42 AM UTC
2a67018d95ed737cc76fbeb75fcd36ecb81eb8dfbca31e4699756b8fe5fe34b2COPY
/src/tensorrt_llm/cpp/build/benchmarks/bertBenchmark /src/tensorrt_llm/cpp/build/benchmarks/gptManagerBenchmark /src/tensorrt_llm/cpp/build/benchmarks/disaggServerBenchmark benchmarks/cpp/
04/08/2026 2:42 AM UTC
380d5dcce7606f470525b97865e45f8b33f39631ef019f50288727d5ed7d1240ARG
CPP_BUILD_DIR=/src/tensorrt_llm/cpp/build
04/08/2026 2:42 AM UTC
...

NVIDIA uses cookies to improve your experience on our web site. We and our third-party partners also use cookies and other tools to collect and record information you provide as well as information about your interactions with our websites for performance improvement, analytics, and to assist in marketing efforts. By clicking "Accept All", you consent to our use of cookies and other tools as described in our Cookie Policy. You can manage your cookie settings by clicking on "Manage Settings." By continuing to use this site or by clicking one of the buttons below, you agree to our Terms of Service (which contains important waivers). Please see our Privacy Policy for more information on our privacy practices.