NVIDIA
NVIDIA
TensorRT LLM Release
Container
NVIDIA
NVIDIA
TensorRT LLM Release

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs.

LayerLabelCreated
31af4301eeb1e2c4f16f28b1791ba64b6604d8c85359e5404c84d539e2e87c97CONFIG
Entrypoint /opt/nvidia/nvidia_entrypoint.sh; WorkingDir /app/tensorrt_llm; ExposedPorts 6006/tcp, 8888/tcp
01/18/2026 7:57 AM UTC
aeefb3bf60c60f6b18a13272adbd1abe75690d73809c0af83b11d645258315e3COPY
scripts/generate_container_oss_attribution.sh /tmp/generate_container_oss_attribution.sh
01/18/2026 7:57 AM UTC
e5da5fd3088071f41a08b2aedd3da3f4e6d78c98f1b2a0b5cb689b64dba27308ENV
TRT_LLM_GIT_COMMIT=0af1a0e4789eed690f91d2efde77f09ad35d1805 TRT_LLM_VERSION=1.3.0rc0
01/18/2026 7:57 AM UTC
c6be3d582b463e00d53f04e3962e090f33e6ddeee86dc6259039d5643f24c745ARG
TARGETARCH=amd64
01/18/2026 7:57 AM UTC
e5f845d698a97a43c9c241166aff189e8d23ad16f9c282233586624e520beffbARG
TRT_LLM_VER=1.3.0rc0
01/18/2026 7:57 AM UTC
b325249223ff19d016fc1a1c613924050fe33e1f66568aeb541aadc29b529c99ARG
GIT_COMMIT=0af1a0e4789eed690f91d2efde77f09ad35d1805
01/18/2026 7:57 AM UTC
7e0c5880f022005a00e517560fa2507299d89c09a4648f6d883fc7f43b5c0224RUN
SRC_DIR=/src/tensorrt_llm CPP_BUILD_DIR=/src/tensorrt_llm/cpp/build /bin/bash -c chmod -R a+w examples &&
  rm -v benchmarks/cpp/bertBenchmark.cpp benchmarks/cpp/gptManagerBenchmark.cpp benchmarks/cpp/disaggServerBenchmark.cpp benchmarks/cpp/CMakeLists.txt &&
  rm -rf /root/.cache/pip
01/18/2026 7:57 AM UTC
98004209d83df94a9849214c84cdc54945b7ea2ea86d5130502dbf5d19484339COPY
examples examples
01/18/2026 7:57 AM UTC
884aaf02036a674cffba9908b3dc29299afd39f44524b3373e30ddac1902545bCOPY
/src/tensorrt_llm/cpp/build/benchmarks/bertBenchmark /src/tensorrt_llm/cpp/build/benchmarks/gptManagerBenchmark /src/tensorrt_llm/cpp/build/benchmarks/disaggServerBenchmark benchmarks/cpp/
01/18/2026 7:57 AM UTC
0eed567fd17eda947745b13ebccfe0cc082c3d90053a8076c8c9d724ff12305bARG
CPP_BUILD_DIR=/src/tensorrt_llm/cpp/build
01/18/2026 7:57 AM UTC
...

NVIDIA uses cookies to improve your experience on our web site. We and our third-party partners also use cookies and other tools to collect and record information you provide as well as information about your interactions with our websites for performance improvement, analytics, and to assist in marketing efforts. By clicking "Accept All", you consent to our use of cookies and other tools as described in our Cookie Policy. You can manage your cookie settings by clicking on "Manage Settings." By continuing to use this site or by clicking one of the buttons below, you agree to our Terms of Service (which contains important waivers). Please see our Privacy Policy for more information on our privacy practices.