NVIDIA
TensorRT LLM Release
Container

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs.

Layer | Label | Created

082393e2cd74b739a230b5bf831dce1e94e4664b2731992900b2185bb2f667bb | CONFIG | 05/31/2026 5:39 PM UTC
  Entrypoint /opt/nvidia/nvidia_entrypoint.sh; WorkingDir /app/tensorrt_llm; ExposedPorts 6006/tcp, 8888/tcp

a594dceab806200d6747d21d19840ed7b478d0d4e18d8447c81f5c5bd80c28be | COPY | 05/31/2026 5:39 PM UTC
  scripts/generate_container_oss_attribution.sh /tmp/generate_container_oss_attribution.sh

aecddf2138b0c9f15ebee6f2a69f8323e73b3c7617e209f569a1187823ed1e0f | ENV | 05/31/2026 5:39 PM UTC
  TRT_LLM_GIT_COMMIT=34a563ac6d8cc0ca7068c7f619e869fb8a625333 TRT_LLM_VERSION=1.3.0rc15.post1

3982381622c933de7526cfc43cdb011fc9f3f15eef5be2ca162e07802fca200f | ARG | 05/31/2026 5:39 PM UTC
  TARGETARCH=amd64

a95402be3b1071cd31e3448c83e6c8a17787f94f48412fa97d7a96ca597e2f8b | ARG | 05/31/2026 5:39 PM UTC
  TRT_LLM_VER=1.3.0rc15.post1

f8db712f52c9455ebf78806c11ba1f880f1a0bb7961a7be2b74e68812bc8bae1 | ARG | 05/31/2026 5:39 PM UTC
  GIT_COMMIT=34a563ac6d8cc0ca7068c7f619e869fb8a625333

f76ade5553dc76fba2c08c62648bea198db84816e2eb267aa433d88c4a6ab44c | RUN | 05/31/2026 5:39 PM UTC
  SRC_DIR=/src/tensorrt_llm CPP_BUILD_DIR=/src/tensorrt_llm/cpp/build /bin/bash -c chmod -R a+w examples &&
    rm -v benchmarks/cpp/bertBenchmark.cpp benchmarks/cpp/gptManagerBenchmark.cpp benchmarks/cpp/disaggServerBenchmark.cpp benchmarks/cpp/CMakeLists.txt &&
    rm -rf /root/.cache/uv/archive-v0 &&
    rm -rf /usr/local/lib/python3.12/dist-packages/setuptools/_vendor/jaraco.context-5.3.0.dist-info &&
    rm -rf /usr/local/lib/python3.12/dist-packages/setuptools/_vendor/wheel-0.45.1.dist-info

1cde9bc5c3f9b3b88e9da220a7535a4eeaf4d4fce2d54bb3e6e9eed94880f5f7 | COPY | 05/31/2026 5:39 PM UTC
  examples examples

8adc86bac27501fe8e5e5fc651be63620a08ec75bf073ddfd96a4be193b7c6db | COPY | 05/31/2026 5:39 PM UTC
  /src/tensorrt_llm/cpp/build/benchmarks/bertBenchmark /src/tensorrt_llm/cpp/build/benchmarks/gptManagerBenchmark /src/tensorrt_llm/cpp/build/benchmarks/disaggServerBenchmark benchmarks/cpp/

9b18d6fbee430d863778aed04c85bd4c09f4ff95b28b72a8e2266fdf08b86a23 | ARG | 05/31/2026 5:39 PM UTC
  CPP_BUILD_DIR=/src/tensorrt_llm/cpp/build
...
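
The ENV layer listed above sets TRT_LLM_VERSION and TRT_LLM_GIT_COMMIT, so a process running inside the container should be able to read its build identity from the environment. The following is a minimal sketch of that idea, not part of TensorRT LLM itself: the names BuildInfo and read_build_info are hypothetical, and the regular expression is an assumption that covers only version strings shaped like the one shown here (release digits, an optional rcN, an optional .postN), not every valid Python version string.

```python
import os
import re
from typing import Mapping, NamedTuple, Optional, Tuple

# Narrow, illustrative pattern (an assumption, not a full version parser).
_VERSION_RE = re.compile(
    r"^(?P<release>\d+(?:\.\d+)*)"   # release segment, e.g. 1.3.0
    r"(?:rc(?P<rc>\d+))?"            # optional release candidate, e.g. rc15
    r"(?:\.post(?P<post>\d+))?$"     # optional post-release, e.g. .post1
)


class BuildInfo(NamedTuple):
    """Hypothetical container for the build identity of the image."""
    version: str
    release: Tuple[int, ...]
    rc: Optional[int]
    post: Optional[int]
    git_commit: str


def read_build_info(env: Mapping[str, str] = os.environ) -> BuildInfo:
    """Read the two variables set by the ENV layer and split the version."""
    version = env["TRT_LLM_VERSION"]        # KeyError if run outside the image
    git_commit = env["TRT_LLM_GIT_COMMIT"]

    match = _VERSION_RE.match(version)
    if match is None:
        raise ValueError(f"unrecognized version format: {version!r}")

    release = tuple(int(part) for part in match.group("release").split("."))
    rc = int(match.group("rc")) if match.group("rc") else None
    post = int(match.group("post")) if match.group("post") else None
    return BuildInfo(version, release, rc, post, git_commit)
```

Called with no arguments inside the container, read_build_info() reads the live environment. Passing a plain dict instead makes the function usable outside the image, for example to check that a deployment manifest pins the expected commit.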
