NVIDIA
TensorRT LLM Release
Container

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs.

| Layer | Label | Created |
| --- | --- | --- |
| 3121e323a6f976a8eb28cacaf8d710bb07e724a8d7d5291434a3bb5c08fd9404 | CONFIG Entrypoint /opt/nvidia/nvidia_entrypoint.sh; WorkingDir /app/tensorrt_llm; ExposedPorts 6006/tcp, 8888/tcp | 02/04/2026 5:32 AM UTC |
| fc76883b89d0acd43a505a76dd3b9ed3fc5feb594a56c2a9cb455f5a50b2651e | COPY scripts/generate_container_oss_attribution.sh /tmp/generate_container_oss_attribution.sh | 02/04/2026 5:32 AM UTC |
| 32855dd5efd9411f20fc3b0876e9bd1f246d1247880eed2338ac0990ab5b22d6 | ENV TRT_LLM_GIT_COMMIT=7c6df0e19f92c7eba91fe58d67d64e968aa909e5 TRT_LLM_VERSION=1.2.0rc6.post3 | 02/04/2026 5:32 AM UTC |
| b1105fec3c2c8cafcb4aad7d4af4ddd36a4d70a5822c8dc878444e905b3a71de | ARG TARGETARCH=amd64 | 02/04/2026 5:32 AM UTC |
| fc6e6dfb076d5f186406eb2a35c9af0809f7d3c8261deebe39bfec630babe55e | ARG TRT_LLM_VER=1.2.0rc6.post3 | 02/04/2026 5:32 AM UTC |
| 318fe6de9ace34a5b37e6fa4af79e63af79023a574c42ff0a66628efba58e0cc | ARG GIT_COMMIT=7c6df0e19f92c7eba91fe58d67d64e968aa909e5 | 02/04/2026 5:32 AM UTC |
| fd765a4567d33da515d0cb4beb5a9426559ae9fc94628ca16d159a0fdb46b943 | RUN SRC_DIR=/src/tensorrt_llm CPP_BUILD_DIR=/src/tensorrt_llm/cpp/build /bin/bash -c chmod -R a+w examples && rm -v benchmarks/cpp/bertBenchmark.cpp benchmarks/cpp/gptManagerBenchmark.cpp benchmarks/cpp/disaggServerBenchmark.cpp benchmarks/cpp/CMakeLists.txt && rm -rf /root/.cache/pip | 02/04/2026 5:32 AM UTC |
| 877741bb33bf85ad96b6ac8c8ab20c19ae52faace000df849887667196a5f993 | COPY examples examples | 02/04/2026 5:32 AM UTC |
| 910f457577f82370e2a327ba7f9c2632553786ca8433f92187a0610e28f19d56 | COPY /src/tensorrt_llm/cpp/build/benchmarks/bertBenchmark /src/tensorrt_llm/cpp/build/benchmarks/gptManagerBenchmark /src/tensorrt_llm/cpp/build/benchmarks/disaggServerBenchmark benchmarks/cpp/ | 02/04/2026 5:32 AM UTC |
| 7bf5915670dff51e6d3e4bf2e69cc829c506808758558b3db0a6d1e9126cfd05 | ARG CPP_BUILD_DIR=/src/tensorrt_llm/cpp/build | 02/04/2026 5:32 AM UTC |
...
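
The ENV layer in the listing above records TRT_LLM_GIT_COMMIT and TRT_LLM_VERSION, so a process running inside the container could compare its environment against those values. The following is a minimal sketch of such a check, not part of the image itself: the function name `check_build_env` and the `EXPECTED` mapping are hypothetical, and the sketch assumes the variables are still visible to the process and have not been overridden at run time.

```python
import os

# Values copied from the ENV layer in the listing above.
EXPECTED = {
    "TRT_LLM_VERSION": "1.2.0rc6.post3",
    "TRT_LLM_GIT_COMMIT": "7c6df0e19f92c7eba91fe58d67d64e968aa909e5",
}


def check_build_env(environ=os.environ, expected=EXPECTED):
    """Return {name: (expected, actual)} for each variable that does not match.

    `actual` is None when the variable is not set at all.
    """
    mismatches = {}
    for name, want in expected.items():
        got = environ.get(name)
        if got != want:
            mismatches[name] = (want, got)
    return mismatches


if __name__ == "__main__":
    problems = check_build_env()
    if not problems:
        print("environment matches the listed ENV layer")
    for name, (want, got) in problems.items():
        print(f"{name}: expected {want!r}, found {got!r}")
```

Passing a plain dictionary instead of `os.environ` makes the check usable outside the container, for example against values parsed from an image inspection.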
