NVIDIA
TensorRT LLM Release
Container

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs.
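
As a rough illustration of the Python API mentioned above, the following sketch wraps a single generation call in a helper. The helper name `generate_sample`, the default model identifier, and the sampling values are assumptions chosen for illustration and are not taken from this page. The import is done lazily because `tensorrt_llm` is only expected to be importable inside the container or a matching installation with a supported NVIDIA GPU.

```python
def generate_sample(prompts, model="TinyLlama/TinyLlama-1.1B-Chat-v1.0", max_tokens=32):
    """Run prompts through the TensorRT LLM Python API and return the generated texts.

    Hypothetical helper: the model name and sampling values are illustrative defaults.
    """
    # Lazy import: the package is assumed to be present only inside the container.
    from tensorrt_llm import LLM, SamplingParams

    # Sampling settings for generation (values picked for illustration).
    sampling = SamplingParams(temperature=0.8, top_p=0.95, max_tokens=max_tokens)

    # Load the model; this may download weights and build an engine on first use.
    llm = LLM(model=model)

    # Each result carries one or more candidate completions; keep the first.
    outputs = llm.generate(prompts, sampling)
    return [out.outputs[0].text for out in outputs]
```

Inside the container, a call such as `generate_sample(["The capital of France is"])` would be expected to return a list with one completion string, assuming the model can be downloaded and fits on the available GPU.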

Layer: f73b8fd35bce81cb916ff8acf0c2ed1237cbb429cb1748973c8f3ed8fb76e047
Label: CONFIG Entrypoint /opt/nvidia/nvidia_entrypoint.sh; WorkingDir /app/tensorrt_llm; ExposedPorts 6006/tcp, 8888/tcp
Created: 01/11/2026 2:46 PM UTC

Layer: bb8899af74b074d245d98688fd74edc6061ca67358b22b7031d240ce21ca5dd0
Label: COPY scripts/generate_container_oss_attribution.sh /tmp/generate_container_oss_attribution.sh
Created: 01/11/2026 2:46 PM UTC

Layer: dd1dfc3d0c8a91bd430168fb0e689ac0f7decf687265886bea47c5edc6523c85
Label: ENV TRT_LLM_GIT_COMMIT=80649a8b78b2fbe9ac4fa9b004fcceb251ba7c2e TRT_LLM_VERSION=1.2.0rc8
Created: 01/11/2026 2:46 PM UTC

Layer: 91a562dcedccd535bee9540166a62910feb3aeb83b9aedd4f27944f3c79566dd
Label: ARG TARGETARCH=amd64
Created: 01/11/2026 2:46 PM UTC

Layer: ccd727f5b3e834217d25a13b3c11cfd194e5b180420e7f5e1e07742e11e67a94
Label: ARG TRT_LLM_VER=1.2.0rc8
Created: 01/11/2026 2:46 PM UTC

Layer: ec9424fd1f9dd91ab2b8394240bf8fbed3beeea3537622e8c70c875e49f3f14d
Label: ARG GIT_COMMIT=80649a8b78b2fbe9ac4fa9b004fcceb251ba7c2e
Created: 01/11/2026 2:46 PM UTC

Layer: 830083e67adcf21f345e7927ec1393ac2aac879aefdbba87772dd9016e8f099a
Label: RUN SRC_DIR=/src/tensorrt_llm CPP_BUILD_DIR=/src/tensorrt_llm/cpp/build /bin/bash -c chmod -R a+w examples &&
  rm -v benchmarks/cpp/bertBenchmark.cpp benchmarks/cpp/gptManagerBenchmark.cpp benchmarks/cpp/disaggServerBenchmark.cpp benchmarks/cpp/CMakeLists.txt &&
  rm -rf /root/.cache/pip
Created: 01/11/2026 2:46 PM UTC

Layer: 4519216a8ab8ae2e599c0bffbf396d53097e2bcdcebf43ebfdba498ef9b4ebb2
Label: COPY examples examples
Created: 01/11/2026 2:46 PM UTC

Layer: b24e2bbd200c1d9d2387aa7cd725322605187ebb617e9d94d41d3c52f70628fa
Label: COPY /src/tensorrt_llm/cpp/build/benchmarks/bertBenchmark /src/tensorrt_llm/cpp/build/benchmarks/gptManagerBenchmark /src/tensorrt_llm/cpp/build/benchmarks/disaggServerBenchmark benchmarks/cpp/
Created: 01/11/2026 2:46 PM UTC

Layer: 294dc94f960ce8690cd89a4262e1396eabf009075af0a58b4f54db13376807e5
Label: ARG CPP_BUILD_DIR=/src/tensorrt_llm/cpp/build
Created: 01/11/2026 2:46 PM UTC
...
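
The ENV layer and the ARG layers in the listing record the same version and commit under different names. The sketch below shows one way to cross-check them. The strings are copied from the layers shown above, while the helper `parse_assignments` and the variable names are hypothetical and exist only for this illustration.

```python
import shlex

# Values copied verbatim from the ENV and ARG layers in the listing.
env_layer = (
    "TRT_LLM_GIT_COMMIT=80649a8b78b2fbe9ac4fa9b004fcceb251ba7c2e "
    "TRT_LLM_VERSION=1.2.0rc8"
)
arg_layers = [
    "TARGETARCH=amd64",
    "TRT_LLM_VER=1.2.0rc8",
    "GIT_COMMIT=80649a8b78b2fbe9ac4fa9b004fcceb251ba7c2e",
]


def parse_assignments(text):
    """Split a space-separated run of KEY=VALUE tokens into a dict (hypothetical helper)."""
    return dict(token.split("=", 1) for token in shlex.split(text))


env = parse_assignments(env_layer)

# Merge all ARG layers into a single mapping.
args = {}
for layer in arg_layers:
    args.update(parse_assignments(layer))

# The build arguments should agree with the environment baked into the image.
version_matches = env["TRT_LLM_VERSION"] == args["TRT_LLM_VER"]
commit_matches = env["TRT_LLM_GIT_COMMIT"] == args["GIT_COMMIT"]
print(version_matches, commit_matches)  # prints: True True
```

For the layers shown here both checks pass, which is consistent with the image having been built from commit 80649a8b78b2fbe9ac4fa9b004fcceb251ba7c2e as version 1.2.0rc8.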
