NVIDIA
NVIDIA
TensorRT LLM Release
Container
NVIDIA
NVIDIA
TensorRT LLM Release

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs.

LayerLabelCreated
ab50c022839867891748c97a8e75d63baf7863b7132ce5c775b211128e3a85cbCONFIG
Entrypoint /opt/nvidia/nvidia_entrypoint.sh; WorkingDir /app/tensorrt_llm; ExposedPorts 6006/tcp, 8888/tcp
12/08/2025 3:14 AM UTC
7d9187d010062822bddff004fe08d8f51967b38ff5c53528f51a0f69c955d342COPY
scripts/generate_container_oss_attribution.sh /tmp/generate_container_oss_attribution.sh
12/08/2025 3:14 AM UTC
0abc00fe37d927d5df70828694fb314d8fcc3447d7abeb80d18d19f4b4c09709ENV
TRT_LLM_GIT_COMMIT=e4c707845ff58fcc0b1d87afb4dd0e64885c780a TRT_LLM_VERSION=1.2.0rc5
12/08/2025 3:14 AM UTC
45d7c62b3e05343f60dc345d1ab3d2ae7373a1ad902d979287abe5b86576c3d7ARG
TARGETARCH=amd64
12/08/2025 3:14 AM UTC
7adf66ee52255badcbd5e3c6a056615b094fa94f7d7e745edf7f0266cbe3d8dcARG
TRT_LLM_VER=1.2.0rc5
12/08/2025 3:14 AM UTC
dc4cd5cdd665f61ccfc497fb31be97d882a7127286e0a0e80c07db03e7e68f86ARG
GIT_COMMIT=e4c707845ff58fcc0b1d87afb4dd0e64885c780a
12/08/2025 3:14 AM UTC
84d9c57bc0fb46b48a91ea0b3e4c86cc9b067dbe5998d8aee782d1ad30d29733RUN
SRC_DIR=/src/tensorrt_llm CPP_BUILD_DIR=/src/tensorrt_llm/cpp/build /bin/bash -c chmod -R a+w examples &&
  rm -v benchmarks/cpp/bertBenchmark.cpp benchmarks/cpp/gptManagerBenchmark.cpp benchmarks/cpp/disaggServerBenchmark.cpp benchmarks/cpp/CMakeLists.txt &&
  rm -rf /root/.cache/pip
12/08/2025 3:14 AM UTC
4b6b52456c62724669ffdc988b19a0c3386bb56ea3a29104be18b966432f5bd2COPY
examples examples
12/08/2025 3:14 AM UTC
6f781c4b75e8ed070632893c2d09b899f333c725ab2145196ae80b5e3e7fbedeCOPY
/src/tensorrt_llm/cpp/build/benchmarks/bertBenchmark /src/tensorrt_llm/cpp/build/benchmarks/gptManagerBenchmark /src/tensorrt_llm/cpp/build/benchmarks/disaggServerBenchmark benchmarks/cpp/
12/08/2025 3:14 AM UTC
2a4c45acf6f79528ce208b7b58ecc8ae9eee27f04e23c055cad0b38bda9744c4ARG
CPP_BUILD_DIR=/src/tensorrt_llm/cpp/build
12/08/2025 3:14 AM UTC
...

NVIDIA uses cookies to improve your experience on our web site. We and our third-party partners also use cookies and other tools to collect and record information you provide as well as information about your interactions with our websites for performance improvement, analytics, and to assist in marketing efforts. By clicking "Accept All", you consent to our use of cookies and other tools as described in our Cookie Policy. You can manage your cookie settings by clicking on "Manage Settings." By continuing to use this site or by clicking one of the buttons below, you agree to our Terms of Service (which contains important waivers). Please see our Privacy Policy for more information on our privacy practices.