NVIDIA
NVIDIA
TensorRT LLM Release
Container
NVIDIA
NVIDIA
TensorRT LLM Release

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs.

LayerLabelCreated
df4f7c9f7c21d15a4dc7c270608e72f2542a3cbd66a81527a146aaa3daac2aa0CONFIG
Entrypoint /opt/nvidia/nvidia_entrypoint.sh; WorkingDir /app/tensorrt_llm; ExposedPorts 6006/tcp, 8888/tcp
12/15/2025 6:04 PM UTC
5fb14cab6cd93e273ef27a93bdf8bb0df196d55f246173d4ae50ccac5bbd8798COPY
scripts/generate_container_oss_attribution.sh /tmp/generate_container_oss_attribution.sh
12/15/2025 6:04 PM UTC
5ccc58694feade09397e551c7330939bfa9a55c10e0c52684f52ad45b79d1ec8ENV
TRT_LLM_GIT_COMMIT=9ba14263db0045ed3fa0860f949b5ce320107eb3 TRT_LLM_VERSION=1.2.0rc6
12/15/2025 6:04 PM UTC
753e6cf3e2234f4e9cee7bf271bedb7b03952f1c3022574539d1dcc597c839b9ARG
TARGETARCH=amd64
12/15/2025 6:04 PM UTC
13057d3e107d85d882ec08f3d77a937e604a53d286cbd1cb39149bd413b52b69ARG
TRT_LLM_VER=1.2.0rc6
12/15/2025 6:04 PM UTC
038e981601b6edf8c01ff8f128b09ab645343ab966e74c3d7b7fc95b7d0dce75ARG
GIT_COMMIT=9ba14263db0045ed3fa0860f949b5ce320107eb3
12/15/2025 6:04 PM UTC
fe6e35198e96a0b9c10bd70be5c7c8affaa713ab1fe0546adf1df712d1176c59RUN
SRC_DIR=/src/tensorrt_llm CPP_BUILD_DIR=/src/tensorrt_llm/cpp/build /bin/bash -c chmod -R a+w examples &&
  rm -v benchmarks/cpp/bertBenchmark.cpp benchmarks/cpp/gptManagerBenchmark.cpp benchmarks/cpp/disaggServerBenchmark.cpp benchmarks/cpp/CMakeLists.txt &&
  rm -rf /root/.cache/pip
12/15/2025 6:04 PM UTC
f10203501ca9e1f372f53ad63166dff2d59eb4cf7a6e27a406a366b87d5bdea4COPY
examples examples
12/15/2025 6:03 PM UTC
674388d9a9a12e5fb3433087906c64963b9ddd9a764a04d768976f3e6ee5e52fCOPY
/src/tensorrt_llm/cpp/build/benchmarks/bertBenchmark /src/tensorrt_llm/cpp/build/benchmarks/gptManagerBenchmark /src/tensorrt_llm/cpp/build/benchmarks/disaggServerBenchmark benchmarks/cpp/
12/15/2025 6:03 PM UTC
ad5eea3acace0203bab77ddc77b49ccd5d423e6897fc471e0de7321fc9699756ARG
CPP_BUILD_DIR=/src/tensorrt_llm/cpp/build
12/15/2025 6:03 PM UTC
...

NVIDIA uses cookies to improve your experience on our web site. We and our third-party partners also use cookies and other tools to collect and record information you provide as well as information about your interactions with our websites for performance improvement, analytics, and to assist in marketing efforts. By clicking "Accept All", you consent to our use of cookies and other tools as described in our Cookie Policy. You can manage your cookie settings by clicking on "Manage Settings." By continuing to use this site or by clicking one of the buttons below, you agree to our Terms of Service (which contains important waivers). Please see our Privacy Policy for more information on our privacy practices.