NVIDIA
NVIDIA
TensorRT LLM Release
Container
NVIDIA
NVIDIA
TensorRT LLM Release

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs.

LayerLabelCreated
1aae1da6af591f5dfa4e2998c52ce8bffb8910bb36f31a68116368be035880d2CONFIG
Entrypoint /opt/nvidia/nvidia_entrypoint.sh; WorkingDir /app/tensorrt_llm; ExposedPorts 6006/tcp, 8888/tcp
12/19/2025 2:41 AM UTC
e5a54caeab4dab7c19384c67d4b57551206a5d960937a02938a3504b58e4d674ARG
TRT_LLM_VER=1.1.0
12/19/2025 2:41 AM UTC
f0f5092914246480e789d312168cddd662603d545a18711cb6d355829077467bARG
GIT_COMMIT=48b7b5d8b7ed1e6bff57aecc6ff7d1533288bed8
12/19/2025 2:41 AM UTC
fcebe2d198160c2e4e4e8937cbe3bbc8270f19d6d88cf38d8edb0e66f0e743c3RUN
SRC_DIR=/src/tensorrt_llm CPP_BUILD_DIR=/src/tensorrt_llm/cpp/build /bin/bash -c chmod -R a+w examples &&
  rm -v benchmarks/cpp/bertBenchmark.cpp benchmarks/cpp/gptManagerBenchmark.cpp benchmarks/cpp/disaggServerBenchmark.cpp benchmarks/cpp/CMakeLists.txt &&
  rm -rf /root/.cache/pip
12/19/2025 2:41 AM UTC
e5ce3590ae5f37c223b6b5460eb40f27ef2949e67e45ca05d4a363d7f5ad5f18COPY
examples examples
12/19/2025 2:41 AM UTC
89943a5173880a8f1df0db774d00e0d758f442c55dcdf720cc2c816027039070COPY
/src/tensorrt_llm/cpp/build/benchmarks/bertBenchmark /src/tensorrt_llm/cpp/build/benchmarks/gptManagerBenchmark /src/tensorrt_llm/cpp/build/benchmarks/disaggServerBenchmark benchmarks/cpp/
12/19/2025 2:41 AM UTC
40af76dd862e365b305d0bf417666f302133aa8cbb535f39b85c15a45ec2c8ebARG
CPP_BUILD_DIR=/src/tensorrt_llm/cpp/build
12/19/2025 2:41 AM UTC
af79d497aab4c7ca6c6153c042bc1e02b041d66222fdb478f773beabff91c47fCOPY
/src/tensorrt_llm/benchmarks benchmarks
12/19/2025 2:41 AM UTC
99bf80a24d608cc3750451970c363ee5199f6b66ccea667d810ec3586b9077c4ARG
SRC_DIR=/src/tensorrt_llm
12/19/2025 2:41 AM UTC
02772c9bfe3f80d2646d27dc32bfede40027b7368981b1e622f30fbda6e3c49dRUN
/bin/bash -c ln -sv $(python3 -c 'import site; print(f"{site.getsitepackages()[0]}/tensorrt_llm/bin")') bin &&
  test -f bin/executorWorker &&
  ln -sv $(python3 -c 'import site; print(f"{site.getsitepackages()[0]}/tensorrt_llm/libs")') lib &&
  test -f lib/libnvinfer_plugin_tensorrt_llm.so &&
  echo "/app/tensorrt_llm/lib" > /etc/ld.so.conf.d/tensorrt_llm.conf &&
  ldconfig &&
  ! ( ldd -v bin/executorWorker | grep tensorrt_llm | grep -q "not found" )
12/19/2025 2:41 AM UTC
...

NVIDIA uses cookies to improve your experience on our web site. We and our third-party partners also use cookies and other tools to collect and record information you provide as well as information about your interactions with our websites for performance improvement, analytics, and to assist in marketing efforts. By clicking "Accept All", you consent to our use of cookies and other tools as described in our Cookie Policy. You can manage your cookie settings by clicking on "Manage Settings." By continuing to use this site or by clicking one of the buttons below, you agree to our Terms of Service (which contains important waivers). Please see our Privacy Policy for more information on our privacy practices.