NVIDIA
NVIDIA
TensorRT LLM Release
Container
NVIDIA
NVIDIA
TensorRT LLM Release

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs.

LayerLabelCreated
66bc59e177525804a2d55ab4a56b328de89a14ae5bc25a165fb99cda6afad5d1CONFIG
Entrypoint /opt/nvidia/nvidia_entrypoint.sh; WorkingDir /app/tensorrt_llm; ExposedPorts 6006/tcp, 8888/tcp
11/23/2025 4:03 AM UTC
75835602292743d0e07761bb07c582ea790c35682454ddedadd428f873259596ARG
TRT_LLM_VER=1.2.0rc4
11/23/2025 4:03 AM UTC
e42e31222eda99cee2f96cdc2487645bf1499f9adcbf9d69392fca4468e29b6dARG
GIT_COMMIT=a761585d9c15b4c1249aaf65a8f90764efa83a3c
11/23/2025 4:03 AM UTC
0e89a125c6b3598ba1d3cfca950972ea532d6d7329b477284ea4612ee91c6556RUN
SRC_DIR=/src/tensorrt_llm CPP_BUILD_DIR=/src/tensorrt_llm/cpp/build /bin/bash -c chmod -R a+w examples &&
  rm -v benchmarks/cpp/bertBenchmark.cpp benchmarks/cpp/gptManagerBenchmark.cpp benchmarks/cpp/disaggServerBenchmark.cpp benchmarks/cpp/CMakeLists.txt &&
  rm -rf /root/.cache/pip
11/23/2025 4:03 AM UTC
77d2c160b0a2ffbd2b233556e3a015eae94d8befda0033e391f1248dec95fb81COPY
examples examples
11/23/2025 4:03 AM UTC
e13371ea4fc926aae69de9e6e2f0a826d02d5732f22a461db40bddd930fb6dfeCOPY
/src/tensorrt_llm/cpp/build/benchmarks/bertBenchmark /src/tensorrt_llm/cpp/build/benchmarks/gptManagerBenchmark /src/tensorrt_llm/cpp/build/benchmarks/disaggServerBenchmark benchmarks/cpp/
11/23/2025 4:03 AM UTC
384711484bd6f105f0de98b1413c56f822f78030aaf050df923f305894567e34ARG
CPP_BUILD_DIR=/src/tensorrt_llm/cpp/build
11/23/2025 4:03 AM UTC
9778b5ae0aa0ee246708172ac34211981f4da8c0d301f0a08721c70a7c99dcdbCOPY
/src/tensorrt_llm/benchmarks benchmarks
11/23/2025 4:03 AM UTC
f8ec89135d7e1aef27920917bb8dfdc702f1ca08c4a629c8383ae253de098563ARG
SRC_DIR=/src/tensorrt_llm
11/23/2025 4:03 AM UTC
24af5ef61a145058b6b5ed8c0623594e535dfa890fd7a3936f8117e53e130113RUN
/bin/bash -c ln -sv $(python3 -c 'import site; print(f"{site.getsitepackages()[0]}/tensorrt_llm/bin")') bin &&
  test -f bin/executorWorker &&
  ln -sv $(python3 -c 'import site; print(f"{site.getsitepackages()[0]}/tensorrt_llm/libs")') lib &&
  test -f lib/libnvinfer_plugin_tensorrt_llm.so &&
  echo "/app/tensorrt_llm/lib" > /etc/ld.so.conf.d/tensorrt_llm.conf &&
  ldconfig &&
  ! ( ldd -v bin/executorWorker | grep tensorrt_llm | grep -q "not found" )
11/23/2025 4:03 AM UTC
...

NVIDIA uses cookies to improve your experience on our web site. We and our third-party partners also use cookies and other tools to collect and record information you provide as well as information about your interactions with our websites for performance improvement, analytics, and to assist in marketing efforts. By clicking "Accept All", you consent to our use of cookies and other tools as described in our Cookie Policy. You can manage your cookie settings by clicking on "Manage Settings." By continuing to use this site or by clicking one of the buttons below, you agree to our Terms of Service (which contains important waivers). Please see our Privacy Policy for more information on our privacy practices.