TensorRT LLM Release
Container

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs.

Layer  Label  Created

d319a6318dadfdf2eacf6e9b03f690f83c621f7ac51e0ca50bb3dd4be6d7355e  CONFIG  09/09/2025 8:26 AM UTC
  Entrypoint /opt/nvidia/nvidia_entrypoint.sh; WorkingDir /app/tensorrt_llm; ExposedPorts 6006/tcp, 8888/tcp

7c544d8b0e60d3fac36686d710396f88680a06506d0e587379b63aaf94c9e836  ARG  09/09/2025 8:26 AM UTC
  TRT_LLM_VER=1.1.0rc4

14c25be8d939303b12510ec07f94da92fb53cb2bde3ab767951f29fb9725eda9  ARG  09/09/2025 8:26 AM UTC
  GIT_COMMIT=62b564ac3c3347e5edc24d1bfa42edfb434de5b4

d2cfe3b3e04c6f77a77307a5f668eb489fd2c9f6dd4ce46364309243367e5269  RUN  09/09/2025 8:26 AM UTC
  SRC_DIR=/src/tensorrt_llm CPP_BUILD_DIR=/src/tensorrt_llm/cpp/build /bin/bash -c chmod -R a+w examples &&
    rm -v benchmarks/cpp/bertBenchmark.cpp benchmarks/cpp/gptManagerBenchmark.cpp benchmarks/cpp/disaggServerBenchmark.cpp benchmarks/cpp/CMakeLists.txt &&
    rm -rf /root/.cache/pip

3c7e86c43a248efe4dc7425ea4944f112ecbda39cba8b5c54f2d03aafa6f6355  COPY  09/09/2025 8:26 AM UTC
  examples examples

38b2684501f1f62e9ff94093f762dfdf77c131fa71af806fb146ca5b14424720  COPY  09/09/2025 8:26 AM UTC
  /src/tensorrt_llm/cpp/build/benchmarks/bertBenchmark /src/tensorrt_llm/cpp/build/benchmarks/gptManagerBenchmark /src/tensorrt_llm/cpp/build/benchmarks/disaggServerBenchmark benchmarks/cpp/

af0c0f4ea5fd4d28a371cfebd465292c320237c72113baa752d93ac546908e30  ARG  09/09/2025 8:26 AM UTC
  CPP_BUILD_DIR=/src/tensorrt_llm/cpp/build

6cbaffcd144c1e2c8d756b1755860c3f9ec1b191ec08a0df7b7f9c4abc275385  COPY  09/09/2025 8:26 AM UTC
  /src/tensorrt_llm/benchmarks benchmarks

55fbc684fd3fdd04c8ca9f6daf271d91aea3b26d7c84db7ececaab548ba2fca7  ARG  09/09/2025 8:26 AM UTC
  SRC_DIR=/src/tensorrt_llm

becb8928aa4d52c063bd68c9d4b7bc8e29834f5aec928ee9b90a3ff7baef043b  RUN  09/09/2025 8:26 AM UTC
  /bin/bash -c ln -sv $(python3 -c 'import site; print(f"{site.getsitepackages()[0]}/tensorrt_llm/bin")') bin &&
    test -f bin/executorWorker &&
    ln -sv $(python3 -c 'import site; print(f"{site.getsitepackages()[0]}/tensorrt_llm/libs")') lib &&
    test -f lib/libnvinfer_plugin_tensorrt_llm.so &&
    echo "/app/tensorrt_llm/lib" > /etc/ld.so.conf.d/tensorrt_llm.conf &&
    ldconfig &&
    ! ( ldd -v bin/executorWorker | grep tensorrt_llm | grep -q "not found" )
...
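
The last RUN layer listed above does two things that may be easier to follow in isolation: it resolves the installed package's bin and libs directories through `site.getsitepackages()`, and it fails the build if `ldd` reports any `tensorrt_llm` library as "not found". The following is a minimal Python sketch of that logic, not the image's actual build script. The helper names `package_dir` and `unresolved_libs` and the sample `ldd` output are hypothetical and exist only for illustration.

```python
import site
from pathlib import Path


def package_dir(subdir: str, package: str = "tensorrt_llm") -> Path:
    """Build the same path as the layer's inline python3 -c snippet:
    <first site-packages dir>/<package>/<subdir>."""
    return Path(site.getsitepackages()[0]) / package / subdir


def unresolved_libs(ldd_output: str, needle: str = "tensorrt_llm") -> list[str]:
    """Return the ldd lines that mention `needle` and are marked "not found".

    This mirrors `grep tensorrt_llm | grep "not found"`; the layer's leading
    `!` makes the build fail whenever that filter matches anything.
    """
    return [
        line.strip()
        for line in ldd_output.splitlines()
        if needle in line and "not found" in line
    ]


# Hypothetical ldd output, made up for illustration (not taken from the image).
SAMPLE_LDD = """\
    libtensorrt_llm.so => /app/tensorrt_llm/lib/libtensorrt_llm.so (0x00007f0000000000)
    libnvinfer_plugin_tensorrt_llm.so => not found
    libc.so.6 => /lib/x86_64-linux-gnu/libc.so.6 (0x00007f0000100000)
"""

if __name__ == "__main__":
    # Path the `bin` symlink would point at on this interpreter.
    print(package_dir("bin"))
    # Any entry here corresponds to a failed check in the layer.
    print(unresolved_libs(SAMPLE_LDD))
```

An empty list from `unresolved_libs` corresponds to the layer's check passing. Lines for unrelated libraries are ignored even when they are missing, because the filter only considers lines that contain `tensorrt_llm`.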
