NVIDIA
TensorRT LLM Release
Container

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs.
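
As a rough illustration of the Python API mentioned above, the sketch below wraps a single generation call. It assumes the `LLM` and `SamplingParams` classes exported by the `tensorrt_llm` package; the helper name `generate_texts`, the sampling values, and the model name in the usage comment are illustrative choices, not something this page specifies.

```python
from typing import List


def generate_texts(model: str, prompts: List[str]) -> List[str]:
    """Return one completion per prompt (hypothetical helper).

    Needs an NVIDIA GPU and the tensorrt_llm package, for example
    inside this container.
    """
    # Imported lazily so this file can still be loaded on machines
    # that do not have tensorrt_llm installed.
    from tensorrt_llm import LLM, SamplingParams

    # Sampling values are illustrative, not recommended defaults.
    sampling = SamplingParams(temperature=0.8, top_p=0.95)

    # `model` may be a Hugging Face model name or a local checkpoint path.
    llm = LLM(model=model)
    results = llm.generate(prompts, sampling)
    return [result.outputs[0].text for result in results]


# Example call (placeholder model name; run on a GPU host):
#   generate_texts("TinyLlama/TinyLlama-1.1B-Chat-v1.0", ["Hello, my name is"])
```

The import sits inside the function so that nothing touches the GPU until the helper is actually called.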

Layer  Label  Created

56d26e8a1c9f271d6cfb1af55962ff131a0d755318fd9bd1808e6b2d0cb909a2  CONFIG  10/01/2025 8:23 PM UTC
    Entrypoint /opt/nvidia/nvidia_entrypoint.sh; WorkingDir /app/tensorrt_llm; ExposedPorts 6006/tcp, 8888/tcp

396de5e29c677057947892bf6ddd66c2e31b84b4aa2cf5ba2dc0ae17cecfe5bb  ARG  10/01/2025 8:23 PM UTC
    TRT_LLM_VER=1.1.0rc3

647337b8d8d3eb7a020d17ee72ee582884d6eed057835cefbf87c3308fc22265  ARG  10/01/2025 8:23 PM UTC
    GIT_COMMIT=8505c3ad88e75c511b880dce998c0ea862802bd4

3cc98cd1a7d24b7b0543d2cbeb8f17c1605bd5e9dfe9cf4b74bf3150b3d75a96  RUN  10/01/2025 8:23 PM UTC
    SRC_DIR=/src/tensorrt_llm CPP_BUILD_DIR=/src/tensorrt_llm/cpp/build /bin/bash -c chmod -R a+w examples &&
      rm -v benchmarks/cpp/bertBenchmark.cpp benchmarks/cpp/gptManagerBenchmark.cpp benchmarks/cpp/disaggServerBenchmark.cpp benchmarks/cpp/CMakeLists.txt &&
      rm -rf /root/.cache/pip

be2bc3ce64654b309124042b8b826bcb53ea673e276a4719e8cbef1864dea188  COPY  10/01/2025 8:23 PM UTC
    examples examples

221df1134df4b29b80d4d695a3262f7f48eb8cf0668625bdfb16a4e71f12fac2  COPY  10/01/2025 8:23 PM UTC
    /src/tensorrt_llm/cpp/build/benchmarks/bertBenchmark /src/tensorrt_llm/cpp/build/benchmarks/gptManagerBenchmark /src/tensorrt_llm/cpp/build/benchmarks/disaggServerBenchmark benchmarks/cpp/

317bbf6cbb81d805b5948ec8dfbd32b91895b14e141f5fd050f57b82ea22e966  ARG  10/01/2025 8:23 PM UTC
    CPP_BUILD_DIR=/src/tensorrt_llm/cpp/build

2a6645da96f101fc5804115f81c50bfcd54bf4a9d9d8ffcfa8f9f194a8ab8964  COPY  10/01/2025 8:23 PM UTC
    /src/tensorrt_llm/benchmarks benchmarks

217ba9a6a03e48bba9fbdb6d29b84b2a8bb47f90b3f1219bb65881b8a7d655d8  ARG  10/01/2025 8:23 PM UTC
    SRC_DIR=/src/tensorrt_llm

a7aa399f6671816bfaab4fcc101b30c23c1d9e135fbba8a1d6341fbaaa5a84e7  RUN  10/01/2025 8:23 PM UTC
    /bin/bash -c ln -sv $(python3 -c 'import site; print(f"{site.getsitepackages()[0]}/tensorrt_llm/bin")') bin &&
      test -f bin/executorWorker &&
      ln -sv $(python3 -c 'import site; print(f"{site.getsitepackages()[0]}/tensorrt_llm/libs")') lib &&
      test -f lib/libnvinfer_plugin_tensorrt_llm.so &&
      echo "/app/tensorrt_llm/lib" > /etc/ld.so.conf.d/tensorrt_llm.conf &&
      ldconfig &&
      ! ( ldd -v bin/executorWorker | grep tensorrt_llm | grep -q "not found" )
...
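
The RUN layer that symlinks `bin` and `lib` and then inspects `ldd` output can be approximated in Python, which may help when re-checking a derived image. This is a sketch, not the build's own tooling: the function names and the sample `ldd` text are made up for illustration, and the filter only mirrors the `grep tensorrt_llm | grep "not found"` pipeline from that layer.

```python
import site
from pathlib import Path
from typing import Dict, List, Optional


def trtllm_dirs(site_packages: Optional[str] = None) -> Dict[str, Path]:
    """Locate the package directories that the layer links as `bin` and `lib`."""
    if site_packages is None:
        # Same lookup the layer performs with `python3 -c 'import site; ...'`.
        site_packages = site.getsitepackages()[0]
    root = Path(site_packages) / "tensorrt_llm"
    return {"bin": root / "bin", "lib": root / "libs"}


def unresolved_trtllm_libs(ldd_output: str) -> List[str]:
    """Keep ldd lines that mention tensorrt_llm and are reported as not found."""
    return [
        line.strip()
        for line in ldd_output.splitlines()
        if "tensorrt_llm" in line and "not found" in line
    ]


# Made-up ldd output for illustration only (not taken from the image).
SAMPLE_LDD = """
    libnvinfer_plugin_tensorrt_llm.so => not found
    libc.so.6 => /lib/x86_64-linux-gnu/libc.so.6 (0x00007f0000000000)
"""

missing = unresolved_trtllm_libs(SAMPLE_LDD)
print(missing)
```

An empty list corresponds to the layer's final check passing. Any entry means a TensorRT LLM shared library was not resolved after `ldconfig`.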
