NVIDIA
NVIDIA
TensorRT LLM Release
Container
NVIDIA
NVIDIA
TensorRT LLM Release

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs.

LayerLabelCreated
06ead4bc1dce7d31e9dd0dfbcc2886f4e824b35f9b123514aa24a055d024f1edCONFIG
Entrypoint /opt/nvidia/nvidia_entrypoint.sh; WorkingDir /app/tensorrt_llm; ExposedPorts 6006/tcp, 8888/tcp
05/21/2025 6:11 PM UTC
4a9a5dbb1be3df6891d19d9e7d32a9b5debe22e4f459879a04e72d94146ee5f7ARG
TRT_LLM_VER=0.20.0
05/21/2025 6:11 PM UTC
39a9b9fffa7f02d3a47f4028bbb9583f36472233998d6b225c0c67124104b107ARG
GIT_COMMIT=819cc994df9ee920d874d42da9df6b44f890cff6
05/21/2025 6:11 PM UTC
eb898fadf09ed56a619b1f9adf648149b1f02e31ffa5e19907d75770fb411b32RUN
SRC_DIR=/src/tensorrt_llm CPP_BUILD_DIR=/src/tensorrt_llm/cpp/build /bin/bash -c chmod -R a+w examples &&
  rm -v benchmarks/cpp/bertBenchmark.cpp benchmarks/cpp/gptManagerBenchmark.cpp benchmarks/cpp/disaggServerBenchmark.cpp benchmarks/cpp/CMakeLists.txt &&
  rm -rf /root/.cache/pip
05/21/2025 6:11 PM UTC
5dbe9f4099a2e7c0279c6718b230d4ec99c9641b397ab488f85972d1ee1210dfCOPY
examples examples
05/21/2025 6:11 PM UTC
e8fb6e7d2ef3afc5e4b158f49a52ef28b79eb2f83b4a65acada9bc7865516368COPY
/src/tensorrt_llm/cpp/build/benchmarks/bertBenchmark /src/tensorrt_llm/cpp/build/benchmarks/gptManagerBenchmark /src/tensorrt_llm/cpp/build/benchmarks/disaggServerBenchmark benchmarks/cpp/
05/21/2025 6:11 PM UTC
0f96a2868489685bb3b662f456d48765f82f1756cbf41cb44117b7acc3c4e2f6ARG
CPP_BUILD_DIR=/src/tensorrt_llm/cpp/build
05/21/2025 6:11 PM UTC
88ff4c518ed621560849b1ab1c593a2f90c78d12859f2e1dc6ec56a5a5a0131cCOPY
/src/tensorrt_llm/benchmarks benchmarks
05/21/2025 6:11 PM UTC
c6ce8c2c136ba4590b52800d5d59527012168af151d6c376a1c4b1014f063bafARG
SRC_DIR=/src/tensorrt_llm
05/21/2025 6:11 PM UTC
c0d2ba614e43023923a9e70d0895913f78ccb06c65a1f90fa0f656dcaed19383RUN
/bin/bash -c ln -sv $(python3 -c 'import site; print(f"{site.getsitepackages()[0]}/tensorrt_llm/bin")') bin &&
  test -f bin/executorWorker &&
  ln -sv $(python3 -c 'import site; print(f"{site.getsitepackages()[0]}/tensorrt_llm/libs")') lib &&
  test -f lib/libnvinfer_plugin_tensorrt_llm.so &&
  echo "/app/tensorrt_llm/lib" > /etc/ld.so.conf.d/tensorrt_llm.conf &&
  ldconfig &&
  ! ( ldd -v bin/executorWorker | grep tensorrt_llm | grep -q "not found" )
05/21/2025 6:11 PM UTC
...

NVIDIA uses cookies to improve your experience on our web site. We and our third-party partners also use cookies and other tools to collect and record information you provide as well as information about your interactions with our websites for performance improvement, analytics, and to assist in marketing efforts. By clicking "Accept All", you consent to our use of cookies and other tools as described in our Cookie Policy. You can manage your cookie settings by clicking on "Manage Settings." By continuing to use this site or by clicking one of the buttons below, you agree to our Terms of Service (which contains important waivers). Please see our Privacy Policy for more information on our privacy practices.