NVIDIA
NVIDIA
TensorRT LLM Release
Container
NVIDIA
NVIDIA
TensorRT LLM Release

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs.

LayerLabelCreated
5c8d20898dc81db7c7d1592aadd48ffb7bb46ae9636738379bfa6c1c2b2a12beCONFIG
Entrypoint /opt/nvidia/nvidia_entrypoint.sh; WorkingDir /app/tensorrt_llm; ExposedPorts 6006/tcp, 8888/tcp
08/07/2025 3:56 AM UTC
44bd2c12bd4862864a0bc459db9d1675cb682368c568b9a8461651a965f4bc43ARG
TRT_LLM_VER=1.0.0rc6
08/07/2025 3:56 AM UTC
20c9b74e64c0a838df7b9ae31247804d241dbbfb202c7b4e0407222df414cf22ARG
GIT_COMMIT=a16ba6445c61ed70e7aadfe787d6f316bb422652
08/07/2025 3:56 AM UTC
873b18b83ddda63c3dbb442633223a0f0a2a050b1a009cfaeee9112c5268e3d5RUN
SRC_DIR=/src/tensorrt_llm CPP_BUILD_DIR=/src/tensorrt_llm/cpp/build /bin/bash -c chmod -R a+w examples &&
  rm -v benchmarks/cpp/bertBenchmark.cpp benchmarks/cpp/gptManagerBenchmark.cpp benchmarks/cpp/disaggServerBenchmark.cpp benchmarks/cpp/CMakeLists.txt &&
  rm -rf /root/.cache/pip
08/07/2025 3:56 AM UTC
5cc1d4dfeba7b47b81a627b412e243b4c1ebf7e1cec272c8092ff365cb2eed27COPY
examples examples
08/07/2025 3:56 AM UTC
fa2fd2f8f0be9350d174047732a7f637ac24468da38289aa36cf0e672e3c61d6COPY
/src/tensorrt_llm/cpp/build/benchmarks/bertBenchmark /src/tensorrt_llm/cpp/build/benchmarks/gptManagerBenchmark /src/tensorrt_llm/cpp/build/benchmarks/disaggServerBenchmark benchmarks/cpp/
08/07/2025 3:56 AM UTC
3ce7818d2bc9f4a9947efb57e256377c84048f9ab731effd2a0b4b747e6a553aARG
CPP_BUILD_DIR=/src/tensorrt_llm/cpp/build
08/07/2025 3:56 AM UTC
07ba6f3c7e2421bdaec974e248968276f3d98f1492e7aa67eb11997d87d5752dCOPY
/src/tensorrt_llm/benchmarks benchmarks
08/07/2025 3:56 AM UTC
f152ed8bacd42ee5d0e5f5bcc572a527a4de83bfe40017eaa3990560fe77b2aeARG
SRC_DIR=/src/tensorrt_llm
08/07/2025 3:56 AM UTC
af27f5df7d61c4482c76d995c40be5ebb4f4d5a99bfb819c3fe19e6cd8d86fbfRUN
/bin/bash -c ln -sv $(python3 -c 'import site; print(f"{site.getsitepackages()[0]}/tensorrt_llm/bin")') bin &&
  test -f bin/executorWorker &&
  ln -sv $(python3 -c 'import site; print(f"{site.getsitepackages()[0]}/tensorrt_llm/libs")') lib &&
  test -f lib/libnvinfer_plugin_tensorrt_llm.so &&
  echo "/app/tensorrt_llm/lib" > /etc/ld.so.conf.d/tensorrt_llm.conf &&
  ldconfig &&
  ! ( ldd -v bin/executorWorker | grep tensorrt_llm | grep -q "not found" )
08/07/2025 3:56 AM UTC
...

NVIDIA uses cookies to improve your experience on our web site. We and our third-party partners also use cookies and other tools to collect and record information you provide as well as information about your interactions with our websites for performance improvement, analytics, and to assist in marketing efforts. By clicking "Accept All", you consent to our use of cookies and other tools as described in our Cookie Policy. You can manage your cookie settings by clicking on "Manage Settings." By continuing to use this site or by clicking one of the buttons below, you agree to our Terms of Service (which contains important waivers). Please see our Privacy Policy for more information on our privacy practices.