NVIDIA
NVIDIA
TensorRT LLM Release
Container
NVIDIA
NVIDIA
TensorRT LLM Release

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs.

LayerLabelCreated
1f5d98ed97c7b888115d0312b949737bc0fadb3b54e78710fc4f88f13aa88fdaCONFIG
Entrypoint /opt/nvidia/nvidia_entrypoint.sh; WorkingDir /app/tensorrt_llm; ExposedPorts 6006/tcp, 8888/tcp
08/01/2025 7:16 AM UTC
ec4a6a68dc0313dd931c89259e903236b04c37fd688eb3a34a10236e4a390118ARG
TRT_LLM_VER=1.0.0rc5
08/01/2025 7:16 AM UTC
39344b230ae55a4f94d5e2a36297b86730cfc3bcafca1946f5ab849a92e3cff4ARG
GIT_COMMIT=fbee27990917affc73d2050384dd7e33d594a21b
08/01/2025 7:16 AM UTC
b2a957d44f75160479d637be890e17757c31610e9525eaa12db51b8d098f48ceRUN
SRC_DIR=/src/tensorrt_llm CPP_BUILD_DIR=/src/tensorrt_llm/cpp/build /bin/bash -c chmod -R a+w examples &&
  rm -v benchmarks/cpp/bertBenchmark.cpp benchmarks/cpp/gptManagerBenchmark.cpp benchmarks/cpp/disaggServerBenchmark.cpp benchmarks/cpp/CMakeLists.txt &&
  rm -rf /root/.cache/pip
08/01/2025 7:16 AM UTC
06d039e2e51b4765405600b605caa90b2cc351025ec59c00906619637325eb9fCOPY
examples examples
08/01/2025 7:16 AM UTC
564ed000923f3d30da327622deec4a9587ad53f3db652e148438846314cf298cCOPY
/src/tensorrt_llm/cpp/build/benchmarks/bertBenchmark /src/tensorrt_llm/cpp/build/benchmarks/gptManagerBenchmark /src/tensorrt_llm/cpp/build/benchmarks/disaggServerBenchmark benchmarks/cpp/
08/01/2025 7:16 AM UTC
3c46c1fa7fb4a4c13cc6f443868a910cb2ace97459a580899c9a6405765b0b3eARG
CPP_BUILD_DIR=/src/tensorrt_llm/cpp/build
08/01/2025 7:15 AM UTC
83dfdcb3dcdb74c1a78eac798c572a149a8dbe2ab24d1f35503a59a957c56b19COPY
/src/tensorrt_llm/benchmarks benchmarks
08/01/2025 7:15 AM UTC
829f2bdb7cb57c91336e9cab5ca31c76c827d65d43900af7e3a1555493b4fb53ARG
SRC_DIR=/src/tensorrt_llm
08/01/2025 7:15 AM UTC
b142a64647adf2ce416e6b08a5396e2582f4cca95f869dc2eb92dba1550b3687RUN
/bin/bash -c ln -sv $(python3 -c 'import site; print(f"{site.getsitepackages()[0]}/tensorrt_llm/bin")') bin &&
  test -f bin/executorWorker &&
  ln -sv $(python3 -c 'import site; print(f"{site.getsitepackages()[0]}/tensorrt_llm/libs")') lib &&
  test -f lib/libnvinfer_plugin_tensorrt_llm.so &&
  echo "/app/tensorrt_llm/lib" > /etc/ld.so.conf.d/tensorrt_llm.conf &&
  ldconfig &&
  ! ( ldd -v bin/executorWorker | grep tensorrt_llm | grep -q "not found" )
08/01/2025 7:15 AM UTC
...

NVIDIA uses cookies to improve your experience on our web site. We and our third-party partners also use cookies and other tools to collect and record information you provide as well as information about your interactions with our websites for performance improvement, analytics, and to assist in marketing efforts. By clicking "Accept All", you consent to our use of cookies and other tools as described in our Cookie Policy. You can manage your cookie settings by clicking on "Manage Settings." By continuing to use this site or by clicking one of the buttons below, you agree to our Terms of Service (which contains important waivers). Please see our Privacy Policy for more information on our privacy practices.