NVIDIA
NVIDIA
TensorRT LLM Release
Container
NVIDIA
NVIDIA
TensorRT LLM Release

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs.

LayerLabelCreated
f054982694d245e2bd80f04348ed4ebad48e792f624349c3fd9fdb6635aa96f1CONFIG
Entrypoint /opt/nvidia/nvidia_entrypoint.sh; WorkingDir /app/tensorrt_llm; ExposedPorts 6006/tcp, 8888/tcp
10/19/2025 10:51 PM UTC
099657c4c3379164d05d2c49349224f0c7da118586f1018e5230031307141788ARG
TRT_LLM_VER=1.2.0rc1
10/19/2025 10:51 PM UTC
cafdc9500dd9d3848d8c1b8de63d4c55e0b3a1a98e60de1c8485f4f6674e50d6ARG
GIT_COMMIT=796891ba2a6959bad58c0da9645416c7264349e9
10/19/2025 10:51 PM UTC
425a8a674ea75194772ba26956884bd408ea78df4861da1dd414664c1ca2ff04RUN
SRC_DIR=/src/tensorrt_llm CPP_BUILD_DIR=/src/tensorrt_llm/cpp/build /bin/bash -c chmod -R a+w examples &&
  rm -v benchmarks/cpp/bertBenchmark.cpp benchmarks/cpp/gptManagerBenchmark.cpp benchmarks/cpp/disaggServerBenchmark.cpp benchmarks/cpp/CMakeLists.txt &&
  rm -rf /root/.cache/pip
10/19/2025 10:51 PM UTC
df8812fc6011ad13c700a01d9c810b4d2333ba1e1cb456deb1f54a4db0416b1aCOPY
examples examples
10/19/2025 10:51 PM UTC
754c3a44d7f7f755bcd60063118c9254cc4f753cbc36175b2f4376c828d83fe0COPY
/src/tensorrt_llm/cpp/build/benchmarks/bertBenchmark /src/tensorrt_llm/cpp/build/benchmarks/gptManagerBenchmark /src/tensorrt_llm/cpp/build/benchmarks/disaggServerBenchmark benchmarks/cpp/
10/19/2025 10:51 PM UTC
e6a5bc97b6fcc1ee00d692527a3fc6aa79a70a98259f1f57b3d2fe8fa7455fc7ARG
CPP_BUILD_DIR=/src/tensorrt_llm/cpp/build
10/19/2025 10:51 PM UTC
20582e53e7ce21a333d291fc57e9fad25de2655c9305122c4d15b929fdb645f7COPY
/src/tensorrt_llm/benchmarks benchmarks
10/19/2025 10:51 PM UTC
93ccbd2e301c408b023e2de4bfc5beb66c281fc67d5f9868aa89c623e1aabc1dARG
SRC_DIR=/src/tensorrt_llm
10/19/2025 10:51 PM UTC
d14d545e545f1e90c8b1a519c51e557a96cb43034e409345129b8cf3a0614e0cRUN
/bin/bash -c ln -sv $(python3 -c 'import site; print(f"{site.getsitepackages()[0]}/tensorrt_llm/bin")') bin &&
  test -f bin/executorWorker &&
  ln -sv $(python3 -c 'import site; print(f"{site.getsitepackages()[0]}/tensorrt_llm/libs")') lib &&
  test -f lib/libnvinfer_plugin_tensorrt_llm.so &&
  echo "/app/tensorrt_llm/lib" > /etc/ld.so.conf.d/tensorrt_llm.conf &&
  ldconfig &&
  ! ( ldd -v bin/executorWorker | grep tensorrt_llm | grep -q "not found" )
10/19/2025 10:51 PM UTC
...

NVIDIA uses cookies to improve your experience on our web site. We and our third-party partners also use cookies and other tools to collect and record information you provide as well as information about your interactions with our websites for performance improvement, analytics, and to assist in marketing efforts. By clicking "Accept All", you consent to our use of cookies and other tools as described in our Cookie Policy. You can manage your cookie settings by clicking on "Manage Settings." By continuing to use this site or by clicking one of the buttons below, you agree to our Terms of Service (which contains important waivers). Please see our Privacy Policy for more information on our privacy practices.