NVIDIA
TensorRT LLM Release
Container

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs.

Layer  Label  Created

ff8fa2f24a8ca95c57e0a3c7a849b0fba11f0ad598a113073efd75c891072b09  CONFIG  11/05/2025 7:49 AM UTC
Entrypoint /opt/nvidia/nvidia_entrypoint.sh; WorkingDir /app/tensorrt_llm; ExposedPorts 6006/tcp, 8888/tcp

20fddcbbff939c15782b5083bb0597689e266b913a4c2e87935b779abb9b1110  ARG  11/05/2025 7:49 AM UTC
TRT_LLM_VER=1.2.0rc2

c22c7d915d89b06956cb57a44c635653a150601e5c38fec3a9632e13628c20b7  ARG  11/05/2025 7:49 AM UTC
GIT_COMMIT=31116825b39f4e6a6a1e127001f5204b73d1dc32

ba528552cf2a16a612545aaf479e3125989a0bd709f3e46da8fc2020364e6d25  RUN  11/05/2025 7:49 AM UTC
SRC_DIR=/src/tensorrt_llm CPP_BUILD_DIR=/src/tensorrt_llm/cpp/build /bin/bash -c chmod -R a+w examples &&
  rm -v benchmarks/cpp/bertBenchmark.cpp benchmarks/cpp/gptManagerBenchmark.cpp benchmarks/cpp/disaggServerBenchmark.cpp benchmarks/cpp/CMakeLists.txt &&
  rm -rf /root/.cache/pip

5e66f319a9227ff35be8678c12d31d2f6fcd2402a227b65512edd65b4db94586  COPY  11/05/2025 7:49 AM UTC
examples examples

85e5f536cb29ba1926ad1c4e240afab517d3989eaa3e580b3966a08f4e8babbc  COPY  11/05/2025 7:49 AM UTC
/src/tensorrt_llm/cpp/build/benchmarks/bertBenchmark /src/tensorrt_llm/cpp/build/benchmarks/gptManagerBenchmark /src/tensorrt_llm/cpp/build/benchmarks/disaggServerBenchmark benchmarks/cpp/

0ff92802964b4ee683fde343e3de57acd8d3ab4ad9bae26ae9fe136d81b2e5d2  ARG  11/05/2025 7:49 AM UTC
CPP_BUILD_DIR=/src/tensorrt_llm/cpp/build

be91c7cc40c631f39839e296c7625da72ff276045fa679d308a2439251846ca8  COPY  11/05/2025 7:49 AM UTC
/src/tensorrt_llm/benchmarks benchmarks

3d3a6a729caf600ce6eae2f31d27beb52f8d8880ea8531c5e6fb2c5b023218fd  ARG  11/05/2025 7:49 AM UTC
SRC_DIR=/src/tensorrt_llm

0770ae90ece20c9e4ef3b33138c619aca3d0836bb378f275d3bb87e147286316  RUN  11/05/2025 7:49 AM UTC
/bin/bash -c ln -sv $(python3 -c 'import site; print(f"{site.getsitepackages()[0]}/tensorrt_llm/bin")') bin &&
  test -f bin/executorWorker &&
  ln -sv $(python3 -c 'import site; print(f"{site.getsitepackages()[0]}/tensorrt_llm/libs")') lib &&
  test -f lib/libnvinfer_plugin_tensorrt_llm.so &&
  echo "/app/tensorrt_llm/lib" > /etc/ld.so.conf.d/tensorrt_llm.conf &&
  ldconfig &&
  ! ( ldd -v bin/executorWorker | grep tensorrt_llm | grep -q "not found" )
...
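
The RUN layer with digest 0770ae90... links the installed package's `bin` and `libs` directories into the working directory, checks that `executorWorker` and `libnvinfer_plugin_tensorrt_llm.so` are present, and fails the build if `ldd` reports an unresolved `tensorrt_llm` dependency. The Python sketch below restates that logic for readers who want to reproduce the check outside the image build. The function names `link_and_check` and `unresolved_trtllm_deps` are hypothetical helpers invented for illustration and are not part of TensorRT LLM; the sketch also assumes a POSIX filesystem where symlinks are permitted, and it leaves out the `ldconfig` step.

```python
from pathlib import Path


def link_and_check(site_packages: Path, workdir: Path) -> dict:
    """Mirror the `ln -sv ... bin`, `ln -sv ... lib`, and `test -f` steps.

    Hypothetical helper: creates `workdir/bin` and `workdir/lib` as symlinks
    into `site_packages/tensorrt_llm` and raises if a required file is absent.
    """
    targets = {
        "bin": (site_packages / "tensorrt_llm" / "bin", "executorWorker"),
        "lib": (
            site_packages / "tensorrt_llm" / "libs",
            "libnvinfer_plugin_tensorrt_llm.so",
        ),
    }
    links = {}
    for name, (target, required) in targets.items():
        link = workdir / name
        link.symlink_to(target, target_is_directory=True)
        # Equivalent of `test -f bin/executorWorker` and the matching lib check.
        if not (link / required).is_file():
            raise FileNotFoundError(f"{link / required} is missing")
        links[name] = link
    return links


def unresolved_trtllm_deps(ldd_output: str) -> list:
    """Return `ldd` lines that mention tensorrt_llm and are marked "not found".

    Same filter as `grep tensorrt_llm | grep "not found"`; the layer's
    leading `!` makes the build succeed only when this list is empty.
    """
    return [
        line.strip()
        for line in ldd_output.splitlines()
        if "tensorrt_llm" in line and "not found" in line
    ]


if __name__ == "__main__":
    # Illustrative ldd output, not captured from the real container.
    sample = (
        "\tlibtensorrt_llm.so => not found\n"
        "\tlibc.so.6 => /lib/x86_64-linux-gnu/libc.so.6 (0x0000000000000000)\n"
    )
    print(unresolved_trtllm_deps(sample))
```

In a real container, the first argument to `link_and_check` would come from `site.getsitepackages()[0]`, as in the layer command, and the text passed to `unresolved_trtllm_deps` would be the captured output of `ldd -v bin/executorWorker`.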
