NVIDIA
NVIDIA
TensorRT LLM Release
Container
NVIDIA
NVIDIA
TensorRT LLM Release

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs.

LayerLabelCreated
d13ec797bd9abad3b12d4ce323a5e61fb2d4262c4caf7de5522e7acc59bc62eaCONFIG
Entrypoint /opt/nvidia/nvidia_entrypoint.sh; WorkingDir /app/tensorrt_llm; ExposedPorts 6006/tcp, 8888/tcp
08/05/2025 9:24 AM UTC
266f5ca61fb275d5a8f4490e5efdd49404ea615a4b506ffa8edd14fab1404a49ARG
TRT_LLM_VER=1.0.0rc5
08/05/2025 9:24 AM UTC
deca08b3e13b39a7eadbaf655e292e1e9b8c6ada412b1801b7fe7678900a7eaaARG
GIT_COMMIT=791fd56a9aef101fcc39abdf972f160f93502a0b
08/05/2025 9:24 AM UTC
b148dc77e080da29da6a3c1ff95ec625450d474739c02d1ac06ecbf6bf4d802aRUN
SRC_DIR=/src/tensorrt_llm CPP_BUILD_DIR=/src/tensorrt_llm/cpp/build /bin/bash -c chmod -R a+w examples &&
  rm -v benchmarks/cpp/bertBenchmark.cpp benchmarks/cpp/gptManagerBenchmark.cpp benchmarks/cpp/disaggServerBenchmark.cpp benchmarks/cpp/CMakeLists.txt &&
  rm -rf /root/.cache/pip
08/05/2025 9:24 AM UTC
4d02a1361941819579e76d3887cac25f48204496e3a69b1448f50aba57227eccCOPY
examples examples
08/05/2025 9:24 AM UTC
e5eebe808caf526a5f0ee407edd6cae0409ad34726114574e883248ceed4f964COPY
/src/tensorrt_llm/cpp/build/benchmarks/bertBenchmark /src/tensorrt_llm/cpp/build/benchmarks/gptManagerBenchmark /src/tensorrt_llm/cpp/build/benchmarks/disaggServerBenchmark benchmarks/cpp/
08/05/2025 9:24 AM UTC
1a2ada5082f29bc0bc4338099a5836b81a5305af16d53bc857ba543e8d378858ARG
CPP_BUILD_DIR=/src/tensorrt_llm/cpp/build
08/05/2025 9:24 AM UTC
35d8c400bb92e11f5026488bec33078593d25e79602749636ca1c0e0d0840124COPY
/src/tensorrt_llm/benchmarks benchmarks
08/05/2025 9:24 AM UTC
14d9cd435be8629a48307a52247cc7b7be7c6a7ffe7ab2f4b68c9cb0eeece01eARG
SRC_DIR=/src/tensorrt_llm
08/05/2025 9:24 AM UTC
037ceb939bcf617d72e6dc546729fc69975ac19cdf7bf0e4c686df5b31843b34RUN
/bin/bash -c ln -sv $(python3 -c 'import site; print(f"{site.getsitepackages()[0]}/tensorrt_llm/bin")') bin &&
  test -f bin/executorWorker &&
  ln -sv $(python3 -c 'import site; print(f"{site.getsitepackages()[0]}/tensorrt_llm/libs")') lib &&
  test -f lib/libnvinfer_plugin_tensorrt_llm.so &&
  echo "/app/tensorrt_llm/lib" > /etc/ld.so.conf.d/tensorrt_llm.conf &&
  ldconfig &&
  ! ( ldd -v bin/executorWorker | grep tensorrt_llm | grep -q "not found" )
08/05/2025 9:24 AM UTC
...

NVIDIA uses cookies to improve your experience on our web site. We and our third-party partners also use cookies and other tools to collect and record information you provide as well as information about your interactions with our websites for performance improvement, analytics, and to assist in marketing efforts. By clicking "Accept All", you consent to our use of cookies and other tools as described in our Cookie Policy. You can manage your cookie settings by clicking on "Manage Settings." By continuing to use this site or by clicking one of the buttons below, you agree to our Terms of Service (which contains important waivers). Please see our Privacy Policy for more information on our privacy practices.