NVIDIA TensorRT LLM Release (Container)

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs.

Layer: 80f38431acd7477eee73e40cb7f5c5cd6123206d6cd943897928d48c2e262748
Label: CONFIG
Created: 09/15/2025 3:28 PM UTC
Entrypoint /opt/nvidia/nvidia_entrypoint.sh; WorkingDir /app/tensorrt_llm; ExposedPorts 6006/tcp, 8888/tcp

Layer: a6e4bc4eafd148f9d3a014a4115ecdc30950b84e4c63138bc97d693412ea4247
Label: ARG
Created: 09/15/2025 3:28 PM UTC
TRT_LLM_VER=1.1.0rc5

Layer: 4126454b7fa86aa53de683f2d8fbfc0e82eb77f3b5ea2c8644383d229b9309da
Label: ARG
Created: 09/15/2025 3:28 PM UTC
GIT_COMMIT=0c9430e5a530ba958fc9dca561a3ad865ad9f492

Layer: 9db222b558753bcaa5c5f63eb7d50ab9e6919c676ec207517ba40e3d65c4940a
Label: RUN
Created: 09/15/2025 3:28 PM UTC
SRC_DIR=/src/tensorrt_llm CPP_BUILD_DIR=/src/tensorrt_llm/cpp/build /bin/bash -c chmod -R a+w examples &&
  rm -v benchmarks/cpp/bertBenchmark.cpp benchmarks/cpp/gptManagerBenchmark.cpp benchmarks/cpp/disaggServerBenchmark.cpp benchmarks/cpp/CMakeLists.txt &&
  rm -rf /root/.cache/pip

Layer: eb7a2febecf4f1cf55a095f8b55abb4a2112d2d4a7e4c59272cf8bc8aa6c4cdb
Label: COPY
Created: 09/15/2025 3:28 PM UTC
examples examples

Layer: d053b7c47a7251ff42051ac26abc76643b06375ae8a88e1abcbbaeb5d5ada907
Label: COPY
Created: 09/15/2025 3:28 PM UTC
/src/tensorrt_llm/cpp/build/benchmarks/bertBenchmark /src/tensorrt_llm/cpp/build/benchmarks/gptManagerBenchmark /src/tensorrt_llm/cpp/build/benchmarks/disaggServerBenchmark benchmarks/cpp/

Layer: 5cbea5d5761b692a74bbea8b2793f2b68aee7b33caa901cf980953cd250aa815
Label: ARG
Created: 09/15/2025 3:28 PM UTC
CPP_BUILD_DIR=/src/tensorrt_llm/cpp/build

Layer: 5e14f46d568a0aaeb4e97002eb4cba1734763589675f5c0e041e8e5d008c2fc1
Label: COPY
Created: 09/15/2025 3:28 PM UTC
/src/tensorrt_llm/benchmarks benchmarks

Layer: b0cf6048fd4649779a02c792f8d16a90404ddee0546cbf2c981c285c6d066bdf
Label: ARG
Created: 09/15/2025 3:28 PM UTC
SRC_DIR=/src/tensorrt_llm

Layer: cb385a38f446f246dbfc130d658499af347660cd4735bceb67adae807a6fe16f
Label: RUN
Created: 09/15/2025 3:28 PM UTC
/bin/bash -c ln -sv $(python3 -c 'import site; print(f"{site.getsitepackages()[0]}/tensorrt_llm/bin")') bin &&
  test -f bin/executorWorker &&
  ln -sv $(python3 -c 'import site; print(f"{site.getsitepackages()[0]}/tensorrt_llm/libs")') lib &&
  test -f lib/libnvinfer_plugin_tensorrt_llm.so &&
  echo "/app/tensorrt_llm/lib" > /etc/ld.so.conf.d/tensorrt_llm.conf &&
  ldconfig &&
  ! ( ldd -v bin/executorWorker | grep tensorrt_llm | grep -q "not found" )
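
The last RUN layer listed above links the package's bin and libs directories out of site-packages, then fails the build if `ldd` reports any tensorrt_llm library as "not found". The same two steps can be sketched in Python. This is a minimal illustration, not part of the image: the helper names `package_subdir` and `missing_libraries` are hypothetical, and the sample `ldd` output is invented for the example.

```python
import site
from pathlib import Path


def package_subdir(name: str, package: str = "tensorrt_llm") -> Path:
    """Build <first site-packages dir>/<package>/<name>, as the layer's python3 -c snippet does."""
    return Path(site.getsitepackages()[0]) / package / name


def missing_libraries(ldd_output: str, needle: str = "tensorrt_llm") -> list[str]:
    """Return ldd lines that mention `needle` and are reported as 'not found'.

    Mirrors: ldd ... | grep tensorrt_llm | grep "not found"
    """
    return [
        line.strip()
        for line in ldd_output.splitlines()
        if needle in line and "not found" in line
    ]


# Hypothetical ldd output, made up for illustration only.
sample = """\
    libtensorrt_llm.so => /app/tensorrt_llm/lib/libtensorrt_llm.so (0x00007f0000000000)
    libnvinfer_plugin_tensorrt_llm.so => not found
    libc.so.6 => /lib/x86_64-linux-gnu/libc.so.6 (0x00007f0000200000)
"""
missing = missing_libraries(sample)
print(package_subdir("bin"))
print(missing)
```

An empty list from `missing_libraries` corresponds to the layer's check passing; any entry corresponds to the negated `grep -q` failing the build.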
...
