NVIDIA
TensorRT LLM Release
Container

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs.

Layer: 671557915a99c85e74d05ba1c3b663298491151a7d65dff8a86e274670c84c8f
Label: CONFIG
  Entrypoint /opt/nvidia/nvidia_entrypoint.sh; WorkingDir /app/tensorrt_llm; ExposedPorts 6006/tcp, 8888/tcp
Created: 06/29/2025 4:14 PM UTC

Layer: f5f2dd15ce71fd3fd61f73a89aa74a5a07832759bec0acb63b8863cd010d0402
Label: ARG
  TRT_LLM_VER=1.0.0rc1
Created: 06/29/2025 4:14 PM UTC

Layer: b7c75dd8a1ab5b2d896aaff271735eaf87feb7260e2cc0bc131cfefe3384581d
Label: ARG
  GIT_COMMIT=de9779900c4cb1bcaab8da13169ef19ec7a7533c
Created: 06/29/2025 4:14 PM UTC

Layer: 3f64e22b4542039a31a701c43555af07055f646949c45b7c2be14bd8a3920ca4
Label: RUN
  SRC_DIR=/src/tensorrt_llm CPP_BUILD_DIR=/src/tensorrt_llm/cpp/build /bin/bash -c chmod -R a+w examples &&
    rm -v benchmarks/cpp/bertBenchmark.cpp benchmarks/cpp/gptManagerBenchmark.cpp benchmarks/cpp/disaggServerBenchmark.cpp benchmarks/cpp/CMakeLists.txt &&
    rm -rf /root/.cache/pip
Created: 06/29/2025 4:14 PM UTC

Layer: cf79d1e3d2f1d3384927d6dfb646725db72ffcf274cae47c019586d25f9033c3
Label: COPY
  examples examples
Created: 06/29/2025 4:14 PM UTC

Layer: a23953480067950e5f8a9208492b3cf1347eef69ecb038b40c548813cc8617d2
Label: COPY
  /src/tensorrt_llm/cpp/build/benchmarks/bertBenchmark /src/tensorrt_llm/cpp/build/benchmarks/gptManagerBenchmark /src/tensorrt_llm/cpp/build/benchmarks/disaggServerBenchmark benchmarks/cpp/
Created: 06/29/2025 4:14 PM UTC

Layer: a4d4ee9d4c2ad7526c1785369cd2063a7827f94c2c136ec77df469738927d617
Label: ARG
  CPP_BUILD_DIR=/src/tensorrt_llm/cpp/build
Created: 06/29/2025 4:14 PM UTC

Layer: 462f616d5b32873bf4fa3f3b8efdc7a23457372a7c0cad8449742b463e31c454
Label: COPY
  /src/tensorrt_llm/benchmarks benchmarks
Created: 06/29/2025 4:14 PM UTC

Layer: b4901eaa2e38dbcbbb473ee9fd7fbfbae59fb46bae9f7165dd3b70699e359dc3
Label: ARG
  SRC_DIR=/src/tensorrt_llm
Created: 06/29/2025 4:14 PM UTC

Layer: 136ba72b3c495848b583c63dc03b7d711b1cc12c2d3ad90134e07d82a40a5ae8
Label: RUN
  /bin/bash -c ln -sv $(python3 -c 'import site; print(f"{site.getsitepackages()[0]}/tensorrt_llm/bin")') bin &&
    test -f bin/executorWorker &&
    ln -sv $(python3 -c 'import site; print(f"{site.getsitepackages()[0]}/tensorrt_llm/libs")') lib &&
    test -f lib/libnvinfer_plugin_tensorrt_llm.so &&
    echo "/app/tensorrt_llm/lib" > /etc/ld.so.conf.d/tensorrt_llm.conf &&
    ldconfig &&
    ! ( ldd -v bin/executorWorker | grep tensorrt_llm | grep -q "not found" )
Created: 06/29/2025 4:14 PM UTC
...
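
The RUN layer that links bin and lib ends with a negated check: the build step fails if `ldd -v bin/executorWorker` reports any `tensorrt_llm` library as "not found". The sketch below wraps that same grep pipeline in a small shell function so the logic can be tried outside the container. The function names `has_missing_trtllm_libs` and `check_linkage` and the two sample lines are hypothetical, made up for illustration; they are not part of the image or of TensorRT LLM.

```shell
#!/usr/bin/env bash

# Hypothetical helper: exits 0 when ldd-style text on stdin contains a line
# that mentions tensorrt_llm and is marked "not found" (same greps as the layer).
has_missing_trtllm_libs() {
  grep tensorrt_llm | grep -q "not found"
}

# Hypothetical wrapper mirroring the negated check in the layer:
# return non-zero when a tensorrt_llm library is unresolved.
check_linkage() {
  if has_missing_trtllm_libs; then
    echo "unresolved tensorrt_llm library"
    return 1
  fi
  echo "linkage ok"
}

# Illustrative ldd-style lines only; not captured from the real image.
sample_ok='libnvinfer_plugin_tensorrt_llm.so => /app/tensorrt_llm/lib/libnvinfer_plugin_tensorrt_llm.so (0x00007f0000000000)'
sample_bad='libnvinfer_plugin_tensorrt_llm.so => not found'

printf '%s\n' "$sample_ok" | check_linkage   # prints "linkage ok"
```

Inside the container, assuming the working directory is /app/tensorrt_llm as in the CONFIG layer, the real output could be piped in the same way: `ldd -v bin/executorWorker | check_linkage`.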
