NVIDIA
NVIDIA
TensorRT LLM Release
Container
NVIDIA
NVIDIA
TensorRT LLM Release

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs.

LayerLabelCreated
4a8c487899e6eda3ac1d0342700032bd86f1c8206c679b02fd6fd6598507342fCONFIG
Entrypoint /opt/nvidia/nvidia_entrypoint.sh; WorkingDir /app/tensorrt_llm; ExposedPorts 6006/tcp, 8888/tcp
06/16/2025 7:40 PM UTC
d97230a48b6e69558da311a3d072457b4a3cec5b560363cae110134834f8a47dARG
TRT_LLM_VER=0.21.0rc2
06/16/2025 7:40 PM UTC
26540c689f9fc21acb18fa30889346af31331ca0d11f358fcbc4d7cef927950eARG
GIT_COMMIT=8445416c39e7c6b19cb01178994edc06c083f6a3
06/16/2025 7:40 PM UTC
1dd47915cb70a1053080888128f5e007f9dc7d2452e880ad2fe65dd3a705758cRUN
SRC_DIR=/src/tensorrt_llm CPP_BUILD_DIR=/src/tensorrt_llm/cpp/build /bin/bash -c chmod -R a+w examples &&
  rm -v benchmarks/cpp/bertBenchmark.cpp benchmarks/cpp/gptManagerBenchmark.cpp benchmarks/cpp/disaggServerBenchmark.cpp benchmarks/cpp/CMakeLists.txt &&
  rm -rf /root/.cache/pip
06/16/2025 7:40 PM UTC
c5bed6131a5a69d23b2834c809ae16c704e652b32044561d53460f0975d5795bCOPY
examples examples
06/16/2025 7:40 PM UTC
6b8ead4439baab686e00174193aa5070ee4da22d97ff57410edd979b92adaf6fCOPY
/src/tensorrt_llm/cpp/build/benchmarks/bertBenchmark /src/tensorrt_llm/cpp/build/benchmarks/gptManagerBenchmark /src/tensorrt_llm/cpp/build/benchmarks/disaggServerBenchmark benchmarks/cpp/
06/16/2025 7:40 PM UTC
3b3870221f89d22a1bc52fbec130c8ab53a8ef9a192678ec0b422de48e2822f7ARG
CPP_BUILD_DIR=/src/tensorrt_llm/cpp/build
06/16/2025 7:40 PM UTC
ba791e0e6fe1280e8e8daec9e34f6fd95999ab60ae1c24f859013a6ff156c546COPY
/src/tensorrt_llm/benchmarks benchmarks
06/16/2025 7:40 PM UTC
8568015d74779e5053b8b8b7abefe4a5395fdf035cc72809fc025275c53798e5ARG
SRC_DIR=/src/tensorrt_llm
06/16/2025 7:40 PM UTC
d0921bcdd697d6697104e82aa51f6fc057ecbde21aeec5b6a70018058e2268a1RUN
/bin/bash -c ln -sv $(python3 -c 'import site; print(f"{site.getsitepackages()[0]}/tensorrt_llm/bin")') bin &&
  test -f bin/executorWorker &&
  ln -sv $(python3 -c 'import site; print(f"{site.getsitepackages()[0]}/tensorrt_llm/libs")') lib &&
  test -f lib/libnvinfer_plugin_tensorrt_llm.so &&
  echo "/app/tensorrt_llm/lib" > /etc/ld.so.conf.d/tensorrt_llm.conf &&
  ldconfig &&
  ! ( ldd -v bin/executorWorker | grep tensorrt_llm | grep -q "not found" )
06/16/2025 7:40 PM UTC
...