NVIDIA
TensorRT LLM Release
Container

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs.
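As a rough illustration of the Python API mentioned above, the following is a minimal sketch of text generation using the `LLM` and `SamplingParams` classes that `tensorrt_llm` exports. The helper name `generate_text`, the default model identifier, and the sampling values are assumptions chosen for illustration, not settings taken from this container; running it requires an NVIDIA GPU and an environment where `tensorrt_llm` is installed.

```python
def generate_text(prompts, model="TinyLlama/TinyLlama-1.1B-Chat-v1.0"):
    """Generate one completion per prompt (hypothetical helper).

    The default model name is an assumption for illustration; any
    checkpoint supported by TensorRT LLM could be passed instead.
    """
    # Imported lazily so this module can be loaded on machines
    # that do not have tensorrt_llm or a GPU available.
    from tensorrt_llm import LLM, SamplingParams

    # Illustrative sampling settings, not tuned recommendations.
    sampling_params = SamplingParams(temperature=0.8, top_p=0.95)

    # Constructing the LLM loads the model and prepares it for inference.
    llm = LLM(model=model)

    # generate() returns one result per prompt; each result holds
    # a list of candidate completions under `.outputs`.
    outputs = llm.generate(prompts, sampling_params)
    return [out.outputs[0].text for out in outputs]
```

Inside the container, a call such as `generate_text(["Hello, my name is"])` would return a list with one generated string per prompt.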

Layer  Label  Created

f25f941380b95b4215c2ab4b8b240a01a3f27e6901e8e3978a4c70f2106d8f5d  CONFIG  06/18/2025 9:39 PM UTC
    Entrypoint /opt/nvidia/nvidia_entrypoint.sh; WorkingDir /app/tensorrt_llm; ExposedPorts 6006/tcp, 8888/tcp

90deb4559b6115c228591e4a1a67cbc824beeb542220706d831c6d28c0e62f9c  ARG  06/18/2025 9:39 PM UTC
    TRT_LLM_VER=0.20.0

dd226c0ade942bc04f77a8f85d0634cf4ba5cd731b02f37cd9965252b2f391b3  ARG  06/18/2025 9:39 PM UTC
    GIT_COMMIT=7965842954628b0b5456a2f7d59786d3dcd41647

7ba56b2d282af672c46e0e8eb9cfd02d19babe91a2db3b4203988573493bbc87  RUN  06/18/2025 9:39 PM UTC
    SRC_DIR=/src/tensorrt_llm CPP_BUILD_DIR=/src/tensorrt_llm/cpp/build /bin/bash -c chmod -R a+w examples &&
      rm -v benchmarks/cpp/bertBenchmark.cpp benchmarks/cpp/gptManagerBenchmark.cpp benchmarks/cpp/disaggServerBenchmark.cpp benchmarks/cpp/CMakeLists.txt &&
      rm -rf /root/.cache/pip

9ff0c06517174ab2c96343a3fd585fc885ee868b3e07f9e0131d8e0340c3fd77  COPY  06/18/2025 9:39 PM UTC
    examples examples

1fc623612acdf94c92a1ac2b063f2718587d406cdb55fc32a38a6cb14a233b81  COPY  06/18/2025 9:39 PM UTC
    /src/tensorrt_llm/cpp/build/benchmarks/bertBenchmark /src/tensorrt_llm/cpp/build/benchmarks/gptManagerBenchmark /src/tensorrt_llm/cpp/build/benchmarks/disaggServerBenchmark benchmarks/cpp/

e220a0ce642829b6ad1f40e4074a66f4ceeb53ca8a48538d1151c891ef1f3f46  ARG  06/18/2025 9:39 PM UTC
    CPP_BUILD_DIR=/src/tensorrt_llm/cpp/build

16bcc3749a9f4835c94cdcf520a26f66204aade97f9e433317490741e4f0bd50  COPY  06/18/2025 9:39 PM UTC
    /src/tensorrt_llm/benchmarks benchmarks

f2a82c0f336ca20e1dac960dc596f4b6251bed57dcfe0c11759f673bd2cdebd6  ARG  06/18/2025 9:39 PM UTC
    SRC_DIR=/src/tensorrt_llm

74fcb1775768cad8c534ea710fa8448f152bb29b94416e88e4c6396200805654  RUN  06/18/2025 9:39 PM UTC
    /bin/bash -c ln -sv $(python3 -c 'import site; print(f"{site.getsitepackages()[0]}/tensorrt_llm/bin")') bin &&
      test -f bin/executorWorker &&
      ln -sv $(python3 -c 'import site; print(f"{site.getsitepackages()[0]}/tensorrt_llm/libs")') lib &&
      test -f lib/libnvinfer_plugin_tensorrt_llm.so &&
      echo "/app/tensorrt_llm/lib" > /etc/ld.so.conf.d/tensorrt_llm.conf &&
      ldconfig &&
      ! ( ldd -v bin/executorWorker | grep tensorrt_llm | grep -q "not found" )
...
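
The last RUN layer listed above links the installed package's `bin` and `libs` directories into the working directory and then fails the build if `ldd` reports any unresolved `tensorrt_llm` library for `executorWorker`. The sketch below mirrors those two steps in Python to make the logic easier to follow. The function names `packaged_dir` and `unresolved_trtllm_libs` are hypothetical and are not part of the image or of TensorRT LLM; the second function works on captured `ldd` text rather than invoking `ldd` itself.

```python
import site
from pathlib import Path


def packaged_dir(subdir: str) -> Path:
    """Mirror the inline `python3 -c` call from the layer:
    first site-packages entry + tensorrt_llm/<subdir>."""
    return Path(site.getsitepackages()[0]) / "tensorrt_llm" / subdir


def unresolved_trtllm_libs(ldd_output: str) -> list[str]:
    """Return the ldd lines that mention tensorrt_llm and are marked
    'not found', equivalent to: grep tensorrt_llm | grep "not found"."""
    return [
        line.strip()
        for line in ldd_output.splitlines()
        if "tensorrt_llm" in line and "not found" in line
    ]


# Made-up ldd output for illustration only (not captured from the image).
sample = (
    "\tlibtensorrt_llm.so => not found\n"
    "\tlibnvinfer_plugin_tensorrt_llm.so => "
    "/app/tensorrt_llm/lib/libnvinfer_plugin_tensorrt_llm.so (0x00007f0000000000)\n"
    "\tlibc.so.6 => /lib/x86_64-linux-gnu/libc.so.6 (0x00007f0000100000)\n"
    "\tlibfoo.so => not found\n"
)
missing = unresolved_trtllm_libs(sample)
```

An empty result corresponds to the layer's final check passing; a non-empty result corresponds to the case in which the build step would exit with an error.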
