NVIDIA
NVIDIA
TensorRT LLM Release
Container
NVIDIA
NVIDIA
TensorRT LLM Release

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs.

LayerLabelCreated
15d90f54440854d985f4de6461755aa8531572174f90fa6fd93409decd9a8589CONFIG
Entrypoint /opt/nvidia/nvidia_entrypoint.sh; WorkingDir /app/tensorrt_llm; ExposedPorts 6006/tcp, 8888/tcp
07/14/2025 9:00 AM UTC
d748ea1f7b81a92b1765060ab68013eb2f3665f4188a9a340ece5440606cdf71ARG
TRT_LLM_VER=1.0.0rc3
07/14/2025 9:00 AM UTC
fff4cc55f277b3a0fe5b948a5e871ef0cf747ce12ef28b1f0eddafdb6b109e1bARG
GIT_COMMIT=cfcb97af0e4d191dca141c242fd94f999bf9b97a
07/14/2025 9:00 AM UTC
634f6a7832bfc0602fc0a0f8145d3fc9e21473d55aa3c3ef7cfa5a026755870bRUN
SRC_DIR=/src/tensorrt_llm CPP_BUILD_DIR=/src/tensorrt_llm/cpp/build /bin/bash -c chmod -R a+w examples &&
  rm -v benchmarks/cpp/bertBenchmark.cpp benchmarks/cpp/gptManagerBenchmark.cpp benchmarks/cpp/disaggServerBenchmark.cpp benchmarks/cpp/CMakeLists.txt &&
  rm -rf /root/.cache/pip
07/14/2025 9:00 AM UTC
3539f44a353b6f862053e54b2d9a7a4f71047c85b7b174383e1860ea4746c9a9COPY
examples examples
07/14/2025 9:00 AM UTC
3a2d7025bc4ebf7a882ff2c4014053a6a2a84be6ff0ea7a5fecf44c0b6b14b37COPY
/src/tensorrt_llm/cpp/build/benchmarks/bertBenchmark /src/tensorrt_llm/cpp/build/benchmarks/gptManagerBenchmark /src/tensorrt_llm/cpp/build/benchmarks/disaggServerBenchmark benchmarks/cpp/
07/14/2025 9:00 AM UTC
04093bbefd0dd1c63b33cb13991044f205afd300a6fe9af90ab170e05c7948d8ARG
CPP_BUILD_DIR=/src/tensorrt_llm/cpp/build
07/14/2025 9:00 AM UTC
1c76b12ff08a0fb71e91d936b432be68e541c496c20f88bfe255e9a2d6914899COPY
/src/tensorrt_llm/benchmarks benchmarks
07/14/2025 9:00 AM UTC
7edc1193c7b301ffa6a8564e03a0b1c1ca36bbec864fd87011da7953001d08e7ARG
SRC_DIR=/src/tensorrt_llm
07/14/2025 9:00 AM UTC
d0ce62888cfa8e71db3f55bf6d047a96bcaf7a3cf142c372350c9230f5a2e5f9RUN
/bin/bash -c ln -sv $(python3 -c 'import site; print(f"{site.getsitepackages()[0]}/tensorrt_llm/bin")') bin &&
  test -f bin/executorWorker &&
  ln -sv $(python3 -c 'import site; print(f"{site.getsitepackages()[0]}/tensorrt_llm/libs")') lib &&
  test -f lib/libnvinfer_plugin_tensorrt_llm.so &&
  echo "/app/tensorrt_llm/lib" > /etc/ld.so.conf.d/tensorrt_llm.conf &&
  ldconfig &&
  ! ( ldd -v bin/executorWorker | grep tensorrt_llm | grep -q "not found" )
07/14/2025 9:00 AM UTC
...

NVIDIA uses cookies to improve your experience on our web site. We and our third-party partners also use cookies and other tools to collect and record information you provide as well as information about your interactions with our websites for performance improvement, analytics, and to assist in marketing efforts. By clicking "Accept All", you consent to our use of cookies and other tools as described in our Cookie Policy. You can manage your cookie settings by clicking on "Manage Settings." By continuing to use this site or by clicking one of the buttons below, you agree to our Terms of Service (which contains important waivers). Please see our Privacy Policy for more information on our privacy practices.