NVIDIA
NVIDIA
TensorRT LLM Release
Container
NVIDIA
NVIDIA
TensorRT LLM Release

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs.

LayerLabelCreated
7a666ebc5db2481a98d8a4c133ee602a45bdb1441aa5b0643517d69635f46151CONFIG
Entrypoint /opt/nvidia/nvidia_entrypoint.sh; WorkingDir /app/tensorrt_llm; ExposedPorts 6006/tcp, 8888/tcp
06/04/2025 5:24 AM UTC
f30608e56fa68c5ad9a2c92610e638a35b3bf9a7178ec215d09a1d8446c26b3aARG
TRT_LLM_VER=0.21.0rc0
06/04/2025 5:24 AM UTC
d6d1cb754c6208456efce118c5be69d99376aa86d5a20dfb979b3d56d81dbe43ARG
GIT_COMMIT=9ae2ce6665539a4c40a3e87eff63eac85cd773a7
06/04/2025 5:24 AM UTC
97bd43f8d9f69c254c4b4266fc318863628d6e44f6b265288bee138bac245899RUN
SRC_DIR=/src/tensorrt_llm CPP_BUILD_DIR=/src/tensorrt_llm/cpp/build /bin/bash -c chmod -R a+w examples &&
  rm -v benchmarks/cpp/bertBenchmark.cpp benchmarks/cpp/gptManagerBenchmark.cpp benchmarks/cpp/disaggServerBenchmark.cpp benchmarks/cpp/CMakeLists.txt &&
  rm -rf /root/.cache/pip
06/04/2025 5:24 AM UTC
8c39c6f56b151bae41efb20bae6ae6fde93fdf08c4e9eecc7bb5712ab97391a3COPY
examples examples
06/04/2025 5:24 AM UTC
8b5700786408129cc668bdfc367888001d65cc6b7f69ad1940edfa9f231523d3COPY
/src/tensorrt_llm/cpp/build/benchmarks/bertBenchmark /src/tensorrt_llm/cpp/build/benchmarks/gptManagerBenchmark /src/tensorrt_llm/cpp/build/benchmarks/disaggServerBenchmark benchmarks/cpp/
06/04/2025 5:24 AM UTC
76cb37b61257a2e0393aeedf2adbfedcd56e76e626fa9a91105c1d596be256e6ARG
CPP_BUILD_DIR=/src/tensorrt_llm/cpp/build
06/04/2025 5:24 AM UTC
fa9209872e9d3df3b099d8775be82fa86e0f1ba79a2d2c68ab5ba544262ded68COPY
/src/tensorrt_llm/benchmarks benchmarks
06/04/2025 5:24 AM UTC
a58dd3fd35abed9259b4b2079459ad63733cdfd9b6013fc65fd585705bbf6f93ARG
SRC_DIR=/src/tensorrt_llm
06/04/2025 5:24 AM UTC
5a996b28e0c1f9851786ecaaa1fbdafa3f1ffa49834dd7fa0b500afb5c547762RUN
/bin/bash -c ln -sv $(python3 -c 'import site; print(f"{site.getsitepackages()[0]}/tensorrt_llm/bin")') bin &&
  test -f bin/executorWorker &&
  ln -sv $(python3 -c 'import site; print(f"{site.getsitepackages()[0]}/tensorrt_llm/libs")') lib &&
  test -f lib/libnvinfer_plugin_tensorrt_llm.so &&
  echo "/app/tensorrt_llm/lib" > /etc/ld.so.conf.d/tensorrt_llm.conf &&
  ldconfig &&
  ! ( ldd -v bin/executorWorker | grep tensorrt_llm | grep -q "not found" )
06/04/2025 5:24 AM UTC
...

NVIDIA uses cookies to improve your experience on our web site. We and our third-party partners also use cookies and other tools to collect and record information you provide as well as information about your interactions with our websites for performance improvement, analytics, and to assist in marketing efforts. By clicking "Accept All", you consent to our use of cookies and other tools as described in our Cookie Policy. You can manage your cookie settings by clicking on "Manage Settings." By continuing to use this site or by clicking one of the buttons below, you agree to our Terms of Service (which contains important waivers). Please see our Privacy Policy for more information on our privacy practices.