NVIDIA
NVIDIA
TensorRT LLM Release
Container
NVIDIA
NVIDIA
TensorRT LLM Release

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs.

LayerLabelCreated
2132bc743994cb5fba560218b1b48cf4392c7dca8e878ee171e06aa1911a8f93CONFIG
Entrypoint /opt/nvidia/nvidia_entrypoint.sh; WorkingDir /app/tensorrt_llm; ExposedPorts 6006/tcp, 8888/tcp
08/20/2025 12:06 AM UTC
1200380ea537e59035adc4aaa5d33b476b68f0f50246da60e782ecba5dbe485dARG
TRT_LLM_VER=1.1.0rc1
08/20/2025 12:06 AM UTC
441535ff861cbe870752be8f40619d25a97cbdc17910a5b76d3ad36fb7b2d1d3ARG
GIT_COMMIT=7334f9390c7528f2363eb41b1e6b9e592ea25d6b
08/20/2025 12:06 AM UTC
fb659b4ceb392a6e4251dc24ce7112958e7153a8496b6568299e74e809335bcbRUN
SRC_DIR=/src/tensorrt_llm CPP_BUILD_DIR=/src/tensorrt_llm/cpp/build /bin/bash -c chmod -R a+w examples &&
  rm -v benchmarks/cpp/bertBenchmark.cpp benchmarks/cpp/gptManagerBenchmark.cpp benchmarks/cpp/disaggServerBenchmark.cpp benchmarks/cpp/CMakeLists.txt &&
  rm -rf /root/.cache/pip
08/20/2025 12:06 AM UTC
42ed9f2f924aba0cd1f2dcf0404fc33871681cb3f71c7065e49bdd428a39f799COPY
examples examples
08/20/2025 12:06 AM UTC
cf0f76361c36e144d7a6a97e0bd006cc513f1fd89f2d98dbe304a7e328c938bbCOPY
/src/tensorrt_llm/cpp/build/benchmarks/bertBenchmark /src/tensorrt_llm/cpp/build/benchmarks/gptManagerBenchmark /src/tensorrt_llm/cpp/build/benchmarks/disaggServerBenchmark benchmarks/cpp/
08/20/2025 12:06 AM UTC
03f4d8ac313ab3a5786a584808b616911809d2980147e9f06e2526b3226dd5c4ARG
CPP_BUILD_DIR=/src/tensorrt_llm/cpp/build
08/20/2025 12:06 AM UTC
57026728c1a7ce2d8f8c60d47a09904312ebadd04adf76e33ad840032cbacc60COPY
/src/tensorrt_llm/benchmarks benchmarks
08/20/2025 12:06 AM UTC
a3823a50724987a6cd879aaa88f45424c67ab65164031073382fea0fbcf8f119ARG
SRC_DIR=/src/tensorrt_llm
08/20/2025 12:06 AM UTC
c5a01ea626b984389f4e3339cba82a40b05f1af79596cce10d38184398613a47RUN
/bin/bash -c ln -sv $(python3 -c 'import site; print(f"{site.getsitepackages()[0]}/tensorrt_llm/bin")') bin &&
  test -f bin/executorWorker &&
  ln -sv $(python3 -c 'import site; print(f"{site.getsitepackages()[0]}/tensorrt_llm/libs")') lib &&
  test -f lib/libnvinfer_plugin_tensorrt_llm.so &&
  echo "/app/tensorrt_llm/lib" > /etc/ld.so.conf.d/tensorrt_llm.conf &&
  ldconfig &&
  ! ( ldd -v bin/executorWorker | grep tensorrt_llm | grep -q "not found" )
08/20/2025 12:06 AM UTC
...

NVIDIA uses cookies to improve your experience on our web site. We and our third-party partners also use cookies and other tools to collect and record information you provide as well as information about your interactions with our websites for performance improvement, analytics, and to assist in marketing efforts. By clicking "Accept All", you consent to our use of cookies and other tools as described in our Cookie Policy. You can manage your cookie settings by clicking on "Manage Settings." By continuing to use this site or by clicking one of the buttons below, you agree to our Terms of Service (which contains important waivers). Please see our Privacy Policy for more information on our privacy practices.