
L4T Compute Assist

Description
This container allows running compute-only applications using CUDA and TensorRT on the integrated GPU of Holoscan devkits (Clara AGX DevKit, IGX Orin DevKit) while other applications run on the dGPU, natively or in another container.
Publisher
NVIDIA
Latest Tag
l4t_35.3-trt_8.5.2
Modified
April 1, 2024
Compressed Size
2.24 GB
Multinode Support
No
Multi-Arch Support
No
l4t_35.3-trt_8.5.2 (Latest) Security Scan Results

Linux / arm64

Disclaimer: This container is deprecated as of the Holoscan SDK 1.0 release. Refer to your developer kit user guide for up-to-date instructions.

Overview

NVIDIA developer kits like the NVIDIA IGX Orin or the NVIDIA Clara AGX have both a discrete GPU (dGPU - optional on IGX Orin) and an integrated GPU (iGPU - Tegra SoC). At this time, when these developer kits are flashed to leverage the discrete GPU (using HoloPack), the integrated GPU cannot be used due to conflicts between the dGPU and iGPU CUDA library stacks.

This container allows running compute-only applications using CUDA and TensorRT on the iGPU of these developer kits by isolating the iGPU stack from the rest of the host device. Other applications can run concurrently on the dGPU, natively or in another container.

For Holoscan documentation, visit the Holoscan developer page.

Prerequisites

  1. Set up your developer kit in dGPU mode:

    | Developer Kit    | User Guide | HoloPack |
    |------------------|------------|----------|
    | NVIDIA IGX Orin  | Guide      | 2.0      |
    | NVIDIA Clara AGX | Guide      | 1.2      |
  2. Load the nvgpu kernel driver for iGPU:

    sudo insmod $(find /usr/lib/modules -name nvgpu.ko -type f,l | head -n1)
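
    You can verify that the module is now loaded (lsmod lists the kernel modules currently loaded):

    lsmod | grep nvgpu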
    
  3. Install the Holoscan SDK, either:

    • as the Debian package or the Python wheel on your host, or
    • in a Holoscan dGPU container, to mount the SDK from it (see the notes below the docker run command).
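
    For example, to install the Python wheel (one of the options above; the holoscan package is published on PyPI):

    python3 -m pip install holoscan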

Running the container

  1. Log in to the NGC docker registry
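
    For example (on nvcr.io the username is literally $oauthtoken and the password is your NGC API key):

    docker login nvcr.io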

  2. Copy the version you want to use from the Get Container drop-down at the top of this webpage, and set this as your NGC_CONTAINER_IMAGE_PATH in your terminal:

    • for HoloPack 2.0, use l4t_35.3-trt_8.5.2
    • for HoloPack 1.2, use l4t_34.1.2-trt_8.4.0
    # For example
    export NGC_CONTAINER_IMAGE_PATH="nvcr.io/nvidia/clara-holoscan/l4t-compute-assist:l4t_35.3-trt_8.5.2"
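
    Optionally, pre-pull the image to verify access before running it:

    docker pull ${NGC_CONTAINER_IMAGE_PATH}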
    
  3. Start the L4T Compute Assist container:

    export HOLOSCAN_SDK_INSTALL_PATH="/opt/nvidia/holoscan" # adjust if the SDK is installed elsewhere on your host
    export APP_PATH="/path/to/your_app"
    export DATA_PATH="/path/to/your_data" # if needed by your_app
    docker run -it --rm --net=host \
      --privileged \
      --runtime=runc \
      --ipc=host \
      --cap-add=CAP_SYS_PTRACE \
      --ulimit memlock=-1 \
      -v ${HOLOSCAN_SDK_INSTALL_PATH}:${HOLOSCAN_SDK_INSTALL_PATH}:ro \
      -e PYTHONPATH=${HOLOSCAN_SDK_INSTALL_PATH}/python/lib \
      -v ${APP_PATH}:${APP_PATH}:ro \
      -v ${DATA_PATH}:${DATA_PATH}:ro \
      ${NGC_CONTAINER_IMAGE_PATH}
    
    • --privileged grants the container the elevated permissions needed to access the iGPU driver.
    • --runtime=runc ensures you are not using the nvidia docker runtime, which would load the dGPU drivers instead of the iGPU ones. If this does not work, remove the nvidia runtime from your defaults in /etc/docker/daemon.json (you'll then need to add --runtime=nvidia when running your other containers for dGPU).
    • HOLOSCAN_SDK_INSTALL_PATH is the path to where you have installed the Holoscan SDK on your host.
      • Debian package: /opt/nvidia/holoscan
      • Python wheel: in your environment's dist-packages/holoscan
      • If you want to mount the SDK from a Holoscan dGPU container instead of your devkit host (see the sketch after this list):
        • add --name holoscan_dgpu -v /opt/nvidia/holoscan when running your dGPU container
        • replace -v ${HOLOSCAN_SDK_INSTALL_PATH} ... with --volumes-from holoscan_dgpu when running the L4T Compute Assist container
    • -e PYTHONPATH is set for python3 to find the Holoscan python module if it is not installed from a wheel.
    • APP_PATH is the path of your built Holoscan application on your host.
    • DATA_PATH is the path of any data on your host which you might need to run your application.
    • --ipc=host, --cap-add=CAP_SYS_PTRACE, --ulimit memlock=-1 are needed for the distributed applications using UCX to work. (See https://openucx.readthedocs.io/en/master/running.html#running-in-docker-containers)
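
    As a sketch of the --volumes-from approach above (the dGPU image reference is a placeholder; use the Holoscan dGPU container you actually run):

    # Start your dGPU container with a name and the SDK path exposed as a volume
    docker run -d --runtime=nvidia --name holoscan_dgpu \
      -v /opt/nvidia/holoscan \
      <your_holoscan_dgpu_image>

    # Reuse its volumes when starting the L4T Compute Assist container
    docker run -it --rm --net=host --privileged --runtime=runc \
      --ipc=host --cap-add=CAP_SYS_PTRACE --ulimit memlock=-1 \
      --volumes-from holoscan_dgpu \
      ${NGC_CONTAINER_IMAGE_PATH}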
  4. Once in the container, you can run ${APP_PATH}, or any other command that leverages CUDA and/or TensorRT on the iGPU.
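
    For example (assuming ${APP_PATH} points at a compiled Holoscan executable; a Python application would be launched through the interpreter instead):

    ${APP_PATH}
    # or, for a Python app:
    python3 ${APP_PATH}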

Tips:

  • You can run deviceQuery inside the container to confirm you are using the iGPU device (Xavier/Orin) and CUDA drivers (11.4); see the example after these tips.
  • You can run tegrastats on the host to visualize the iGPU load when running an app in the container (GR3D_FREQ percentage).
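
For example (assuming the CUDA samples ship with the toolkit inside the container; paths may differ on your image):

    # Inside the container: build and run deviceQuery
    cd /usr/local/cuda/samples/1_Utilities/deviceQuery && make && ./deviceQuery

    # On the host, in a separate terminal: watch the iGPU load
    tegrastats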

Troubleshooting

deviceQuery: no CUDA-capable device is detected

Ensure that the nvgpu kernel driver is loaded (see prerequisites).

tegrastats: does not show GR3D_FREQ

Ensure that the nvgpu kernel driver is loaded (see prerequisites).

deviceQuery: returns "Quadro RTX" capable device (dGPU), 11.6 CUDA drivers

Ensure that you're not using the nvidia container runtime (refer to --runtime=runc instructions above).
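
You can check which runtime Docker uses by default with:

    docker info | grep -i 'default runtime'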

tegrastats: GR3D_FREQ is at 0%

Ensure that:

  • you're not using the nvidia container runtime (refer to --runtime=runc instructions above)
  • your app is running in the iGPU container
  • your app uses CUDA for GPU computation

License

By pulling and using the container, you accept the terms and conditions of this End User License Agreement.