Linux / arm64
DOCA Telemetry Service (DTS) runs inside of its own Kubernetes pod on the BlueField and collects data from built-in providers and external telemetry applications. The following providers are available (disabled by default):
Additional telemetry applications written on top of the DOCA API can send data to the service over IPC.
Collected data is written or exported according to the configuration. The following options are available:
The telemetry agent can export the data through Prometheus (pull) or Fluent Bit (push). The Prometheus endpoint is bound to port 9100 and can be enabled in the configuration file. Data collected by several telemetry agents from their BlueFields can be aggregated by a telemetry agent running on a separate host. This setup requires InfiniBand port configuration.
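Once the Prometheus endpoint is enabled, it can be checked by hand from any machine that can reach the BlueField. The sketch below is only an illustration: port 9100 comes from the text above, while the host address, the /metrics path (the usual Prometheus convention), and the function names are assumptions that may not match your deployment.

```shell
# Assumed values; override them in the environment before sourcing this snippet.
DTS_HOST="${DTS_HOST:-127.0.0.1}"          # hypothetical BlueField address
DTS_PORT="${DTS_PORT:-9100}"               # Prometheus port stated in this document
METRICS_PATH="${METRICS_PATH:-/metrics}"   # assumption: conventional Prometheus path

# Print the URL that the scrape will use.
dts_metrics_url() {
  printf 'http://%s:%s%s\n' "$DTS_HOST" "$DTS_PORT" "$METRICS_PATH"
}

# Fetch the metrics once; fail on HTTP errors and give up after 5 seconds.
fetch_dts_metrics() {
  curl --silent --show-error --fail --max-time 5 "$(dts_metrics_url)"
}

# Usage: fetch_dts_metrics | head -n 20
```

If the request fails, confirm that the endpoint is enabled in the configuration file before looking elsewhere.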
All preparation steps are listed under DOCA's Container Deployment User Guide.
Create default folders for DTS:
mkdir -p /opt/mellanox/doca/services/telemetry/config
mkdir -p /opt/mellanox/doca/services/telemetry/ipc_sockets
mkdir -p /opt/mellanox/doca/services/telemetry/data
The .yaml configuration for the container is doca_telemetry.yaml. It is stored with the other .yaml configurations pulled from NGC in the previous steps (see "Installation and Getting Started"). The .yaml file allows generating the DTS configuration from scratch and overwriting the Fluent Bit export configuration.
To enable Fluent Bit forwarding, add the destination host and port to the "command" found in the initContainers section:
command: ["/bin/bash", "-c", "/usr/bin/telemetry-init.sh && /usr/bin/enable-forward-to-morpheus.sh 127.0.0.1 24224"]
Please note that the host and port shown above are just an example.
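Because the whole init command is a single quoted string, it is easy to mistype when editing by hand. As a rough sketch, a small helper can print the line for a given destination so that it can be pasted into the initContainers section. The function name dts_init_command and the address 10.0.0.5 are hypothetical; the script paths are the ones shown above.

```shell
# dts_init_command HOST PORT
# Prints the initContainers "command" line for the given Fluent Bit destination.
dts_init_command() {
  fb_host="$1"
  fb_port="$2"
  if [ -z "$fb_host" ]; then
    echo "host must not be empty" >&2
    return 1
  fi
  # Reject an empty port or one that contains non-digit characters.
  case "$fb_port" in
    ''|*[!0-9]*)
      echo "port must be a number: $fb_port" >&2
      return 1
      ;;
  esac
  printf 'command: ["/bin/bash", "-c", "/usr/bin/telemetry-init.sh && /usr/bin/enable-forward-to-morpheus.sh %s %s"]\n' \
    "$fb_host" "$fb_port"
}

# Usage: dts_init_command 10.0.0.5 24224
```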
Simply copy the updated doca_telemetry.yaml file to the /etc/kubelet.d directory. Kubelet will automatically pull the container image from NGC and spawn a pod executing the container. The DOCA Telemetry Service application will start executing right away.
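The copy step can be wrapped in a small function that checks its inputs first. This is a minimal sketch rather than an official procedure: install_dts_manifest is a hypothetical helper name, and the source path in the usage line depends on where the .yaml files from NGC were unpacked.

```shell
# install_dts_manifest SRC [DEST_DIR]
# Copies the manifest into the kubelet directory (default: /etc/kubelet.d).
install_dts_manifest() {
  dts_src="$1"
  dts_dest="${2:-/etc/kubelet.d}"
  if [ ! -f "$dts_src" ]; then
    echo "manifest not found: $dts_src" >&2
    return 1
  fi
  if [ ! -d "$dts_dest" ]; then
    echo "destination directory not found: $dts_dest" >&2
    return 1
  fi
  cp -- "$dts_src" "$dts_dest/"
}

# Usage (typically needs root): install_dts_manifest ./doca_telemetry.yaml
```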
# View currently active pods, and their IDs (it might take up to 20 seconds for the pod to start)
crictl pods
# View currently active containers, and their IDs
crictl ps
# Examine logs of a given container
crictl logs <container-id>
# Examine kubelet logs, in case something didn't work as expected
journalctl -u kubelet
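Since the pod can take a while to appear, a short polling loop can save repeated manual checks. The sketch below assumes that the pod name contains "doca-telemetry", which is a guess; use the name reported by crictl pods on your system. wait_for_pod is a hypothetical helper, and it relies only on the --name and --quiet options of crictl pods.

```shell
# wait_for_pod NAME_PATTERN [TIMEOUT_SECONDS]
# Polls "crictl pods" once per second until a pod whose name matches appears.
wait_for_pod() {
  pod_pattern="$1"
  pod_timeout="${2:-30}"
  pod_elapsed=0
  while [ "$pod_elapsed" -lt "$pod_timeout" ]; do
    # --quiet prints only pod IDs, so any output means a match was found.
    if crictl pods --name "$pod_pattern" --quiet 2>/dev/null | grep -q .; then
      return 0
    fi
    sleep 1
    pod_elapsed=$((pod_elapsed + 1))
  done
  echo "no pod matching '$pod_pattern' after ${pod_timeout}s" >&2
  return 1
}

# Usage: wait_for_pod doca-telemetry 30 && crictl ps
```

If the loop times out, the kubelet logs are usually the next place to look.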
Please refer to the documentation for more information.
The DOCA Telemetry Service guide is available here.
DOCA is licensed under the NVIDIA DOCA License. By pulling and using the container, you accept the terms and conditions of this license.
Use the NVIDIA Developers forum for questions regarding this Software.