NGC Catalog
CLASSIC
Welcome Guest
Containers
Audio2Face-3D

Audio2Face-3D

For copy image paths and more information, please view on a desktop device.
Logo for Audio2Face-3D
Associated Products
Features
Description
NVIDIA NIM for GPU accelerated Audio2Face-3D inference through gRPC APIs.
Publisher
NVIDIA
Latest Tag
1.3.15
Modified
May 3, 2025
Compressed Size
20.91 GB
Multinode Support
No
Multi-Arch Support
No
1.3.15 (Latest) Security Scan Results

Linux / amd64

Sorry, your browser does not support inline SVG.

What Is NVIDIA NIM?

The Audio2Face NIM simplifies the deployment of the Audio2Face tuned models which are optimized for language understanding, reasoning, and text generation use cases, and outperforms many of the available open source chat models on common industry benchmarks. NVIDIA Audio2Emotion is embedded within Audio2Face, and it is designed to automatically recognize the emotions in human speech.

The Audio2Face-3D (A2F-3D) microservice is a key component of our facial animation technology stack, designed to process audio input and generate corresponding facial animations. In legacy mode, A2F-3D integrates both server and client functionalities using gRPC to seamlessly handle data streams within a larger pipeline. Audio2Face-3D NIM exposes the following gRPCs:

  • Bidirectional Streaming gRPC or 2 Unidirectional Streaming endpoints for processing audio data and getting animation data.
  • Unary gRPC for getting the current configuration of the microservice.

Getting Started

Please follow the Quick start guide for Prerequisites & dependencies

Refer to the Audio2Face Microservice documentation for more details.

Compatible Infrastructure Software Versions

OS      : Ubuntu 22.04/24.04 (bare-metal or with WSL)
CUDA    : 12.6
Driver  : 535.183.06 (for Data Center GPUs), 560.35.03 (for RTX GPUs) and 560.94 (for Windows WSL)
NVIDIA Container Toolkit : latest version
Docker	: latest version

NOTE: Any Linux distribution should work but has not been tested by our teams. Your Docker environment must support NVIDIA GPUs. Please refer to the NVIDIA Container Toolkit for more information. Security Vulnerabilities in OSS packages

Please review the Security Scanning tab to view the latest security scan results. For certain open-source vulnerabilities listed in the scan results, NVIDIA provides a response in the form of a Vulnerability Exploitability eXchange (VEX) document. The VEX information can be reviewed and downloaded from the Security Scanning Security Scanning tab.

Getting Help

Enterprise Support Get access to knowledge base articles and support cases or submit a ticket: https://www.nvidia.com/en-us/data-center/products/ai-enterprise-suite/support/

Usage Restrictions

You may not use the Software or any of its components for the purpose of emotion recognition. Any technology included in the Software may only be used as fully integrated in the Software and consistent with all applicable documentation.

Governing Terms

This software is governed by the NVIDIA Software License Agreement and Product Specific Terms for AI Products. Use of the models is governed by the NVIDIA Community Model License.

NVIDIA believes Trustworthy AI is a shared responsibility and we have established policies and practices to enable development for a wide array of AI applications. When downloaded or used in accordance with our terms of service, developers should work with their supporting model team to ensure this model meets requirements for the relevant industry and use case and addresses unforeseen product misuse.

For more detailed information on ethical considerations for this model, please see the Model Card++ Explainability, Bias, Safety & Security, and Privacy Subcards. Please report security vulnerabilities or NVIDIA AI Concerns here.

You are responsible for ensuring that your use of NVIDIA AI Foundation Models complies with all applicable laws.