NGC | Catalog

Maxine

For contents of this collection and more information, please view on a desktop device.
Logo for Maxine

Description

NVIDIA Maxine™ is a GPU-accelerated SDK with state-of-the-art AI features for developers to build virtual collaboration and content creation applications such as video conferencing and live streaming.

Curator

NVIDIA

Modified

October 6, 2022
Containers
Helm Charts
Models
Resources

NVIDIA Maxine

NVIDIA Maxine is a suite of GPU-accelerated SDKs and cloud-native microservices for deploying AI features that enhance audio, video, and augmented reality effects for real-time communications services and platforms. Maxine’s state-of-the-art models create high-quality effects that can be achieved with standard microphones and webcams.

Three Maxine SDKs are available: Video Effects, Audio Effects, and Augmented Reality (AR).

In addition to what is available on NGC publicly, our Early Access Program contains additional SDK features and microservices available under the terms and conditions of that program. Apply today.

What is included

This collection houses publicly available Maxine SDK resources and the information needed to use them. These resources contain the SDK Programming Guides detailing the SDK-specific APIs, general feature-level description, and bare-metal installation. If you are interested in Maxine’s early access features, apply through our Early Access Program

Resources

Audio Effects SDK

(Windows/Linux: 1.2.0) - Improve audio quality from incoming and/or outgoing audio feeds, making conversations easier to understand.

  • Key Features
    • Audio Super Resolution
    • Acoustic Echo Cancellation
    • Noise Removal
    • Room Echo Removal
    • Noise Removal and Room Echo Removal: Combines both effects to remove noise and suppress audio reverberations
  • Latest Release
    • [New] Speaker Focus (early access)
    • Quality enhancement for Audio Super Resolution
    • Bug fixes and quality enhancement for Noise Removal
  • Operating Systems
    • Window 10, Windows 11 64-bit, Ubuntu 18.04, CentOS 7, CentOS 8 RHEL7, RHEL8, Debian 10+
  • Dependencies
    • Windows SDK: NVIDIA Display Driver 520.46+, CMake 3.9+
    • Server SDKs (Linux): CUDA 11.3.1, CMake 3.12+, (TRT 8.0.1.6 and CuDNN 8.2 are packaged)
    • Update to TensorRT 8.0.1.6, CUDA 11.3u1 & cuDNN 8.2.1 (Linux), all of which belong to the NVIDIA AI platform.
  • Supported Hardware
    • Windows SDK: NVIDIA GeForce RTX 20XX and 30XX Series, Quadro RTX 3000, TITAN RTX, or higher (any NVIDIA GPUs with Tensor Cores)
    • Server SDK: V100, T4, A2, A10, A16, A30, A40, A100 (with MIG support)
    • Support for Ada-generation GPUs for Windows SDKs

Windows Audio Effects SDK and Linux Audio Effects SDK

Video Effects SDK

(Windows/Linux: 0.7.1) - Apply incredible AI effects for video quality enhancement. The Video Effects SDK is free to use within any application on both client-side or server/data center.

  • Key Features
    • Virtual Background
    • Super Resolution (up to 4X scaling factor)
    • Upscaler (up to 4X scaling factor)
    • Artifact Reduction
    • Video Noise Removal
  • Latest Release
    • Improved temporal stability for Virtual Background
    • Linux SDKs are now Operating System agnostic
  • Operating Systems
    • Windows 10, Windows 11 64-bit, Ubuntu 18.04, Ubuntu 20.04, CentOS 7
  • Dependencies
    • Windows SDK: NVIDIA Display Driver 465.89 or more recent, CMake 3.12+
    • Server SDKs (Linux): CUDA® 11.3.1, NVIDIA TensorRT® (TRT) 8.0.1.6, cuDNN 8.2.1, CMake 3.12+
  • Supported Hardware
    • Windows SDK: NVIDIA® GeForce RTX® 20XX and 30XX Series, Quadro RTX™ 3000, TITAN RTX™, or higher (any NVIDIA GPUs with Tensor Cores)
    • Server SDK: V100, T4, A10, A30, A100 (with MIG support)
    • Support for Ada-generation GPUs for Windows SDKs

Windows Video Effects SDK and Linux Video Effects SDK

Augmented Reality SDK

(Windows/Linux: 0.8.1)- Enable real-time 3D tracking of a person’s face using a standard web camera. Create unique AR effects such as overlaying 3D content on a face, driving 3D characters and virtual interactions in real time. Note: The Linux version of the Augmented Reality SDK is currently only available in the Early Access Program.

  • Key Features
    • Face Tracking
    • Face Landmark tracking
    • Face Mesh
    • Body Pose Estimation
    • Eye Contact
    • [New] Face Expression Estimation
  • Latest Release
  • Operating Systems
    • Windows 10, Windows 11 64-bit, Ubuntu 18.04, Ubuntu 20.04, CentOS 7
  • Supported Hardware
    • Windows SDK: NVIDIA GeForce RTX 20XX and 30XX Series, Quadro RTX 3000, TITAN RTX, or higher (any NVIDIA GPUs with Tensor Cores)
    • Server SDK: V100, T4, A10, A30, A100 (with MIG support)
    • Support for Ada-generation GPUs for Windows SDKs
  • Software Dependencies
    • Windows SDK: NVIDIA Display Driver 465.89+ or more recent, CMake 3.12+
    • Server SDKs (Linux): CUDA 11.3.1, TRT 8.0.1.6, CuDNN 8.2.1, CMake 3.12+
    • For Containers: Docker (19.03+) and the up to date version of NVIDIA-Docker

Windows AR SDK and Linux AR SDK (early access program)

Getting started with Maxine

Procedure

Follow the resource cards for specifics on using each of the SDKs. SDK-specific programming guides are available inside Audio Effects SDK, Video Effects SDK, and Augmented Reality SDK Program Guides. You can also find them in the documentation which is available here.

License

Please refer to the SDK packages for the SDK-specific licenses.

Ethical AI

NVIDIA’s platforms and application frameworks enable developers to build a wide array of AI applications. Please consider potential algorithmic bias when choosing or creating the models being deployed. Work with the model’s developer to ensure:

  • The model meets the requirements for the relevant industry and use case
  • The necessary instruction and documentation are provided to understand error rates, confidence intervals, and results
  • The model is being used under the conditions and in the manner intended.