NVIDIA
NVIDIA
Windows Audio Effects SDK
Resource
NVIDIA
NVIDIA
Windows Audio Effects SDK

Audio Effects SDK for Windows delivers AI-based audio enhancement algorithms, improving end-to-end conversation quality.

Join or Subscribe to get accessSubscribe to the product below to access this premium content:
NVIDIA AI Enterprise
NVIDIA AI EnterpriseAccelerate your AI agent development
Subscribe Now
NVIDIA Developer Program
NVIDIA Developer ProgramJoin the Developer Program for access to free tools, support, and tech resources.
Get Access
Note: You can gain access to hundreds more GPU-optimized artifacts by creating a free NGC account.
Already Subscribed?Log in

NVIDIA Windows Audio Effects SDK

The NVIDIA Windows Audio Effects SDK provides the following audio effects for broadcast use cases with real-time audio processing:

  1. Background Noise Removal: removes common background noise while preserving the speaker’s natural voice, with improved accuracy for automated speech recognition.

  2. Room Echo Cancellation: removes and suppresses reverbs from audio that might occur from the recording environment, improving speech clarity.

  3. Background Noise Reduction + Room Echo Cancellation: removes unwanted noises and reverberations from audio, improving speech intelligibility.

  4. Audio Super Resolution: improves sound quality by adding higher frequency content to the audio stream. For low-frequency audio, this feature predicts the higher frequency spectrum of input audio, which improves audio quality.

  5. Acoustic Echo Cancellation: removes acoustic echo and feedback from audio, which improves the bidirectional audio quality.

  6. Studio Voice: enables ordinary headset, laptop, and desktop microphones to deliver the sound of a high-end studio mic, even if recorded in less-than-ideal acoustic environments with distortions such as reverberations or static noise. Studio Voice enhances and recovers speech degraded by noise reduction filters and beamforming algorithms, making the audio sound like it was recorded in a professional studio. This effect has two variations: Studio Voice High Quality and Studio Voice Low Latency.

  7. Speaker Focus: identifies and isolates the primary speaker and removes all other speakers from the input audio. This significantly improves the intelligibility of the primary speaker’s voice when others are speaking in the background.

  8. Voice Font: converts the input voice to match the reference speaker’s voice while keeping linguistic information and prosody unchanged. Currently only available as an EA feature.

  9. For more detailed information, please refer to the documentation guide in the SDK packages.

License

NVIDIA Evaluation License Agreement

Get Help

Getting started with the SDK

Please refer to the programming guide for quick start guide, API reference and more.

Enterprise Support

Get access to knowledge base articles and support cases or submit a ticket.

Publisher
NVIDIA
NVIDIA
Latest Version2.1.0
UpdatedMarch 16, 2026 UTC
Compressed Size973.13 MB