Resources

Github

NeMo
Llama
Demucs
Whisper
Whisper NeMo Diarization
Text to speech alignment using CTC forced alignment
Utilities intended for use with Llama models.
Llama Recipes: Examples to get started using the Llama models from Meta
timsainb/noisereduce: Noise reduction in python using spectral gating
pyannote/pyannote-audio: Neural building blocks for speaker diarization
microsoft/DNS-Challenge: This repo contains the scripts, models, and required files for the Deep Noise Suppression (DNS) Challenge.
WenzheLiu-Speech/awesome-speech-enhancement: speech enhancement\speech seperation\sound source localization
nanahou/Awesome-Speech-Enhancement: A tutorial for Speech Enhancement researchers and practitioners. The purpose of this repo is to organize the world’s resources for speech enhancement and make them universally accessible and useful.
jonashaag/speech-enhancement: Collection of papers, datasets and tools on the topic of Speech Dereverberation and Speech Enhancement
yxlu-0102/MP-SENet: Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement
Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement
## SUPERSEDED: THIS DATASET HAS BEEN REPLACED. ## Noisy speech database for training speech enhancement algorithms and TTS models

Web

Llama
Download Llama
Llama 3.2 Requirements
Average handle time (AHT): Formula and tips for improvement

Notebooks

Hybrid Demucs Music Source Separation

PyPI

demucs
MPSENet

Errors

The file is already fully retrieved; nothing to do.

Paper

Dual-Path Transformer Network: Direct Context-Aware Modeling for End-to-End Monaural Speech Separation
MP-SENet: A Speech Enhancement Model with Parallel Denoising of Magnitude and Phase Spectra
FINALLY: fast and universal speech enhancement with studio-like quality
Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement

Youtube

A Course on Speech Enhancement
COMS 4995 Final on Speech Enhancement
Achieving Studio-Quality Speech with Generative AI
How to Fix Bad Podcast Audio
Speech Enhancement for Cochlear Implant Recipients Using Deep Complex Convolution Transformer With F
Vibravox: A Dataset of French Speech Captured with Body-conduction Audio Sensors
2024 종합설계 3팀 2차, Neural Network for Speech Enhancement
MIAI Deeptails Seminar : Generative Models as Data-driven Priors for Speech Enhancement
Hardware Efficient Speech Enhancement With Noise Aware Multi Target Deep Learning
Diffusion Models for Speech Enhancement | Julius Richter
Speech Enhancement: Basics & Key Details
Guided Speech Enhancement Network (ICASSP 2023)
VSANet: Real-time Speech Enhancement Based on Voice Activity Detection and Causal Spatial Attention
Research intern talk: Unified speech enhancement approach for speech degradation & noise suppression
Magnitude and phase spectrum with example
Deep Learning In Audio for Absolute Beginners: From No Experience & No Datasets to a Deployed Model
Look Once to Hear: Target Speech Hearing with Noisy Examples

Wikipedia

Speech enhancement

Hugging Face

Models(asteroid)
cankeles/DPTNet_WHAMR_enhsingle_16k
JacobLinCool/MP-SENet-VB
JacobLinCool/MP-SENet-DNS
ENOT-AutoDL/MP-SENet

Web

Dual-Path Transformer Network: Direct Context-Aware Modeling for End-to-End Monaural Speech Separation
The Audio Developer Conference - ADC is an annual event celebrating all audio development technologies, from music applications and game audio to audio processing and embedded systems.
Look Once to Hear: Target Speech Hearing with Noisy Examples - CHI '24
Reinforcement Learning Based Speech Enhancement for Robust Speech Recognition > Introduction | Class Central Classroom

Dataset

VoiceBank+DEMAND
VoiceBank+DEMAND

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

RESOURCES.md

RESOURCES.md

Resources

Github

Web

Notebooks

PyPI

Errors

Paper

Youtube

Wikipedia

Hugging Face

Web

Dataset

Files

RESOURCES.md

Latest commit

History

RESOURCES.md

File metadata and controls

Resources

Github

Web

Notebooks

PyPI

Errors

Paper

Youtube

Wikipedia

Hugging Face

Web

Dataset