- NeMo
- Llama
- Demucs
- Whisper
- Whisper NeMo Diarization
- Text to speech alignment using CTC forced alignment
- Utilities intended for use with Llama models.
- Llama Recipes: Examples to get started using the Llama models from Meta
- timsainb/noisereduce: Noise reduction in python using spectral gating
- pyannote/pyannote-audio: Neural building blocks for speaker diarization
- microsoft/DNS-Challenge: This repo contains the scripts, models, and required files for the Deep Noise Suppression (DNS) Challenge.
- WenzheLiu-Speech/awesome-speech-enhancement: speech enhancement\speech seperation\sound source localization
- nanahou/Awesome-Speech-Enhancement: A tutorial for Speech Enhancement researchers and practitioners. The purpose of this repo is to organize the world’s resources for speech enhancement and make them universally accessible and useful.
- jonashaag/speech-enhancement: Collection of papers, datasets and tools on the topic of Speech Dereverberation and Speech Enhancement
- yxlu-0102/MP-SENet: Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement
- Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement
- ## SUPERSEDED: THIS DATASET HAS BEEN REPLACED. ## Noisy speech database for training speech enhancement algorithms and TTS models
- Llama
- Download Llama
- Llama 3.2 Requirements
- Average handle time (AHT): Formula and tips for improvement
- Dual-Path Transformer Network: Direct Context-Aware Modeling for End-to-End Monaural Speech Separation
- MP-SENet: A Speech Enhancement Model with Parallel Denoising of Magnitude and Phase Spectra
- FINALLY: fast and universal speech enhancement with studio-like quality
- Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement
- A Course on Speech Enhancement
- COMS 4995 Final on Speech Enhancement
- Achieving Studio-Quality Speech with Generative AI
- How to Fix Bad Podcast Audio
- Speech Enhancement for Cochlear Implant Recipients Using Deep Complex Convolution Transformer With F
- Vibravox: A Dataset of French Speech Captured with Body-conduction Audio Sensors
- 2024 종합설계 3팀 2차, Neural Network for Speech Enhancement
- MIAI Deeptails Seminar : Generative Models as Data-driven Priors for Speech Enhancement
- Hardware Efficient Speech Enhancement With Noise Aware Multi Target Deep Learning
- Diffusion Models for Speech Enhancement | Julius Richter
- Speech Enhancement: Basics & Key Details
- Guided Speech Enhancement Network (ICASSP 2023)
- VSANet: Real-time Speech Enhancement Based on Voice Activity Detection and Causal Spatial Attention
- Research intern talk: Unified speech enhancement approach for speech degradation & noise suppression
- Magnitude and phase spectrum with example
- Deep Learning In Audio for Absolute Beginners: From No Experience & No Datasets to a Deployed Model
- Look Once to Hear: Target Speech Hearing with Noisy Examples
- Models(asteroid)
- cankeles/DPTNet_WHAMR_enhsingle_16k
- JacobLinCool/MP-SENet-VB
- JacobLinCool/MP-SENet-DNS
- ENOT-AutoDL/MP-SENet
- Dual-Path Transformer Network: Direct Context-Aware Modeling for End-to-End Monaural Speech Separation
- The Audio Developer Conference - ADC is an annual event celebrating all audio development technologies, from music applications and game audio to audio processing and embedded systems.
- Look Once to Hear: Target Speech Hearing with Noisy Examples - CHI '24
- Reinforcement Learning Based Speech Enhancement for Robust Speech Recognition > Introduction | Class Central Classroom