HPMDubbing_Vocoder

This repository contains the vocoder for our model, HPMDubbing. It converts the mel-spectrograms generated by the model into time-domain waveforms.

Pretrained Models

We provide pretrained models. Download the generator checkpoint (e.g., g_05000000) from the corresponding folder listed below.

Folder Name	Sampling Rate	Hop Length	Segment Size	Win Length	Params.	Dataset	Fine-Tuned
HPM_Chem	16000 Hz	160	8000	640	55M	LibriTTS	No
HPM_V2C	22050 Hz	220	9900	880	58M	LibriTTS	No
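The table values map to frame timing through simple arithmetic. The sketch below derives the hop and window durations and the number of mel frames per training segment. It assumes one mel frame per hop; the dictionary keys and the function name are illustrative, not taken from the repository's config files.

```python
# Values copied from the table above; key names are illustrative assumptions.
CONFIGS = {
    "HPM_Chem": {"sampling_rate": 16000, "hop_length": 160,
                 "segment_size": 8000, "win_length": 640},
    "HPM_V2C": {"sampling_rate": 22050, "hop_length": 220,
                "segment_size": 9900, "win_length": 880},
}


def derived_timing(cfg):
    """Return hop/window durations and frame counts implied by a config."""
    sr = cfg["sampling_rate"]
    return {
        # Duration of one hop and one analysis window, in milliseconds.
        "hop_ms": 1000.0 * cfg["hop_length"] / sr,
        "win_ms": 1000.0 * cfg["win_length"] / sr,
        # Length of one training segment, in seconds.
        "segment_s": cfg["segment_size"] / sr,
        # Mel frames per training segment, assuming one frame per hop.
        "frames_per_segment": cfg["segment_size"] // cfg["hop_length"],
    }


if __name__ == "__main__":
    for name, cfg in CONFIGS.items():
        print(name, derived_timing(cfg))
```

Both settings use a hop of roughly 10 ms and a window of roughly 40 ms. A training segment covers 50 frames for HPM_Chem and 45 frames for HPM_V2C.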

Training

To train the 22050 Hz vocoder (HPM_V2C), run

python train_V2C_HiFiGAN.py --config config_V2C_22050Hz.json

To train the 16 kHz vocoder (HPM_Chem), run

python train_hifigan_16KHz.py --config config_Chem_16KHz.json

Inference

inference.py: wav -> mel -> wav (computes a mel-spectrogram from an input waveform, then resynthesizes the waveform)

python inference.py --checkpoint_file [path to your checkpoint file]

inference_e2e.py: mel -> wav (synthesizes a waveform directly from a mel-spectrogram)

python inference_e2e.py --checkpoint_file [path to your checkpoint file]
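When a folder holds several generator checkpoints, the path passed to --checkpoint_file can be picked programmatically. The sketch below assumes checkpoints follow the g_ prefix plus eight-digit step pattern seen in the g_05000000 example; the helper name latest_generator_checkpoint is hypothetical and not part of this repository.

```python
import re
from pathlib import Path
from typing import Optional

# Assumed naming scheme: "g_" followed by an 8-digit step count, e.g. g_05000000.
_CKPT_PATTERN = re.compile(r"^g_(\d{8})$")


def latest_generator_checkpoint(folder: str) -> Optional[Path]:
    """Return the generator checkpoint with the highest step, or None if absent."""
    best = None
    best_step = -1
    for path in Path(folder).iterdir():
        match = _CKPT_PATTERN.match(path.name)
        if match and path.is_file():
            step = int(match.group(1))
            if step > best_step:
                best, best_step = path, step
    return best
```

The returned path can then be supplied as the --checkpoint_file argument of either inference script.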

TensorBoard

To monitor training, run

tensorboard --logdir HifiGAN_16/logs/ --port=[Your port]

or

tensorboard --logdir My_vocoder_V2C/logs/ --port=[Your port]

References

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis, J. Kong, et al.
