Implement missing speed functions along with durable speech rate / speed changer function. #4115

isikhi · 2024-12-28T20:13:30Z

Added missing speed parameters to functions and ensured more durable, accurate speed adjustments with the new adjust_speech_rate function.
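For readers unfamiliar with the idea, here is a minimal sketch of one way a speed factor can be applied to a sequence of frames: resampling along the time axis by linear interpolation. This is an illustration only. `stretch_frames` is a hypothetical helper, the frames are plain Python lists, and nothing here is taken from the actual `adjust_speech_rate` implementation in this PR.

```python
from typing import List, Sequence


def stretch_frames(frames: Sequence[Sequence[float]], speed: float) -> List[List[float]]:
    """Resample a frame sequence along time by linear interpolation.

    speed > 1.0 yields fewer frames (faster speech); speed < 1.0 yields more.
    Hypothetical helper for illustration; not the PR's adjust_speech_rate.
    """
    if speed <= 0:
        raise ValueError("speed must be positive")
    n_in = len(frames)
    if n_in == 0:
        return []
    # The output length scales with 1 / speed; keep at least one frame.
    n_out = max(1, round(n_in / speed))
    if n_in == 1 or n_out == 1:
        return [list(frames[0]) for _ in range(n_out)]
    out: List[List[float]] = []
    for i in range(n_out):
        # Map output index i onto the input time axis (endpoints aligned).
        pos = i * (n_in - 1) / (n_out - 1)
        lo = int(pos)
        hi = min(lo + 1, n_in - 1)
        w = pos - lo
        out.append([(1 - w) * a + w * b for a, b in zip(frames[lo], frames[hi])])
    return out
```

With four input frames, `speed=2.0` returns two frames and `speed=0.5` returns eight; `speed=1.0` returns the input unchanged.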

…ids file Previously, `LanguageManager.init_from_config(config)` never used the `language_ids_file`, even when that field was set, because the manager built from it was overwritten on the next line by a new manager that manually parses languages from the datasets in the config. Now the dataset-based manager is only used as a fallback.
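The fallback order described in that commit message can be sketched roughly as follows. `SketchLanguageManager` is a simplified, hypothetical stand-in and not the real `LanguageManager`; the config is a plain dict, and the IDs file is assumed to be a JSON object mapping language names to integer IDs.

```python
import json
import tempfile
from pathlib import Path
from typing import Dict, List


class SketchLanguageManager:
    """Simplified stand-in for illustration; not the real LanguageManager."""

    def __init__(self, name_to_id: Dict[str, int]) -> None:
        self.name_to_id = name_to_id

    @classmethod
    def from_ids_file(cls, path: str) -> "SketchLanguageManager":
        # Assumes the file is a JSON object mapping language name to ID.
        return cls(json.loads(Path(path).read_text(encoding="utf-8")))

    @classmethod
    def from_datasets(cls, datasets: List[dict]) -> "SketchLanguageManager":
        languages = sorted({d["language"] for d in datasets})
        return cls({lang: idx for idx, lang in enumerate(languages)})

    @classmethod
    def init_from_config(cls, config: dict) -> "SketchLanguageManager":
        ids_file = config.get("language_ids_file")
        if ids_file:
            # Return immediately so the file-based manager is not overwritten.
            return cls.from_ids_file(ids_file)
        # Fallback: derive the languages from the configured datasets.
        return cls.from_datasets(config.get("datasets", []))


datasets = [{"language": "en"}, {"language": "de"}]
with tempfile.TemporaryDirectory() as tmp:
    ids_path = Path(tmp) / "language_ids.json"
    ids_path.write_text(json.dumps({"hi": 0, "en": 1}), encoding="utf-8")
    from_file = SketchLanguageManager.init_from_config(
        {"language_ids_file": str(ids_path), "datasets": datasets}
    )
from_data = SketchLanguageManager.init_from_config({"datasets": datasets})
```

Here `from_file` takes its mapping from the IDs file even though datasets are also configured, while `from_data` falls back to the dataset languages.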

This way the outputs are available for further downstream processing, e.g. with grep. For TTS/bin/synthesize.py, if --pipe_out is set, log to stderr because then only the output audio stream should be on stdout, e.g. to pipe it to aplay.

fix(bin): log to stdout in cli tools
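A minimal sketch of the stream selection described above, using only the standard `logging` module. `make_cli_logger` and its `pipe_out` argument are hypothetical names for illustration; the project's own logger setup may look different.

```python
import logging
import sys


def make_cli_logger(name: str, pipe_out: bool) -> logging.Logger:
    """Log to stdout by default, or to stderr when audio is piped to stdout."""
    # With pipe_out set, stdout must carry only the audio stream.
    stream = sys.stderr if pipe_out else sys.stdout
    logger = logging.getLogger(name)
    logger.setLevel(logging.INFO)
    # Drop handlers from earlier calls so messages are not duplicated.
    logger.handlers.clear()
    handler = logging.StreamHandler(stream)
    handler.setFormatter(logging.Formatter("%(message)s"))
    logger.addHandler(handler)
    # Keep messages away from the root logger's handlers.
    logger.propagate = False
    return logger
```

Keeping log messages off stdout in the piped case is what lets a downstream consumer such as aplay read the raw audio without interleaved text.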

CLAassistant · 2024-12-28T20:13:38Z

Thank you for your submission! We really appreciate it. Like many open source projects, we ask that you all sign our Contributor License Agreement before we can accept your contribution.
1 out of 2 committers have signed the CLA.

✅ isikhi
❌ SkaceKamen
You have signed the CLA already but the status is still pending? Let us recheck it.

isikhi · 2024-12-29T21:38:31Z

Since I learned that this repo is no longer maintained, I am closing this PR here and reopening it against idiap's repo. The work continues there: idiap#239

eginhard and others added 30 commits March 14, 2024 20:48

refactor(dataset): get audio length with torchaudio

adbcba0

Removes a (GPL) dependency

Merge pull request coqui-ai#21 from eginhard/audio-length

571f065

refactor(dataset): get audio length with torchaudio

refactor(bin.find_unique_chars): use existing function

7630abb

ci(tests.yml): run apt-get update before installing espeak

d76d0ef

https://docs.github.com/en/actions/using-github-hosted-runners/about-github-hosted-runners/customizing-github-hosted-runners#installing-software-on-ubuntu-runners

Merge pull request #22 from eginhard/unique-chars

018daa0

refactor(bin.find_unique_chars): use existing function

fix: update repository links, package names, metadata

d772724

ci(pypi-release): update actions, use trusted publishing

7fe6a01

https://packaging.python.org/en/latest/guides/publishing-package-distribution-releases-using-github-actions-ci-cd-workflows/

chore: update version to v0.22.1

dd3768d

ci: switch back from uv to pip

00f8d47

Reverts c59f0ca (coqui-ai#13) Too many CI test timeouts from installing torch/nvidia packages with uv: astral-sh/uv#1912

Merge pull request #24 from idiap/coqui-refs

a4ca02b

Update repository links, package names, release script

fix: use logging instead of print statements

b6ab85a

Fixes coqui-ai#1691

refactor: remove verbose arguments

b711e19

Can be handled by adjusting logging levels instead.

feat(utils.generic_utils): improve setup_logger() arguments and output

9b2d48f

feat(utils.generic_utils): add custom formatter for logging to console

ab64844

fix: logging in executables

7dc5d1e

fix(utils.manage): remove bare except, improve messages

e689fd1

docs: update links

aa40fd2

ci(workflows): update actions

107e22c

ci(workflows.docker): update image namespace

31f1c8b

Merge pull request coqui-ai#1 from idiap/update-docs

e626a29

Update links and Github actions

feat(xtts): support hindi for sentence-splitting and fine-tuning

d416865

The XTTS model itself already supports Hindi, it was just in these components.

Merge pull request coqui-ai#3 from idiap/logging

dfbe016

Use Python logging instead of print()

Merge pull request coqui-ai#4 from idiap/hindi

2ad790d

feat(xtts): support Hindi for sentence-splitting and fine-tuning

fix(tokenizer): add debug logging

b3c9685

docs(README): update badges to new pypi package

794eecb

chore: update version to 0.23.0

f7d69cc

Merge pull request coqui-ai#5 from idiap/tokenizer-logging

5527f70

Add tokenizer logging, update version for release 0.23.0

build: add python 3.12 support

8b1ed02

build: switch to forked trainer package

f636fab

eginhard and others added 25 commits December 6, 2024 06:46

Merge pull request coqui-ai#184 from idiap/xtts-error

e8d99aa

fix(xtts): clearer error message when file given to checkpoint_dir

feat(api): allow mixing TTS and vocoder model name and path

85dbb3b

chore(api): add type hints

a05177c

feat(api): support passing speaker/language id file paths

89abd98

refactor(api): use save_wav() from Synthesizer instance

806af96

refactor(bin.synthesize): use Python API for CLI

e0f6211

Merge pull request coqui-ai#197 from idiap/api

b545ab8

Expand Python API capabilities

fix: handle difference in xtts/tortoise attention (coqui-ai#199)

c0d9ed3

chore: bump version to 0.25.1 (coqui-ai#202)

f329072

build(docs): update dependencies, fix makefile

236e490

docs: improve documentation

849e75e

docs: move project structure from readme into documentation

e23766d

docs: use nested contents for easier overview

ae2f8d2

docs: streamline readme and reuse content in other docs pages

e38dcbe

[ci skip]

Merge pull request coqui-ai#207 from idiap/docs

cd52907

Improve documentation

feat: allow both Path and strings where possible and add type hints

a425ba5

docs: add notes about xtts fine-tuning

0df04cc

Merge pull request coqui-ai#210 from idiap/manager

5165e71

feat: allow both Path and strings where possible and add type hints

feat(manager): print download location when listing models (coqui-ai#213)

9d5fc60

docs(xtts): show manual inference with default speakers

1f9dda6

Merge pull request coqui-ai#217 from idiap/stdout

370fb1d

fix(bin): log to stdout in cli tools

fix(xtts): voice_dir should remain None if not specified (coqui-ai#224)

f89ce41

fix(xtts): use correct language code for Czech num2words call (coqui-ai#237)

98080e2

* Fix num2words call using non-standard lang code
* build: update minimum num2words version

Co-authored-by: Enno Hermann <[email protected]>

feat: add adjust_speech_rate function to modify speech speed with more durable latents. also missed tts speed implementations added.

26128be

isikhi mentioned this pull request Dec 29, 2024

Implement missing speed functions along with durable speech rate / speed changer function. idiap/coqui-ai-TTS#239

Open

Merge branch 'dev' into fix-improvements/adjust-speech-rate-or-speed

ed1563b

isikhi closed this Dec 29, 2024
