rg_text_to_sound

Research Group - Text to Sound

This README has three sections:

WebsocketServer usage: instructions to run the WebsocketServer for integration with the main system
Main Resources: the "primary"/"most used" resouces and where to find them in this repository
Other Resources: the resources that are implemented in this repository but are secondary, with a brief description and links to more in-depth READMEs

WebsocketServer usage

Setup

Run bash setup.sh to install required dependencies.
Disclaimer: Due to the use of Tensorflow, the WebsocketServer is not compatible with Windows.

Usage

Run bash run_server.sh or python main_tts.py to run the server. Run bash run_client.sh to run a test client that will communicate with the server.

Docs

For more informations, see the tts_websocketserver README.md

Main Resources

TTS Pipeline

Currently located at: tts_pipeline/
This repository defines the skeleton of the pipelines used at inference time, a first design of the pipeline can be found here.
For informations on how to use TTS Pipeline, please read its README.md

NOTE:

There is a newly implemented keyword extractor: tts_pipeline.pipelines.waterfall.models.UnifiedKeywordPairsExtractorV3.UnifiedKeywordPairsExtractorV3
This keyword extractor is not currently used in tts_websocketserver since the older version (tts_pipeline.pipelines.waterfall.models.UnifiedKeywordPairsExtractorV2) is currently under evaluation.
The newer version is supposed to be smarter and give better keyword matchings. More infos can be found in the doctring of the UnifiedKeywordPairsExtractorV3 class

TTS WebsocketServer

Currently located at: tts_websocketserver/
This repository holds the implementation of a websocket server that exposes TTS Pipeline's prediction functionalities for the Production Team.
For more informations on TTS WebsocketServer, please read its README.md

Other resources

Git Tutorial

Currently located at: playground/beat_toedtli/
A simple guideline on the structure and usage of this repository. More details can be found in its own README.md

words embeddings

Currently located at: playground/beat_toedtli/word_embeddings
Word embeddings explorative implementations, benchmarked with benchmarking_tools More details can be found in its own README.md

Command Patterns Dataset

Currently located at: playground/mirco_nani/embeddings_pipelines/commands_patterns_dataset
A simple sentences generator to augment our dataset. It takes sentence patterns with blank tokens and replaces these tokens with given keywords. The final output consists of all the possible sentences obtainable from all the combinations of patterns and keywords.
More details can be found in its own README.md

Benchmarking Tools

Currently located at: playground/mirco_nani/embeddings_pipelines/benchmarking_tools
These tools are made to benchmark embedding models (such as BERT or sent2vec) in order to produce performances comparisons. More details can be found in its own README.md

Embeddings Pipelines

Currently located at: playground/mirco_nani/embeddings_pipelines/
First implementation of TTS Pipeline
More details can be found in its own README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

rg_text_to_sound

WebsocketServer usage

Setup

Usage

Docs

Main Resources

TTS Pipeline

NOTE:

TTS WebsocketServer

Other resources

Git Tutorial

words embeddings

Command Patterns Dataset

Benchmarking Tools

Embeddings Pipelines

About

Releases

Packages

Contributors 10

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 166 Commits
playground		playground
tts_pipeline		tts_pipeline
tts_websocketserver		tts_websocketserver
.gitignore		.gitignore
KeywordExtractor_byList.ipynb		KeywordExtractor_byList.ipynb
LICENSE		LICENSE
README.md		README.md
main_tts.py		main_tts.py
run_client.sh		run_client.sh
run_server.sh		run_server.sh
run_server_mac.sh		run_server_mac.sh
setup.sh		setup.sh

License

TheSoundOfAIOSR/rg_text_to_sound

Folders and files

Latest commit

History

Repository files navigation

rg_text_to_sound

WebsocketServer usage

Setup

Usage

Docs

Main Resources

TTS Pipeline

NOTE:

TTS WebsocketServer

Other resources

Git Tutorial

words embeddings

Command Patterns Dataset

Benchmarking Tools

Embeddings Pipelines

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 10

Languages

Packages