lifefeel / Grapheme-to-Phoneme Public

Notifications You must be signed in to change notification settings
Fork 0
Star 20

Grapheme-to-Phoneme(G2P) 관련자료 모음

20 stars 0 forks Branches Tags Activity

Notifications

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
README.md		README.md

Repository files navigation

Grapheme-to-Phoneme(G2P)

Grapheme-to-Phoneme 관련자료 모음

Tutorials

Building a phonetic dictionary
- CMUSphinx 툴에서 제공하는 오픈소스 G2P로 발음사전을 구축할 수 있는 튜토리얼

Tools

Sequitur G2P - A trainable Grapheme-to-Phoneme converter
- 위 링크에서 오리지널 소스코드를 받을 수 있으나 2016년 5월에 마지막 버전이 출시된 이후 업데이트가 되지 않고 있다. @jtrmal이란 사람에 의해 별도로 github 레포지토리가 유지되고 있다.
- github 레포지토리 - https://github.com/sequitur-g2p/sequitur-g2p
Sequence-to-Sequence G2P toolkit - CMUSphinx에서 공개한 오픈소스 Seq2Seq G2P

Dataset

CMU US English Dictionary
- 공식사이트는 Speech at CMU에 있는 CMUdict 페이지 인 것 같으나 최근까지 업데이트 되고 있는 버전은 위의 링크인 github 레포지토리이다.

Papers

통계적 방법을 이용한 G2P

Joint-Sequence Models for Grapheme-to-Phoneme Conversion (2008.05)
- Sequitur G2P를 개발한 Maximilian Bisani의 논문
GRAPHEME-TO-PHONEME CONVERSION SCHEME FOR SENTENCE-BY-SENTENCE LEARNING OF KOREAN MANUSCRIPT USING JOINT SEQUENCE STATISTICAL MODEL(2016)
- 한국어에 대해 문장단위로 학습하는 것을 제안한 논문

DNN 기반 G2P

Sequence-to-Sequence Neural Net Models for Grapheme-to-Phoneme Conversion (2015.05)
- BLSTM을 사용한 encoder-decoder 모델
- PER 5.45%, WER 23.55% (CMUDict)
- PER 7.38%, WER 30.77% (NetTalk)
- PER 6.51%, WER 26.69% (Pronlex)
Deep Bidirectional Long Short-Term Memory Recurrent Neural Networks for Grapheme-to-Phoneme Conversion utilizing Complex Many-to-Many Alignments (2016.09. Interspeech2016)
- LSTM LM을 이용한 BLSTM 모델
- PER 5.37%, WER 23.23% (CMUDict)
Grapheme-to-phoneme conversion using Long Short-Term Memory recurrent neural networks (2015.04)
- LSTM및 n-gram FST를 조합한 모델
- WER 21.3% (CMUDict)
Jointly Learning to Align and Convert Graphemes to Phonemes with Neural Attention Models (2016.10)
- LSTM encoder-decoder + global attention 앙상블 모델
- PER 4.69%, WER 20.24% (CMUDict)
Multitask Sequence-to-Sequence Models for Grapheme-to-Phoneme Conversion (2017.08. Interspeech2017)
- Sequitur G2P + Multitask learning + BLSTM + Attention 모델
- PER 5.76%, WER 24.88% (CMUDict)
Massively Multilingual Neural Grapheme-to-Phoneme Conversion (2017.08)

Korean G2P 관련 논문

A linguistically motivated approach to grapheme-to-phoneme conversion for Korean (2006) - Kyuchul Yoon, Chris Brew
- 미국 오하이오 주립대 언어학과에서 발표한 논문
Grapheme-to-Phoneme Conversion Using Automatically Extracted Associative Rules for Korean TTS System (2006) - Jinsik Lee, Seungwon Kim and Gary Geunbae Lee, Interspeech 2006
- 포항공대 이근배 교수 연구실에서 발표한 논문
Rule-based Korean Grapheme to Phoneme Conversion Using Sound Patterns (2009) - Yu-Chun Wang, Richard Tzong-Han Tsai
- 타이완 대학 연구실에서 발표한 논문
A data-driven grapheme-to-phoneme conversion method using dynamic contextual converting rules for Korean TTS systems (2009) - Jinsik Lee, Gary GeunbaeLee
- 포항공대 이근배 교수 연구실에서 발표한 논문
GRAPHEME-TO-PHONEME CONVERSION SCHEME FOR SENTENCE-BY-SENTENCE LEARNING OF KOREAN MANUSCRIPT USING JOINT SEQUENCE STATISTICAL MODEL(2016)
- 한국어에 대해 문장단위로 학습하는 것을 제안한 논문

About

Grapheme-to-Phoneme(G2P) 관련자료 모음

g2p grapheme-to-phoneme

Report repository

Releases

No releases published

Packages

No packages published