Optimized Chinese Pronunciation Prediction by Component-Based Statistical Machine Translation

홈 > 연구문헌 > 영문 논문지 > JIPS (한국정보처리학회)

한글제목(Korean Title)	Optimized Chinese Pronunciation Prediction by Component-Based Statistical Machine Translation
영문제목(English Title)	Optimized Chinese Pronunciation Prediction by Component-Based Statistical Machine Translation
저자(Author)	Shunle Zhu
원문수록처(Citation)	VOL 17 NO. 01 PP. 0203 ~ 0212 (2021. 02)
한글내용 (Korean Abstract)
영문내용 (English Abstract)	To eliminate ambiguities in the existing methods to simplify Chinese pronunciation learning, we propose a model that can predict the pronunciation of Chinese characters automatically. The proposed model relies on a statistical machine translation (SMT) framework. In particular, we consider the components of Chinese characters as the basic unit and consider the pronunciation prediction as a machine translation procedure (the component sequence as a source sentence, the pronunciation, pinyin, as a target sentence). In addition to traditional features such as the bidirectional word translation and the n-gram language model, we also implement a component similarity feature to overcome some typos during practical use. We incorporate these features into a log-linear model. The experimental results show that our approach significantly outperforms other baseline models.
키워드(Keyword)	Chinese Pronunciation Prediction Component Features Statistical Machine Translation (SMT)
파일첨부	PDF 다운로드