Improving Mongolian Phrase Break Prediction by Using Syllable and Morphological Embeddings with BiLSTM Model

Rui Liu, Feilong Bao, Guanglai Gao, Hui Zhang, Yonghe Wang


In the speech synthesis systems, the phrase break (PB) prediction is the first and most important step. Recently, the state-of-the-art PB prediction systems mainly rely on word embeddings. However this method is not fully applicable to Mongolian language, because its word embeddings are inadequate trained, owing to the lack of resources. In this paper, we introduce a bidirectional Long Short Term Memory (BiLSTM) model which combined word embeddings with syllable and morphological embedding representations to provide richer and multi-view information which leverages the agglutinative property. Experimental results show the proposed method outperforms compared systems which only used the word embeddings. In addition, further analysis shows that it is quite robust to the Out-of-Vocabulary (OOV) problem owe to the refined word embedding. The proposed method achieves the state-of-the-art performance in the Mongolian PB prediction.


 DOI: 10.21437/Interspeech.2018-1706

Cite as: Liu, R., Bao, F., Gao, G., Zhang, H., Wang, Y. (2018) Improving Mongolian Phrase Break Prediction by Using Syllable and Morphological Embeddings with BiLSTM Model. Proc. Interspeech 2018, 57-61, DOI: 10.21437/Interspeech.2018-1706.


@inproceedings{Liu2018,
  author={Rui Liu and Feilong Bao and Guanglai Gao and Hui Zhang and Yonghe Wang},
  title={Improving Mongolian Phrase Break Prediction by Using Syllable and Morphological Embeddings with BiLSTM Model},
  year=2018,
  booktitle={Proc. Interspeech 2018},
  pages={57--61},
  doi={10.21437/Interspeech.2018-1706},
  url={http://dx.doi.org/10.21437/Interspeech.2018-1706}
}