ISCA Archive Interspeech 2007
ISCA Archive Interspeech 2007

Evaluating two versions of the momel pitch modelling algorithm on a corpus of read speech in Korean

Daniel Hirst, Hyongsil Cho, Sunhee Kim, Hyunji Yu

The Momel algorithm provides an automatic factoring of raw fundamental frequency into two components: a microprosodic component, corresponding to local variations of pitch caused by the phonetic nature of the speech segments and a macroprosodic component corresponding to the overall pitch pattern of the utterance which is then represented as a sequence of pitch targets. An earlier evaluation estimated the overall efficiency of the algorithm (F-measure) at around 95% on a corpus of read speech for 5 European languages and at around 93% for a corpus of spontaneous speech. In this paper we present the results of the evaluation of the output of two versions of the Momel algorithm as compared with manually corrected pitch targets for a corpus of just over 2 hours of read speech in Korean (40 continuous 5-sentence passages, each read by 5 male and 5 female speakers). The results show that the new version of the Momel algorithm performs systematically better than the earlier version.


doi: 10.21437/Interspeech.2007-459

Cite as: Hirst, D., Cho, H., Kim, S., Yu, H. (2007) Evaluating two versions of the momel pitch modelling algorithm on a corpus of read speech in Korean. Proc. Interspeech 2007, 1649-1652, doi: 10.21437/Interspeech.2007-459

@inproceedings{hirst07_interspeech,
  author={Daniel Hirst and Hyongsil Cho and Sunhee Kim and Hyunji Yu},
  title={{Evaluating two versions of the momel pitch modelling algorithm on a corpus of read speech in Korean}},
  year=2007,
  booktitle={Proc. Interspeech 2007},
  pages={1649--1652},
  doi={10.21437/Interspeech.2007-459}
}