ISCA Archive Interspeech 2008
ISCA Archive Interspeech 2008

Mandarin connected digits recognition for whispered speech

Tingting Ru, Xiang Xie, Hui Yin, Jingming Kuang

In this paper, the acoustic characteristics and recognition of whispered speech are discussed. A Mandarin digits database is built both in normal speech and whispered speech. The collected speech materials of normal and whispered speech are analyzed to verify the characteristics and differences for the two kinds of speech. Cross recognition is carried out using normal and whispered speech as training data and testing data respectively, and the detailed recognition results are analyzed by using the confusion matrices. The results show that it's not suitable to recognize whispered speech using models trained by normal speech, and the word correct rate of the whispered speech is in close relation with its acoustic characteristics. Some possible solutions are also suggested.

doi: 10.21437/Interspeech.2008-347

Cite as: Ru, T., Xie, X., Yin, H., Kuang, J. (2008) Mandarin connected digits recognition for whispered speech. Proc. Interspeech 2008, 1141-1144, doi: 10.21437/Interspeech.2008-347

  author={Tingting Ru and Xiang Xie and Hui Yin and Jingming Kuang},
  title={{Mandarin connected digits recognition for whispered speech}},
  booktitle={Proc. Interspeech 2008},