ISCA Archive Interspeech 2013
ISCA Archive Interspeech 2013

Lexical stress detection for L2 English speech using deep belief networks

Kun Li, Xiaojun Qian, Shiyin Kang, Helen Meng

This paper investigates lexical stress detection for L2 English speech using Deep Belief Networks (DBNs). The features of the DBN used in this work include the syllable-based prosodic features (assumed to have Gaussian distribution) and their expected lexical stress (assumed to have Bernoulli distribution). As stressed syllables are more prominent than their neighbors, the two preceding and two following syllables are taken into consideration. Experimental results show that the DBN achieves an accuracy of about 80% in syllable stress classification (primary/secondary/no stress) for words with three or more syllables. It outperforms the conventional Gaussian Mixture Model and our previous Prominence Model by an absolute accuracy of about 8% and 4%, respectively.


doi: 10.21437/Interspeech.2013-447

Cite as: Li, K., Qian, X., Kang, S., Meng, H. (2013) Lexical stress detection for L2 English speech using deep belief networks. Proc. Interspeech 2013, 1811-1815, doi: 10.21437/Interspeech.2013-447

@inproceedings{li13e_interspeech,
  author={Kun Li and Xiaojun Qian and Shiyin Kang and Helen Meng},
  title={{Lexical stress detection for L2 English speech using deep belief networks}},
  year=2013,
  booktitle={Proc. Interspeech 2013},
  pages={1811--1815},
  doi={10.21437/Interspeech.2013-447}
}