Is Syllable Stress Information Robust for ASR in Adverse Conditions?

Bogdan Ludusan, Stefan Ziegler, Guillaume Gravier


This paper presents a study on the robustness of stress information for automatic speech recognition in the presence of noise. The syllable stress, extracted from the speech signal, was integrated in the recognition process by means of a previously proposed decoding method. Experiments were conducted for several signal-to-noise ratio conditions and the results show that stress information is robust in the presence of medium to low noise. This was found to be true both when syllable boundary information was used for stress detection and when this information was not available. Furthermore, the obtained relative improvement increased with a decrease in signal quality, indicating that the stressed parts of the signal can be considered islands of reliability.


 DOI: 10.21437/SpeechProsody.2014-176

Cite as: Ludusan, B., Ziegler, S., Gravier, G. (2014) Is Syllable Stress Information Robust for ASR in Adverse Conditions?. Proc. 7th International Conference on Speech Prosody 2014, 939-943, DOI: 10.21437/SpeechProsody.2014-176.


@inproceedings{Ludusan2014,
  author={Bogdan Ludusan and Stefan Ziegler and Guillaume Gravier},
  title={{Is Syllable Stress Information Robust for ASR in Adverse Conditions?}},
  year=2014,
  booktitle={Proc. 7th International Conference on Speech Prosody 2014},
  pages={939--943},
  doi={10.21437/SpeechProsody.2014-176},
  url={http://dx.doi.org/10.21437/SpeechProsody.2014-176}
}