ISCA Archive SpeechProsody 2016
ISCA Archive SpeechProsody 2016

Syllable nucleus and boundary detection in noisy conditions

Sreedhar Patha, Yegnanarayana Bayya, Suryakanth V Gangashetty

In this paper, utilizing the feature of the sonority, the problems of detecting the syllable nuclei and the syllable boundaries are explored. As there is minimal obstruction in the oral cavity in the production of sonorant sounds, sonority feature is used. A method is proposed for extracting the sonority profile from the strength of the dominant resonance frequency using the zero time windowing method. The strength of the dominant resonance frequencies in the specific band of 500 Hz to 1700 Hz is used. The proposed method is evaluated on SVL-DD-NEWScorpora. The results are compared with the Fourier transform method and the energy-based method of extracting the sonority profile. The proposed method performed better compared to other methods, especially in the presence of noise such as white, pink and babble. The method is also evaluated on TIMIT test database and the performance is on par with the current methods in detecting the syllable nuclei.

doi: 10.21437/SpeechProsody.2016-74

Cite as: Patha, S., Bayya, Y., Gangashetty, S.V. (2016) Syllable nucleus and boundary detection in noisy conditions. Proc. Speech Prosody 2016, 360-364, doi: 10.21437/SpeechProsody.2016-74

  author={Sreedhar Patha and Yegnanarayana Bayya and Suryakanth V Gangashetty},
  title={{Syllable nucleus and boundary detection in noisy conditions}},
  booktitle={Proc. Speech Prosody 2016},