ISCA Archive Interspeech 2013
ISCA Archive Interspeech 2013

A comparative study of glottal open quotient estimation techniques

John Kane, Stefan Scherer, Louis-Philippe Morency, Christer Gobl

The robust and efficient extraction of features related to the glottal excitation source has become increasingly important for speech technology. The glottal open quotient (OQ) is one relevant measurement which is known to significantly vary with changes in voice quality on a breathy to tense continuum. The extraction of OQ, however, is hampered in the time-domain by the difficulty in consistently locating the point of glottal opening as well the computational load of its measurement. Determining OQ correlates in the frequency domain is an attractive alternative, however the lower frequencies of glottal source spectrum are also affected by other aspects of the glottal pulse shape thereby precluding closed-form solutions and straightforward mappings. The present study provides a comparison of three OQ estimation methods and shows a new method based on spectral features and artificial neural networks to outperform existing methods in terms of discrimination of voice quality, lower error values on a large volume of speech data and dramatically reduced computation time.


doi: 10.21437/Interspeech.2013-25

Cite as: Kane, J., Scherer, S., Morency, L.-P., Gobl, C. (2013) A comparative study of glottal open quotient estimation techniques. Proc. Interspeech 2013, 1658-1662, doi: 10.21437/Interspeech.2013-25

@inproceedings{kane13b_interspeech,
  author={John Kane and Stefan Scherer and Louis-Philippe Morency and Christer Gobl},
  title={{A comparative study of glottal open quotient estimation techniques}},
  year=2013,
  booktitle={Proc. Interspeech 2013},
  pages={1658--1662},
  doi={10.21437/Interspeech.2013-25}
}