ISCA Archive Interspeech 2005

Application of automatic speaker recognition techniques to pathological voice assessment (dysphonia)

Corinne Fredouille, G. Pouchoulin, Jean-François Bonastre, M. Azzarello, A. Giovanni, A. Ghio

This paper investigates the adaptation of Automatic Speaker Recognition (ASR) techniques to the assessment of pathological (dysphonic) voices. The aim of this study is to provide a novel method for tracking the evolution of a patient's pathology that is easy to use, fast, non-invasive for the patient, and affordable for clinicians. This method is intended to complement the existing ones, namely perceptual judgment and the usual objective measurements (jitter, airflow, ...), which remain costly in time and human resources.

The system designed for this task relies on the GMM-based (Gaussian mixture model) approach, which is the state of the art for speaker recognition. It is derived from the open-source ASR tools (LIA_SpkDet and ALIZE) of the LIA lab.
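
As a rough illustration of what GMM-based scoring can look like, the sketch below computes an average per-frame log-likelihood ratio between two diagonal-covariance Gaussian mixture models. It is not the LIA_SpkDet or ALIZE implementation, and the abstract does not give the system's details: the function names, the two-class setup (`normal_gmm` versus `dysphonic_gmm`), the hand-set model parameters, and the two-dimensional toy frames are all assumptions made for the example.

```python
import math


def log_gaussian_diag(x, mean, var):
    """Log density of a diagonal-covariance Gaussian at feature vector x."""
    return sum(
        -0.5 * (math.log(2.0 * math.pi * v) + (xi - m) ** 2 / v)
        for xi, m, v in zip(x, mean, var)
    )


def gmm_log_likelihood(x, gmm):
    """Log-likelihood of one frame under a GMM given as (weight, mean, var) tuples."""
    terms = [math.log(w) + log_gaussian_diag(x, m, v) for w, m, v in gmm]
    top = max(terms)  # log-sum-exp trick for numerical stability
    return top + math.log(sum(math.exp(t - top) for t in terms))


def average_llr(frames, gmm_a, gmm_b):
    """Mean per-frame log-likelihood ratio of model A against model B."""
    total = sum(
        gmm_log_likelihood(f, gmm_a) - gmm_log_likelihood(f, gmm_b)
        for f in frames
    )
    return total / len(frames)


# Hypothetical, hand-set models; a real system would train them on acoustic features.
normal_gmm = [(0.5, [0.0, 0.0], [1.0, 1.0]), (0.5, [1.0, 1.0], [1.0, 1.0])]
dysphonic_gmm = [(0.5, [4.0, 4.0], [1.0, 1.0]), (0.5, [5.0, 5.0], [1.0, 1.0])]

# Toy feature frames standing in for the acoustic vectors of one recording.
frames = [[4.2, 3.9], [4.8, 5.1], [4.5, 4.4]]

score = average_llr(frames, dysphonic_gmm, normal_gmm)
print("dysphonic" if score > 0 else "normal")
```

A positive score means the frames are, on average, better explained by the first model than by the second. The decision threshold of zero is itself an assumption; in practice a threshold would be tuned on held-out data.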

Experiments conducted on a dysphonic corpus provide promising results, underlining the value of such an approach and opening avenues for further research.


doi: 10.21437/Interspeech.2005-90

Cite as: Fredouille, C., Pouchoulin, G., Bonastre, J.-F., Azzarello, M., Giovanni, A., Ghio, A. (2005) Application of automatic speaker recognition techniques to pathological voice assessment (dysphonia). Proc. Interspeech 2005, 149-152, doi: 10.21437/Interspeech.2005-90

@inproceedings{fredouille05_interspeech,
  author={Corinne Fredouille and G. Pouchoulin and Jean-François Bonastre and M. Azzarello and A. Giovanni and A. Ghio},
  title={{Application of automatic speaker recognition techniques to pathological voice assessment (dysphonia)}},
  year=2005,
  booktitle={Proc. Interspeech 2005},
  pages={149--152},
  doi={10.21437/Interspeech.2005-90}
}