Using Phonologically Weighted Levenshtein Distances for the Prediction of Microscopic Intelligibility

Lionel Fontan, Isabelle Ferrané, Jérôme Farinas, Julien Pinquier, Xavier Aumont


This article presents a new method for analyzing Automatic Speech Recognition (ASR) results at the phonological feature level. To this end the Levenshtein distance algorithm is refined in order to take into account the distinctive features opposing substituted phonemes. This method allows to survey features additions or deletions, providing microscopic qualitative information as a complement to word recognition scores. To explore the relevance of the qualitative data gathered by this method, a study is conducted on a speech corpus simulating presbycusis effects on speech perception at eight severity stages. Consonantic features additions and deletions in ASR outputs are analyzed and put in relation with intelligibility data collected in 30 human subjects. ASR results show monotonic trends in most consonantic features along the degradation conditions, which appear to be consistent with the misperceptions that could be observed in human subjects.


DOI: 10.21437/Interspeech.2016-431

Cite as

Fontan, L., Ferrané, I., Farinas, J., Pinquier, J., Aumont, X. (2016) Using Phonologically Weighted Levenshtein Distances for the Prediction of Microscopic Intelligibility. Proc. Interspeech 2016, 650-654.

Bibtex
@inproceedings{Fontan+2016,
author={Lionel Fontan and Isabelle Ferrané and Jérôme Farinas and Julien Pinquier and Xavier Aumont},
title={Using Phonologically Weighted Levenshtein Distances for the Prediction of Microscopic Intelligibility},
year=2016,
booktitle={Interspeech 2016},
doi={10.21437/Interspeech.2016-431},
url={http://dx.doi.org/10.21437/Interspeech.2016-431},
pages={650--654}
}