ISCA Archive MAVEBA 2005

Effects of MP3 encoding on voice pathology detection: results with MFCC parameters

Nicolás Sáenz-Lechón, Juan Ignacio Godino-Llorente, Víctor Osma-Ruiz, Pedro Gómez-Vilda, Santiago Aguilera-Navarro

This paper presents a performance comparison for a voice pathology detection system dealing with different types of audio data. Several files of sustained phonation of the vowel /a/, from the Kay Elemetrics database, were encoded with the MP3 algorithm at various bit rates (160, 48 and 24 kbps). A multilayer perceptron classifier is then used to automatically distinguish the normal files from the pathological ones. Results are compared with those obtained for the original database, using confusion matrices and DET plots. There are no significant differences between the designed detectors.
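As a rough illustration of the two evaluation tools named above, the following minimal sketch computes a confusion matrix at a single threshold and the (false alarm rate, miss rate) pairs that a DET plot is drawn from. The function names, the label convention (1 = pathological, 0 = normal) and the toy scores are assumptions made for this example; they are not taken from the paper or its data.

```python
from typing import List, Sequence, Tuple


def confusion_matrix(labels: Sequence[int], scores: Sequence[float],
                     threshold: float) -> Tuple[int, int, int, int]:
    """Return (tp, fn, fp, tn); label 1 = pathological, 0 = normal (assumed)."""
    tp = fn = fp = tn = 0
    for label, score in zip(labels, scores):
        predicted = 1 if score >= threshold else 0
        if label == 1 and predicted == 1:
            tp += 1
        elif label == 1:
            fn += 1  # pathological file missed by the detector
        elif predicted == 1:
            fp += 1  # normal file flagged as pathological
        else:
            tn += 1
    return tp, fn, fp, tn


def det_points(labels: Sequence[int],
               scores: Sequence[float]) -> List[Tuple[float, float]]:
    """Return (false alarm rate, miss rate) for each distinct score threshold.

    Assumes both classes are present, so neither denominator is zero.
    """
    points = []
    for threshold in sorted(set(scores)):
        tp, fn, fp, tn = confusion_matrix(labels, scores, threshold)
        points.append((fp / (fp + tn), fn / (tp + fn)))
    return points


# Toy classifier outputs, invented for illustration only.
labels = [1, 1, 1, 0, 0, 0]
scores = [0.9, 0.8, 0.4, 0.6, 0.3, 0.1]

print(confusion_matrix(labels, scores, 0.5))
for false_alarm, miss in det_points(labels, scores):
    print(f"false alarm = {false_alarm:.2f}, miss = {miss:.2f}")
```

A DET plot conventionally shows these pairs on normal-deviate scaled axes; the scaling is omitted here to keep the sketch short. Comparing detectors trained on differently encoded data would amount to computing one such set of points per detector.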

Index Terms. Voice pathology detection, Multi-Layer Perceptrons, MPEG Audio layer 3 (MP3)


Cite as: Sáenz-Lechón, N., Godino-Llorente, J.I., Osma-Ruiz, V., Gómez-Vilda, P., Aguilera-Navarro, S. (2005) Effects of MP3 encoding on voice pathology detection: results with MFCC parameters. Proc. Models and Analysis of Vocal Emissions for Biomedical Applications (MAVEBA 2005), 15-18

@inproceedings{saenzlechon05b_maveba,
  author={Nicolás Sáenz-Lechón and Juan Ignacio Godino-Llorente and Víctor Osma-Ruiz and Pedro Gómez-Vilda and Santiago Aguilera-Navarro},
  title={{Effects of MP3 encoding on voice pathology detection: results with MFCC parameters}},
  year=2005,
  booktitle={Proc. Models and Analysis of Vocal Emissions for Biomedical Applications (MAVEBA 2005)},
  pages={15--18}
}