8th International Conference on Spoken Language Processing

Jeju Island, Korea
October 4-8, 2004

PROSPECT Features and their Application to Missing Data Techniques for Robust Speech Recognition

Hugo Van hamme

Katholieke Universiteit Leuven, Belgium

Missing data theory has been applied to the problem of speech recognition in adverse environments. The resulting systems require acoustic models that are expressed in the spectral rather than in the cepstral domain, which leads to loss of accuracy. Cepstral Missing Data Techniques (CMDT) surmount this disadvantage, but require significantly more computation. In this paper, we study alternatives to the cepstral representation that lead to more efficient MDT systems. The proposed solution, PROSPECT features (Projected Spectra), can be interpreted as a novel speech representation, or as an approximation of the inverse covariance (precision) matrix of the Gaussian distributions modeling the log-spectra.

Full Paper

Bibliographic reference.  hamme, Hugo Van (2004): "PROSPECT features and their application to missing data techniques for robust speech recognition", In INTERSPEECH-2004, 101-104.