5th International Conference on Spoken Language Processing

Sydney, Australia
November 30 - December 4, 1998

Phoneme Recognition with Statistical Modeling of the Prediction Error of Neural Networks

Felix Freitag, Enric Monte

UPC, Spain

This paper presents a speech recognition system that incorporates predictive neural networks. The networks predict observation vectors of speech. The prediction error vectors are modeled at the state level by Gaussian densities, which provide the local similarity measure for the Viterbi algorithm during recognition. The system is evaluated on a continuous-speech phoneme recognition task. Compared with an HMM reference system, the proposed system obtained better results in the speech recognition experiments.
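
The scoring idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the two toy predictors stand in for trained neural networks, and the one-predictor-per-state setup, the diagonal covariance, the uniform transitions, and all function names are assumptions made for illustration only. Each state scores a frame by the Gaussian log-likelihood of its prediction error, and the Viterbi algorithm uses that score as its local similarity measure.

```python
import math


def diag_gaussian_logpdf(x, mean, var):
    """Log density of a diagonal-covariance Gaussian evaluated at vector x."""
    return sum(
        -0.5 * (math.log(2.0 * math.pi * v) + (xi - m) ** 2 / v)
        for xi, m, v in zip(x, mean, var)
    )


def local_score(predict, prev_frame, frame, err_mean, err_var):
    """Score a frame by the Gaussian log-likelihood of its prediction error."""
    predicted = predict(prev_frame)
    error = [f - p for f, p in zip(frame, predicted)]
    return diag_gaussian_logpdf(error, err_mean, err_var)


def viterbi(scores, log_trans, log_init):
    """scores[t][s] is the local log score of state s at step t.

    Returns (best total log score, best state path).
    """
    n_states = len(log_init)
    delta = [log_init[s] + scores[0][s] for s in range(n_states)]
    backptrs = []
    for t in range(1, len(scores)):
        new_delta, ptrs = [], []
        for s in range(n_states):
            best_prev = max(range(n_states),
                            key=lambda r: delta[r] + log_trans[r][s])
            new_delta.append(delta[best_prev] + log_trans[best_prev][s]
                             + scores[t][s])
            ptrs.append(best_prev)
        delta = new_delta
        backptrs.append(ptrs)
    state = max(range(n_states), key=lambda s: delta[s])
    best = delta[state]
    path = [state]
    for ptrs in reversed(backptrs):
        state = ptrs[state]
        path.append(state)
    path.reverse()
    return best, path


# Hypothetical stand-ins for trained predictive networks, one per state:
# state 0 expects a constant signal, state 1 expects a ramp.
predictors = [
    lambda prev: list(prev),
    lambda prev: [p + 1.0 for p in prev],
]
# Assumed error statistics per state: zero mean, variance 0.1.
err_means = [[0.0], [0.0]]
err_vars = [[0.1], [0.1]]

frames = [[0.0], [0.0], [0.0], [1.0], [2.0], [3.0]]

# One score row per predicted frame (the first frame has no predecessor).
scores = [
    [local_score(predictors[s], frames[t - 1], frames[t],
                 err_means[s], err_vars[s]) for s in range(2)]
    for t in range(1, len(frames))
]

log_half = math.log(0.5)
best, path = viterbi(scores, [[log_half, log_half]] * 2, [log_half, log_half])
print(path)
```

In this toy setup the decoded path switches from state 0 to state 1 at the point where the signal starts to ramp, because the ramp predictor's error becomes the one with the higher Gaussian likelihood.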


Bibliographic reference. Freitag, Felix / Monte, Enric (1998): "Phoneme recognition with statistical modeling of the prediction error of neural networks", in ICSLP-1998, paper 0455.