8th Annual Conference of the International Speech Communication Association

Antwerp, Belgium
August 27-31, 2007

Direct Optimisation of a Multilayer Perceptron for the Estimation of Cepstral Mean and Variance Statistics

John Dines, Jithendra Vepa

IDIAP Research Institute, Switzerland

We propose an alternative way of training a multilayer perceptron for speech activity detection. The training criterion minimises the error in the estimated mean and variance statistics of cepstrum-based speech features, measured using the Kullback-Leibler divergence. We present our baseline and proposed speech activity detection approaches for multi-channel meeting room recordings, and demonstrate the effectiveness of the new criterion by comparing the two approaches when each is used to carry out cepstral mean and variance normalisation of the features used in our meeting ASR system.
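The quantities named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract gives no formulas, so the sketch assumes that the cepstral statistics are summarised as diagonal-covariance Gaussians, that the speech activity detector outputs per-frame speech posteriors used as weights, and that the error is the closed-form Kullback-Leibler divergence between reference and estimated statistics. The function names (weighted_stats, kl_diag_gaussians, cmvn) are hypothetical.

```python
import numpy as np

def weighted_stats(features, speech_weights, eps=1e-8):
    """Per-dimension mean and variance of features (frames x dims),
    weighted by per-frame speech weights (assumed to be SAD posteriors)."""
    w = speech_weights[:, None]           # shape (frames, 1) for broadcasting
    total = w.sum() + eps                 # eps guards against all-zero weights
    mean = (w * features).sum(axis=0) / total
    var = (w * (features - mean) ** 2).sum(axis=0) / total
    return mean, var + eps                # eps keeps the variance positive

def kl_diag_gaussians(mean_p, var_p, mean_q, var_q):
    """KL(p || q) between diagonal-covariance Gaussians, summed over dims."""
    return 0.5 * np.sum(
        np.log(var_q / var_p)
        + (var_p + (mean_p - mean_q) ** 2) / var_q
        - 1.0
    )

def cmvn(features, mean, var):
    """Cepstral mean and variance normalisation with the given statistics."""
    return (features - mean) / np.sqrt(var)

# Toy example: three frames, two cepstral dimensions (made-up values).
feats = np.array([[1.0, 2.0], [3.0, 6.0], [5.0, 10.0]])
reference = weighted_stats(feats, np.array([1.0, 1.0, 0.0]))  # "true" speech frames
estimate = weighted_stats(feats, np.array([1.0, 1.0, 1.0]))   # detector marks all frames
error = kl_diag_gaussians(*reference, *estimate)              # statistic estimation error
normalised = cmvn(feats, *estimate)
```

Under these assumptions, a detector that wrongly includes or excludes frames is penalised only to the extent that it shifts the normalisation statistics, which is the intuition behind optimising the statistics directly rather than frame-level classification accuracy.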

Bibliographic reference. Dines, John / Vepa, Jithendra (2007): "Direct optimisation of a multilayer perceptron for the estimation of cepstral mean and variance statistics", in INTERSPEECH-2007, 2921-2924.