Sixth International Conference on Spoken Language Processing
(ICSLP 2000)

Beijing, China
October 16-20, 2000

Acoustical and Lexical Based Confidence Measures for a Very Large Vocabulary Telephone Speech Hypothesis-Verification System

Javier Macías-Guarasa, Javier Ferreiros, Ruben San-Segundo, Juan Manuel Montero, Juan Manuel Pardo

Grupo de Tecnología del Habla. Departamento Ingeniería Electrónica. Universidad Politécnica de Madrid, Spain

In large vocabulary speech recognition systems, it is of major interest to classify every utterance as correctly or incorrectly recognised. In this paper we present a preliminary study of a word-level confidence estimation system based on the output of a neural network (NN). We combine multiple features extracted from the acoustical and lexical decoders of our reference system, namely those available at the hypothesis stage of a hypothesis-verification very large vocabulary telephone speech recognition system. We show the system architecture, describe the experiments that led to the selection of the parameter set used by the NN, and report the final performance, which shows promising results compared with standard log-likelihood ratio techniques for confidence scoring.
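As an illustration only, the two kinds of confidence score contrasted above can be sketched as follows. This is a minimal sketch under stated assumptions, not the system described in this paper: the function names, the choice of a competing hypothesis for the ratio, the single sigmoid unit standing in for the NN, and the decision threshold are all hypothetical.

```python
import math


def llr_confidence(best_log_likelihood, alternative_log_likelihood):
    """Log-likelihood ratio between the top hypothesis and a competing one.

    Assumption: both inputs are log-likelihoods, so the ratio is a difference.
    """
    return best_log_likelihood - alternative_log_likelihood


def nn_confidence(features, weights, bias):
    """Single sigmoid unit mapping a feature vector to a score in (0, 1).

    Assumption: a stand-in for a trained network; weights are hypothetical.
    """
    activation = sum(w * x for w, x in zip(weights, features)) + bias
    return 1.0 / (1.0 + math.exp(-activation))


def accept(confidence, threshold):
    """Classify a hypothesis as correctly recognised if it meets the threshold."""
    return confidence >= threshold
```

In both cases the score is compared against a threshold chosen on held-out data; the difference lies in whether the score comes from a single likelihood ratio or from several decoder features combined by a trained model.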

