Sixth European Conference on Speech Communication and Technology
We describe a new algorithm for finding the hypothesis in a recognition lattice that is expected to minimize the word error rate (WER). Our approach thus overcomes the mismatch between the word-based performance metric and the standard MAP scoring paradigm that is sentence-based, and that can lead to sub-optimal recognition results. To this end we first find a complete alignment of all words in the recognition lattice, identifying mutually supporting and competing word hypotheses. Finally, a new sentence hypothesis is formed by concatenating the words with maximal posterior probabilities. Experimentally, this approach leads to a significant WER reduction in a large vocabulary recognition task.
Full Paper (PDF) Gnu-Zipped Postscript
Bibliographic reference. Mangu, Lidia / Brill, Eric / Stolcke, Andreas (1999): "Finding consensus among words: lattice-based word error minimization", In EUROSPEECH'99, 495-498.