8th International Conference on Spoken Language Processing

Jeju Island, Korea
October 4-8, 2004

Task-Specific Minimum Bayes-Risk Decoding using Learned Edit Distance

Izhak Shafran, William Byrne

The Johns Hopkins University, USA

This paper extends the minimum Bayes-risk framework to incorporate a loss function specific to the task and the ASR system. The errors are modeled as a noisy channel and the parameters are learned from the data. The resulting loss function is used in the risk criterion for decoding. Experiments on a large vocabulary conversational speech recognition system demonstrate significant gains of about 1% absolute over MAP hypothesis and about 0.6% absolute over untrained loss function. The approach is general enough to be applicable to other sequence recognition problems such as in Optical Character Recognition (OCR) and in analysis of biological sequences.

Full Paper

Bibliographic reference.  Shafran, Izhak / Byrne, William (2004): "Task-specific minimum Bayes-risk decoding using learned edit distance", In INTERSPEECH-2004, 1945-1948.