ISCA Archive Interspeech 2008

Decoding-time prediction of non-verbalized punctuation

Anoop Deoras, Jürgen Fritsch

This paper presents novel methods that integrate lexical prediction of non-verbalized punctuation with Viterbi decoding for Large Vocabulary Conversational Speech Recognition (LVCSR) in a single pass. We describe two approaches: one based on a modified finite state machine representation of language models, and one based on an extension of an LVCSR decoder. We discuss their advantages over traditional punctuation prediction approaches based on post-processing of recognition hypotheses, and evaluate the proposed methods experimentally using a state-of-the-art LVCSR decoder. Experiments on a medical documentation corpus demonstrate that the proposed methods improve punctuation prediction accuracy while reducing system complexity and memory requirements.

doi: 10.21437/Interspeech.2008-418

Cite as: Deoras, A., Fritsch, J. (2008) Decoding-time prediction of non-verbalized punctuation. Proc. Interspeech 2008, 1449-1452, doi: 10.21437/Interspeech.2008-418

  author={Anoop Deoras and Jürgen Fritsch},
  title={{Decoding-time prediction of non-verbalized punctuation}},
  booktitle={Proc. Interspeech 2008},