Sixth European Conference on Speech Communication and Technology

Budapest, Hungary
September 5-9, 1999

Speech Recognition with Automatic Punctuation

C. Julian Chen

IBM Thomas J. Watson Research Center, Yorktown Heights, NY, USA

We present a method of speech recognition with automatic punctuation based on a combination of acoustic and lexical evidence. In the recognizer vocabulary, punctuation marks are treated as word entries. By assigning the acoustic baseforms of silence, breath, and other non-speech sounds to punctuation marks, and using a properly processed N-gram language model, unpronounced punctuation marks of various types (commas, periods, etc.) appear naturally in the recognizer output. This technology can be used in dictation systems to improve usability, in commercial broadcast transcription systems to reduce editing time, and in information retrieval systems to provide phrasing information to facilitate natural language understanding.

Full Paper (PDF)   Gnu-Zipped Postscript

Bibliographic reference.  Chen, C. Julian (1999): "Speech recognition with automatic punctuation", In EUROSPEECH'99, 447-450.