INTERSPEECH 2013
14thAnnual Conference of the International Speech Communication Association

Lyon, France
August 25-29, 2013

Improved Models for Automatic Punctuation Prediction for Spoken and Written Text

Nicola Ueffing (1), Maximilian Bisani (2), Paul Vozila (1)

(1) Nuance Communications, USA
(2) Amazon.de GmbH, Germany

This paper presents improved models for the automatic prediction of punctuation marks in written or spoken text. Various kinds of textual features are combined using Conditional Random Fields. These features include language model scores, token n-grams, sentence length, and syntactic information extracted from parse trees. The resulting models are evaluated on several different tasks, ranging from formal newspaper text to informal, dictated messages and documents, and from written text to spoken text. The newly developed models outperform a hidden-event language model by up to 26% relative in F-score. Evaluation of punctuation prediction on erroneous ASR output as well as evaluation against multiple references is not straightforward. We propose modifications of existing evaluation methods to handle these cases.

Full Paper

Bibliographic reference.  Ueffing, Nicola / Bisani, Maximilian / Vozila, Paul (2013): "Improved models for automatic punctuation prediction for spoken and written text", In INTERSPEECH-2013, 3097-3101.