2003 ISCA Workshop on Multilingual Spoken Document Retrieval
(MSDR2003)

Hong Kong
April 4-5, 2003

Automatic Speech Annotation and Transcription in a Broadcast News Task

Hugo Meinedo, Joćo P. Neto

L2F - Spoken Language Systems Lab, INESC-ID / IST, Lisboa, Portugal

This paper describes our work on the development of a system for automatic speech transcription applied to a Broadcast News (BN) task for the European Portuguese language. We developed audio segmentation modules including a scheme for tagging certain speaker clusters (anchors). We developed a speech recognition system for the broadcast news task using appropriate models. The tests were conducted using large quantities of BN data and show good results in terms of word error rate and processing time. This system is currently integrated in a prototype audio indexing and document retrieval system that is daily processing the main news show of the national Portuguese broadcaster.


Full Paper

Bibliographic reference.  Meinedo, Hugo / Neto, Joćo P. (2003): "Automatic speech annotation and transcription in a broadcast news task", In MSDR-2003, 95-100.