 |
2003 ISCA Workshop on
Multilingual Spoken Document Retrieval
(MSDR2003)
Hong Kong
April 4-5, 2003 |
 |
Automatic Speech Annotation and Transcription in a Broadcast News Task
Hugo Meinedo, Joćo P. Neto
L2F - Spoken Language Systems Lab,
INESC-ID / IST, Lisboa, Portugal
This paper describes our work on the development
of a system for automatic speech transcription applied
to a Broadcast News (BN) task for the European
Portuguese language. We developed audio segmentation
modules including a scheme for tagging certain
speaker clusters (anchors). We developed a speech
recognition system for the broadcast news task using
appropriate models. The tests were conducted using
large quantities of BN data and show good results in
terms of word error rate and processing time. This
system is currently integrated in a prototype audio
indexing and document retrieval system that is daily
processing the main news show of the national Portuguese
broadcaster.
Full Paper
Bibliographic reference.
Meinedo, Hugo / Neto, Joćo P. (2003):
"Automatic speech annotation and transcription in a broadcast news task",
In MSDR-2003, 95-100.