This paper describes our work on the development of a system for automatic speech transcription applied to a Broadcast News (BN) task for the European Portuguese language. We developed audio segmentation modules including a scheme for tagging certain speaker clusters (anchors). We developed a speech recognition system for the broadcast news task using appropriate models. The tests were conducted using large quantities of BN data and show good results in terms of word error rate and processing time. This system is currently integrated in a prototype audio indexing and document retrieval system that is daily processing the main news show of the national Portuguese broadcaster.
Cite as: Meinedo, H., Neto, J.P. (2003) Automatic speech annotation and transcription in a broadcast news task. Proc. ISCA Workshop on Multilingual Spoken Document Retrieval (MSDR 2003), 95-100
@inproceedings{meinedo03_msdr, author={Hugo Meinedo and João P. Neto}, title={{Automatic speech annotation and transcription in a broadcast news task}}, year=2003, booktitle={Proc. ISCA Workshop on Multilingual Spoken Document Retrieval (MSDR 2003)}, pages={95--100} }