7th International Conference on Spoken Language Processing
September 16-20, 2002
This work addresses issues that arose when the ITC-irst broadcast news transcription system was applied to the audio track of historical documentary films. Because of an evident acoustic and linguistic mismatch between broadcast news and the new application domain, the initial word error rate was 46.4%. Using a limited amount of manually annotated training data, all components of the transcription system were adapted, namely the audio partitioner, the acoustic model, and the language model. Adaptation reduced the word error rate to 30%, which makes automatic transcription of documentary films effective for information retrieval applications.
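For readers unfamiliar with the metric, word error rate is conventionally computed as the word-level edit distance (substitutions, deletions, insertions) between a reference transcript and the recognizer output, divided by the number of reference words. The following is a minimal sketch under that standard definition; the function names and the example sentences are illustrative assumptions and do not come from the paper, which does not describe its scoring tool. Under the same arithmetic, going from 46.4% to 30% is roughly a 35% relative reduction.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference transcript must contain at least one word")

    # prev[j] holds the edit distance between the reference prefix
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # deleting all i reference words matches an empty hypothesis
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1      # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1] / len(ref)


def relative_reduction(before: float, after: float) -> float:
    """Relative improvement between two error rates given in the same unit."""
    return (before - after) / before


if __name__ == "__main__":
    # Hypothetical transcripts, for illustration only.
    print(word_error_rate("the war ended in 1945", "the war ended 1945"))
    # Figures quoted in the abstract: 46.4% before adaptation, 30% after.
    print(round(relative_reduction(46.4, 30.0), 3))
```

Note that this measure can exceed 100% when the hypothesis contains many insertions, since the denominator is the reference length only.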
Bibliographic reference. Brugnara, F. / Cettolo, M. / Federico, M. / Giuliani, D. (2002): "Issues in automatic transcription of historical audio data", In ICSLP-2002, 1441-1444.