Ninth International Conference on Spoken Language Processing

Pittsburgh, PA, USA
September 17-21, 2006

Automatic Phonetic Transcription of Large Speech Corpora: A Comparative Study

Christophe Van Bael, Lou Boves, Henk van den Heuvel, Helmer Strik

Radboud Universiteit Nijmegen, The Netherlands

This study investigates whether automatic transcription procedures can approximate the manual phonetic transcriptions typically delivered with contemporary large speech corpora. We used ten automatic procedures to generate a broad phonetic transcription of well-prepared speech (read-aloud texts) and spontaneous speech (telephone dialogues). The resulting transcriptions were compared to manually verified phonetic transcriptions. We found that the quality of such manually verified transcriptions can be approximated by a fairly simple and cost-effective automatic procedure.

Bibliographic reference. Bael, Christophe Van / Boves, Lou / Heuvel, Henk van den / Strik, Helmer (2006): "Automatic phonetic transcription of large speech corpora: a comparative study", in INTERSPEECH-2006, paper 1173-Tue2WeO.5.