ISCA & IEEE Workshop on Spontaneous Speech Processing and Recognition

April 13-16, 2003
Tokyo Institute of Technology, Tokyo, Japan

How to Improve Human and Machine Transcriptions of Spontaneous Speech

Diana Binnenpoorte, Simo Goddijn, Catia Cucchiarini

A2RT, Department of Language and Speech, University of Nijmegen, The Netherlands

This paper reports on an experiment aimed at measuring the quality of automatic and human phonetic transcriptions of different speech styles. The transcriptions were produced within the framework of a large speech corpus project for Dutch, the Spoken Dutch Corpus (Corpus Gesproken Nederlands, CGN). The results indicate that the procedure adopted in the CGN to improve the quality of phonetic transcriptions does indeed contribute to achieving this aim. However, better transcriptions of spontaneous speech could probably be obtained by resorting to ASR techniques for pronunciation variation modeling. Our research indicates how this could be achieved.

Bibliographic reference. Binnenpoorte, Diana / Goddijn, Simo / Cucchiarini, Catia (2003): "How to improve human and machine transcriptions of spontaneous speech", in SSPR-2003, paper TMO7.