Sixth European Conference on Speech Communication and Technology
(EUROSPEECH'99)

Budapest, Hungary
September 5-9, 1999

A Study of Adaptation Techniques on a Voicemail Transcription Task

Jing Huang, Mukund Padmanabhan

IBM T. J. Watson Research Center, Yorktown Heights, NY, USA

Speaker adaptation techniques have emerged as very effective and practical methods to improve ASR performance on a test speaker with only limited speech data from the speaker. We explore the use of adaptation techniques on a new Voicemail database and present some adaptation techniques on a new Voicemail database and present some theoretical extensions of the Cluster Transformation (CT) technique. Our experiments on 40 hours of voicemail data and four clusters shows that using cluster information with MLLR improves over baseline MLLR by 2.2% (relative). When the amount of adaptation data in a short message is insufficient to reliably decide its cluster, higher improvements result when we use MLLR for the very short messages and CT on longer ones.


Full Paper (PDF)   Gnu-Zipped Postscript

Bibliographic reference.  Huang, Jing / Padmanabhan, Mukund (1999): "A study of adaptation techniques on a voicemail transcription task", In EUROSPEECH'99, 13-16.