Sixth European Conference on Speech Communication and Technology
Speaker adaptation techniques have emerged as very effective and practical methods to improve ASR performance on a test speaker with only limited speech data from the speaker. We explore the use of adaptation techniques on a new Voicemail database and present some adaptation techniques on a new Voicemail database and present some theoretical extensions of the Cluster Transformation (CT) technique. Our experiments on 40 hours of voicemail data and four clusters shows that using cluster information with MLLR improves over baseline MLLR by 2.2% (relative). When the amount of adaptation data in a short message is insufficient to reliably decide its cluster, higher improvements result when we use MLLR for the very short messages and CT on longer ones.
Full Paper (PDF) Gnu-Zipped Postscript
Bibliographic reference. Huang, Jing / Padmanabhan, Mukund (1999): "A study of adaptation techniques on a voicemail transcription task", In EUROSPEECH'99, 13-16.