INTERSPEECH 2004 - ICSLP
8th International Conference on Spoken Language Processing

Jeju Island, Korea
October 4-8, 2004

Adaptation for Soft Whisper Recognition Using a Throat Microphone

Szu-Chen Jou, Tanja Schultz, Alex Waibel

Carnegie Mellon University, USA

This paper describes various adaptation methods applied to recognizing soft whisper recorded with a throat microphone. Since the amount of adaptation data is small and the testing data is very different from the training data, a series of adaptation methods is necessary. The adaptation methods include: maximum likelihood linear regression, feature-space adaptation, and re-training with downsampling, sigmoidal low-pass filter, or linear multivariate regression. With these adaptation methods, the word error rate improves from 99.3% to 32.9%.

Full Paper

Bibliographic reference.  Jou, Szu-Chen / Schultz, Tanja / Waibel, Alex (2004): "Adaptation for soft whisper recognition using a throat microphone", In INTERSPEECH-2004, 1493-1496.