8th International Conference on Spoken Language Processing

Jeju Island, Korea
October 4-8, 2004

Robust ASR Model Adaptation by Feature-Based Statistical Data Mapping

Xuechuan Wang, Douglas O'Shaughnessy

University of Quebec, Canada

Automatic speech recognition (ASR) model adaptation is important to many real-life ASR applications because of the variability of speech. Differences in speaker, bandwidth, context, channel, and other factors between the speech databases used to train the initial ASR models and the application data can be major obstacles to the effectiveness of those models. ASR models therefore need to be adapted to their application environments. Maximum Likelihood Linear Regression (MLLR) is a popular model-based method used mainly for speaker adaptation. This paper proposes a feature-based Statistical Data Mapping (SDM) approach, which is more flexible than MLLR across applications, for example when bandwidth or context differs. Experimental results on the TIMIT database show that ASR models adapted by the SDM approach have improved accuracy.


Bibliographic reference.  Wang, Xuechuan / O'Shaughnessy, Douglas (2004): "Robust ASR model adaptation by feature-based statistical data mapping", In INTERSPEECH-2004, 2905-2908.