EUROSPEECH 2003 - INTERSPEECH 2003
8th European Conference on Speech Communication and Technology

Geneva, Switzerland
September 1-4, 2003

        

Bandwidth Mismatch Compensation for Robust Speech Recognition

Yuan-Fu Liao (1), Jeng-Shien Lin (1), Wei-Ho Tsai (2)

(1) National Taipei University of Technology, Taiwan
(2) Academia Sinica, Taiwan

In this paper, an iterative bandwidth mismatch compensation (BMC) algorithm is proposed to alleviate the need of multiple pre-trained models for recognizing different bandwidth speech. The BMC uses the concept of the bandwidth extension as similar as in the speech enhancement approaches. However, it aims at directly improving the recognition accuracy instead of speech intelligence or quality and utilizes only recognizer's hidden Markov models (HMMs) for both bandwidth mismatch compensation and recognition. The BMC first detects the bandwidth of the input speech signal based on a divergence measurement. The HMM/Gaussian mixture model (GMM)- based method is then used to iteratively segment the input speech utterance and compensates the speech features. Experiments on serious bandwidth mismatched conditions, i.e., training on 8 kHz and testing on 4 kHz or 5.5 kHz bandwidth database have verified the effectiveness of the proposed approach.

Full Paper

Bibliographic reference.  Liao, Yuan-Fu / Lin, Jeng-Shien / Tsai, Wei-Ho (2003): "Bandwidth mismatch compensation for robust speech recognition", In EUROSPEECH-2003, 3093-3096.