8th Annual Conference of the International Speech Communication Association

Antwerp, Belgium
August 27-31, 2007

Artificial Bandwidth Extension for Speech Signals Using Speech Recogniton

Shingo Kuroiwa, Masashi Takashina, Satoru Tsuge, Ren Fuji

University of Tokushima, Japan

In this paper, we propose a non-realtime speech bandwidth extension method using HMM-based speech recognition and HMM-based speech synthesis. In the proposed method, first, the phoneme-state sequence is estimated from the bandlimited speech signals using the speech recognition technique. Next, for estimating spectrum envelopes of lost high-frequency components, an HMM-based speech synthesis technique generates a synthetic speech signal (spectrum sequence) according to the predicted phoneme-state sequence. Since both speech recognition and speech synthesis take into account dynamic feature vectors, we can obtain a smoothly varying spectrum sequence. For evaluating the proposed method, we conducted subjective and objective experiments. The experimental results show the effectiveness of the proposed method for bandwidth extension. However, the proposed method needs more improvement in speech quality.

Full Paper

Bibliographic reference.  Kuroiwa, Shingo / Takashina, Masashi / Tsuge, Satoru / Fuji, Ren (2007): "Artificial bandwidth extension for speech signals using speech recogniton", In INTERSPEECH-2007, 2501-2504.