8th International Conference on Spoken Language Processing

Jeju Island, Korea
October 4-8, 2004

Acoustic Model Adaptation for Coded Speech Using Synthetic Speech

Koji Tanaka (1), Fuji Ren (2), Shingo Kuroiwa (2), Satoru Tsuge (2)

(1) The University of Tokushima, Japan
(2) The University of Tokushima, Japan

In this paper, we describe a novel acoustic model adaptation technique which generates "speaker-independent" HMM for the target environment. Recently, personal digital assistants like cellular phones are shifting to IP terminals. The encoding-decoding process utilized for transmitting over IP networks deteriorates the quality of speech data. This deterioration causes degradation in speech recognition performance. Acoustic model adaptations can improve recognition performance. However, the conventional adaptation methods usually require a large amount of adaptation data. The proposed method uses HMM-based speech synthesis to generate adaptation data from the acoustic model of HMM-based speech recognizer, and consequently does not require any speech data for adaptation. Experimental results on G.723.1 coded speech recognition show that the proposed method improves speech recognition performance. A relative word error rate reduction of approximately 12% was observed.

Full Paper

Bibliographic reference.  Tanaka, Koji / Ren, Fuji / Kuroiwa, Shingo / Tsuge, Satoru (2004): "Acoustic model adaptation for coded speech using synthetic speech", In INTERSPEECH-2004, 2925-2928.