In recent years, IP telephone service has spread rapidly. However, an unavoidable problem of IP telephone service is the deterioration of speech quality due to packet loss, which often occurs on wireless networks. To overcome this problem, we propose a novel lost speech reconstruction method using speech recognition based on Missing Feature Theory and HMM-based speech synthesis. The proposed method uses linguistic information and can cope with the loss of syllable units, which conventional methods are unable to handle. We conducted subjective and objective evaluation experiments under speaker-independent conditions. The results showed the effectiveness of the proposed method. Although the proposed method introduces a processing delay, we believe that it will open up new applications for speech recognition and speech synthesis technology.
Cite as: Kuroiwa, S., Tsuge, S., Ren, F. (2006) Lost speech reconstruction method using speech recognition based on missing feature theory and HMM-based speech synthesis. Proc. Interspeech 2006, paper 1347-Tue2BuP.4, doi: 10.21437/Interspeech.2006-339
@inproceedings{kuroiwa06_interspeech,
  author={Shingo Kuroiwa and Satoru Tsuge and Fuji Ren},
  title={{Lost speech reconstruction method using speech recognition based on missing feature theory and HMM-based speech synthesis}},
  year=2006,
  booktitle={Proc. Interspeech 2006},
  pages={paper 1347-Tue2BuP.4},
  doi={10.21437/Interspeech.2006-339}
}