Interspeech'2005 - Eurospeech

Lisbon, Portugal
September 4-8, 2005

Cross-Language Synthesis with a Polyglot Synthesizer

Javier Latorre, Koji Iwano, Sadaoki Furui

Tokyo Institute of Technology, Japan

In this paper we examine the use of an HMM-based polyglot synthesizer for languages for which very limited or no speech data is available. In a former study, we presented a system that combines monolingual corpora from several languages to create a polyglot synthesizer. With this synthesizer we can synthesize any of the languages included in the training data with the same output voice and speech quality. In this paper, we approximate the sounds of non-included languages, by those available in the polyglot training data. Since the phonetic inventory of a polyglot synthesizer is wider than that of a monolingual one, the approximation of such non-included sounds becomes more accurate and thus the perceptual intelligibility increases. Moreover, the performance of a polyglot synthesizer can be further improved by adding a reduced amount of data from the target language.

Full Paper

Bibliographic reference.  Latorre, Javier / Iwano, Koji / Furui, Sadaoki (2005): "Cross-language synthesis with a polyglot synthesizer", In INTERSPEECH-2005, 1477-1480.