ISCA Archive Interspeech 2005

A transformation-based learning approach to language identification for mixed-lingual text-to-speech synthesis

J. C. Marcadet, V. Fischer, C. Waast-Richard

Recent progress in corpus-based concatenative text-to-speech synthesis has generated some interest in systems that are capable of synthesizing text from more than one language. In this paper, we describe the language identification component of such a mixed-lingual text-to-speech system. Relying only on the input text, we employ two different methods, namely a transformation-based learning approach and a stochastic n-gram approach, and we describe the combination of both methods. While the transformation-based learning approach alone already produces average error rates of less than 2 percent and outperforms the n-gram classification scheme, combining the two methods yields a further error reduction of up to 50 percent.
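To make the stochastic n-gram idea concrete, the following is a minimal sketch of word-level language identification with character trigrams and add-one smoothing. It is not the authors' implementation: the function names, the padding character, the smoothing scheme, and the toy word lists are all assumptions chosen for illustration, and the transformation-based learning component and the combination scheme are not shown.

```python
import math
from collections import Counter


def char_ngrams(word, n=3):
    """Return the character n-grams of a word, padded with '_' (an assumed boundary marker)."""
    padded = "_" * (n - 1) + word.lower() + "_"
    return [padded[i:i + n] for i in range(len(padded) - n + 1)]


def train(samples, n=3):
    """Count n-grams per language; `samples` maps a language code to a list of words."""
    return {
        lang: Counter(g for w in words for g in char_ngrams(w, n))
        for lang, words in samples.items()
    }


def score(word, counts, vocab_size, n=3):
    """Log-probability of a word under one language model, with add-one smoothing."""
    total = sum(counts.values())
    return sum(
        math.log((counts[g] + 1) / (total + vocab_size))
        for g in char_ngrams(word, n)
    )


def identify(word, models, n=3):
    """Pick the language whose n-gram model gives the word the highest score."""
    vocab = set().union(*models.values())  # all n-grams seen in any language
    return max(models, key=lambda lang: score(word, models[lang], len(vocab), n))


# Toy training data (hypothetical, far too small for real use).
models = train({
    "en": ["the", "weather", "thing", "with", "this", "there"],
    "de": ["das", "wetter", "schön", "nicht", "ich", "schule"],
})

labels = [identify(w, models) for w in ["that", "schon"]]
```

A classifier of this kind labels each word in isolation; the abstract reports that the transformation-based approach outperforms the n-gram scheme and that combining the two reduces errors further.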


doi: 10.21437/Interspeech.2005-711

Cite as: Marcadet, J.C., Fischer, V., Waast-Richard, C. (2005) A transformation-based learning approach to language identification for mixed-lingual text-to-speech synthesis. Proc. Interspeech 2005, 2249-2252, doi: 10.21437/Interspeech.2005-711

@inproceedings{marcadet05_interspeech,
  author={J. C. Marcadet and V. Fischer and C. Waast-Richard},
  title={{A transformation-based learning approach to language identification for mixed-lingual text-to-speech synthesis}},
  year=2005,
  booktitle={Proc. Interspeech 2005},
  pages={2249--2252},
  doi={10.21437/Interspeech.2005-711}
}