INTERSPEECH 2004 - ICSLP
An integrated method of text pre-processing and language identification is introduced to deal with the problem of mixed-language e-mail messages in a speech-enabled e-mail reading system. Our method can confidently distinguish between the supported languages and switch between several TTS engines or languages to read the portions of the text in the appropriate language. This is achieved by making use of the combined information from a text pre-processor and a language identifier that relies on both statistical information and linguistic features indicative of a particular language.
Full Paper Presentation [.ppt]
Bibliographic reference. Oria, Daniela / Vetek, Akos (2004): "Multilingual e-mail text processing for speech synthesis", In INTERSPEECH-2004, 841-844.