ISCA Archive Interspeech 2008
ISCA Archive Interspeech 2008

Building sleek synthesizers for multi-lingual screen reader

E Veera Raghavendra, B. Yegnanarayana, Alan W. Black, Kishore Prahallad

In this paper, we are investigating the unit size: syllable, halfphone and quarter-phone to be used for speech synthesis in multi-lingual screen reader in phonetic languages such as Telugu and non-phonetic language English. Perceptual studies show that syllable-level unit performs better for Telugu and half-phone units perform better for English. While syllable based synthesizers produce better sounding speech, the coverage of all syllables is a non-trivial issue. We address the issue of coverage of syllables through approximate matching of syllable and show that such approximation produces intelligible and better quality speech than diphone units. In this paper, we also propose a hybrid synthesizer within the framework of unit selection and also show that the hybrid synthesizer built from pruned database performs as well as hybrid synthesizer built from unpruned database.


doi: 10.21437/Interspeech.2008-185

Cite as: Raghavendra, E.V., Yegnanarayana, B., Black, A.W., Prahallad, K. (2008) Building sleek synthesizers for multi-lingual screen reader. Proc. Interspeech 2008, 1865-1868, doi: 10.21437/Interspeech.2008-185

@inproceedings{raghavendra08_interspeech,
  author={E Veera Raghavendra and B. Yegnanarayana and Alan W. Black and Kishore Prahallad},
  title={{Building sleek synthesizers for multi-lingual screen reader}},
  year=2008,
  booktitle={Proc. Interspeech 2008},
  pages={1865--1868},
  doi={10.21437/Interspeech.2008-185}
}