ISCA Archive SSW 2007
ISCA Archive SSW 2007

Building a better Indian English voice using "more data"

Rohit Kumar, Rashmi Gangadharaiah, Sharath Rao, Kishore Prahallad, Carolyn P. Rosé, Alan W. Black

We report our experiments towards improving an existing publicly available Indian English voice using additional data. The additional data was used to create new duration and pronunciation models as well as to convert the existing voice to create a more Indian sounding voice. Two experiments along the above lines are reported. In the first experiment, we found that changing the pronunciation models has the potential to improve an existing Indian English voice. We conducted a second experiment to validate this finding. The second experiment shows the potential value in carefully investigating the separate effects of the different components of a pronunciation model in order to understand their unique contributions to improving an Indian English voice.


Cite as: Kumar, R., Gangadharaiah, R., Rao, S., Prahallad, K., Rosé, C.P., Black, A.W. (2007) Building a better Indian English voice using "more data". Proc. 6th ISCA Workshop on Speech Synthesis (SSW 6), 90-94

@inproceedings{kumar07_ssw,
  author={Rohit Kumar and Rashmi Gangadharaiah and Sharath Rao and Kishore Prahallad and Carolyn P. Rosé and Alan W. Black},
  title={{Building a better Indian English voice using "more data"}},
  year=2007,
  booktitle={Proc. 6th ISCA Workshop on Speech Synthesis (SSW 6)},
  pages={90--94}
}