TBT (Toolkit to Build TTS): A High Performance Framework to Build Multiple Language HTS Voice

Atish Shankar Ghone, Rachana Nerpagar, Pranaw Kumar, Arun Baby, Aswin Shanmugam, Sasikumar M., Hema A. Murthy


With the development of high quality TTS systems, application area of synthetic speech is increasing rapidly. Beyond the communication aids for the visually impaired and vocally handicap, TTS voices are being used in various educational, telecommunication and multimedia applications. All around the world people are trying to build TTS voice for their regional languages. TTS voice building requires a number of steps to follow and involves use of multiple tools, which makes it time consuming, tedious and perplexing to a user. This paper describes a Toolkit developed for HMM-based TTS voice building that makes the process much easier and handy. The toolkit uses all required tools, viz. HTS, Festival, Festvox, Hybrid Segmentation Tool, etc. and handles each and every step starting from phone set creation, then prompt generation, hybrid segmentation, F0 range finding, voice building, and finally putting the built voice into Synthesis framework. Wherever possible it does parallel processing to reduce time. It saves manual effort and time to a large extent and enable a person to build TTS voice very easily. This toolkit is made available under Open Source license.


Cite as: Ghone, A.S., Nerpagar, R., Kumar, P., Baby, A., Shanmugam, A., M., S., Murthy, H.A. (2017) TBT (Toolkit to Build TTS): A High Performance Framework to Build Multiple Language HTS Voice. Proc. Interspeech 2017, 3427-3428.


@inproceedings{Ghone2017,
  author={Atish Shankar Ghone and Rachana Nerpagar and Pranaw Kumar and Arun Baby and Aswin Shanmugam and Sasikumar M. and Hema A. Murthy},
  title={TBT (Toolkit to Build TTS): A High Performance Framework to Build Multiple Language HTS Voice},
  year=2017,
  booktitle={Proc. Interspeech 2017},
  pages={3427--3428}
}