8th International Conference on Spoken Language Processing

Jeju Island, Korea
October 4-8, 2004

Duration Modeling for Hindi Text-to-speech Synthesis System

Sridhar Krishna Nemala, Partha Pratim Talukdar, Kalika Bali, A. G. Ramakrishnan

Hewlett-Packard Labs India, India

This paper reports preliminary results of data-driven modeling of segmental (phoneme) duration for Hindi. A Classification and Regression Tree (CART) based model for predicting segmental duration is presented. A number of features are considered, and their usefulness and relative contributions to segmental duration prediction are assessed. The duration model is evaluated objectively using the root mean squared prediction error (RMSE) and the correlation between actual and predicted durations.
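
As an illustration of the two objective measures named in the abstract, the following minimal sketch computes RMSE and the Pearson correlation between actual and predicted durations. It is not the authors' evaluation code. The function names and the sample durations (in milliseconds) are hypothetical, and the use of the Pearson coefficient is an assumption, since the abstract does not say which correlation measure was used.

```python
import math


def rmse(actual, predicted):
    """Root mean squared error between two equal-length sequences."""
    if len(actual) != len(predicted) or len(actual) == 0:
        raise ValueError("sequences must be non-empty and of equal length")
    squared_errors = [(a - p) ** 2 for a, p in zip(actual, predicted)]
    return math.sqrt(sum(squared_errors) / len(squared_errors))


def pearson_correlation(actual, predicted):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(actual) != len(predicted) or len(actual) < 2:
        raise ValueError("sequences must have equal length of at least 2")
    n = len(actual)
    mean_a = sum(actual) / n
    mean_p = sum(predicted) / n
    covariance = sum((a - mean_a) * (p - mean_p) for a, p in zip(actual, predicted))
    spread_a = sum((a - mean_a) ** 2 for a in actual)
    spread_p = sum((p - mean_p) ** 2 for p in predicted)
    if spread_a == 0 or spread_p == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return covariance / math.sqrt(spread_a * spread_p)


if __name__ == "__main__":
    # Hypothetical phoneme durations in milliseconds (not data from the paper).
    actual_ms = [60.0, 85.0, 110.0, 45.0]
    predicted_ms = [65.0, 80.0, 100.0, 50.0]
    print("RMSE (ms):", rmse(actual_ms, predicted_ms))
    print("Correlation:", pearson_correlation(actual_ms, predicted_ms))
```

A lower RMSE means that predicted durations lie closer to the measured ones on average, while a correlation near 1 means that the model tracks the relative lengthening and shortening of segments even if its absolute values are offset.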

Bibliographic reference. Nemala, Sridhar Krishna / Talukdar, Partha Pratim / Bali, Kalika / Ramakrishnan, A. G. (2004): "Duration modeling for Hindi text-to-speech synthesis system", In INTERSPEECH-2004, 789-792.