INTERSPEECH 2004 - ICSLP
This paper reports preliminary results of data-driven modeling of segmental (phoneme) duration for Hindi. Classification and Regression Tree (CART) based data-driven duration modeling for segmental duration prediction is presented. A number of features are considered and their usefulness and relative contribution for segmental duration prediction is assessed. Objective evaluation of the duration model, by root mean squared prediction error (RMSE) and correlation between actual and predicted durations, is performed.
Bibliographic reference. Nemala, Sridhar Krishna / Talukdar, Partha Pratim / Bali, Kalika / Ramakrishnan, A. G. (2004): "Duration modeling for hindi text-to-speech synthesis system", In INTERSPEECH-2004, 789-792.