EUROSPEECH 2003 - INTERSPEECH 2003
8th European Conference on Speech Communication and Technology

Geneva, Switzerland
September 1-4, 2003

        

Estimation of the Parameters of the Quantitative Intonation Model with Continuous Wavelet Analysis

Hans Kruschke, Michael Lenz

Dresden University of Technology, Germany

Intonation generation in state-of-the-art speech synthesis requires the analysis of a large amount of data. Therefore reliable algorithms for the extraction of the parameters of an intonation model from a given F0 contour are required. This contribution proposes improvements concerning the extraction of the parameters of the quantitative intonation model developed by Fujisaki. The improvements are mainly based on the application of the continuous wavelet transform for the detection of accents and phrases in a F0 contour. A detailed explanation of the underlying idea of this approach is given and the implemented algorithm is described. Results prove that with the proposed method a significant improvement in the accuracy of the extracted parameters is achieved. Thereby the structure and the rules of the algorithm are kept relatively simple.

Full Paper

Bibliographic reference.  Kruschke, Hans / Lenz, Michael (2003): "Estimation of the parameters of the quantitative intonation model with continuous wavelet analysis", In EUROSPEECH-2003, 2881-2884.