7th International Conference on Spoken Language Processing

September 16-20, 2002
Denver, Colorado, USA

On F0 Trajectory Optimization for Very High-Quality Speech Manipulation

Hideki Kawahara (1), Parham Zolfaghari (2), Alain de Cheveigné (3)

(1) Wakayama University, Japan; (2) Nagoya University, Japan; (3) CNRS-IRCAM, France

An optimized fundamental frequency (F0) trajectory extraction method, which alleviates systematic F0 glitches at vowel-nasal boundaries and in the vicinity of consonants, is introduced. The proposed method employs minimum phase group delay compensation for apparent F0 modulations due to variations in their corresponding vocal tract transfer functions. This method can also be considered as an implementation of a generalized version of analysis by synthesis. Evaluation using EGG reference signals revealed that the proposed method reduces the systematic biases by 50%.


Full Paper

Bibliographic reference.  Kawahara, Hideki / Zolfaghari, Parham / Cheveigné, Alain de (2002): "On F0 trajectory optimization for very high-quality speech manipulation", In ICSLP-2002, 2397-2400.