7th International Conference on Spoken Language Processing
September 16-20, 2002
This paper introduces an optimized fundamental frequency (F0) trajectory extraction method that alleviates systematic F0 glitches at vowel-nasal boundaries and in the vicinity of consonants. The method applies minimum-phase group delay compensation to remove apparent F0 modulations caused by variations in the vocal tract transfer function. It can also be regarded as an implementation of a generalized form of analysis-by-synthesis. Evaluation against electroglottograph (EGG) reference signals showed that the proposed method reduces the systematic biases by 50%.
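The abstract does not spell out how the group delay is obtained, so the following is only a minimal sketch of one standard way to derive a minimum-phase group delay from a magnitude spectrum, using the real cepstrum (homomorphic) method. The function name `minimum_phase_group_delay`, the one-pole test filter, and the FFT size are illustrative assumptions and are not taken from the paper; how the authors apply the resulting delay to correct the F0 trajectory is not shown here.

```python
import numpy as np


def minimum_phase_group_delay(magnitude):
    """Group delay (in samples) of the minimum-phase system whose magnitude
    response is `magnitude`, sampled on all N FFT bins (N even).

    Hypothetical helper for illustration; not the authors' implementation.
    """
    n_fft = len(magnitude)
    # Real cepstrum of the log-magnitude spectrum.
    log_mag = np.log(np.maximum(magnitude, 1e-12))
    cepstrum = np.fft.ifft(log_mag).real

    # Fold the cepstrum onto positive quefrencies to get the
    # complex cepstrum of the minimum-phase counterpart.
    folded = np.zeros(n_fft)
    folded[0] = cepstrum[0]
    folded[1:n_fft // 2] = 2.0 * cepstrum[1:n_fft // 2]
    folded[n_fft // 2] = cepstrum[n_fft // 2]

    # Group delay is -d(phase)/d(omega), which equals Re{FFT(n * c_min[n])}.
    quefrency = np.arange(n_fft)
    return np.fft.fft(quefrency * folded).real


# Check against a one-pole filter H(z) = 1 / (1 - a z^-1), which is
# minimum phase for |a| < 1 and has a closed-form group delay.
a = 0.5
n_fft = 1024
omega = 2.0 * np.pi * np.arange(n_fft) / n_fft
magnitude = np.abs(1.0 / (1.0 - a * np.exp(-1j * omega)))

tau = minimum_phase_group_delay(magnitude)
expected = (a * np.cos(omega) - a**2) / (1.0 - 2.0 * a * np.cos(omega) + a**2)
max_error = np.max(np.abs(tau - expected))
```

In this sketch the delay is expressed in samples per frequency bin; a frame-by-frame change in the delay near the fundamental would shift the apparent timing of each pitch period, which is the kind of spurious F0 modulation the abstract describes.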
Bibliographic reference. Kawahara, Hideki / Zolfaghari, Parham / Cheveigné, Alain de (2002): "On F0 trajectory optimization for very high-quality speech manipulation", In ICSLP-2002, 2397-2400.