ISCA Archive Interspeech 2005

Nearly defect-free F0 trajectory extraction for expressive speech modifications based on STRAIGHT

Hideki Kawahara, Alain de Cheveigné, Hideki Banno, Toru Takahashi, Toshio Irino

A new method for source information extraction is proposed. Its aim is to provide optimal source information for STRAIGHT, a very high quality speech manipulation system. The method combines time interval and frequency cues, and it provides fundamental frequency (F0) and periodicity information within each frequency band, which allows mixed mode excitation. A preliminary evaluation using a database of simultaneously recorded electroglottograph (EGG) and speech signals yielded very low gross error rates (0.029% for females and 0.14% for males). In addition, the method is designed to minimize the perceptual disturbance caused by errors in source information extraction, including any such gross errors.
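As a rough illustration of how a gross error rate like the one quoted above can be computed, the sketch below compares an estimated F0 track against a reference track (for example, one derived from EGG) frame by frame. The abstract does not state the error criterion, so the 20% relative deviation threshold, the convention that a reference value of 0 marks an unvoiced frame, and the function name `gross_error_rate` are all assumptions made for this example, not the authors' procedure.

```python
def gross_error_rate(f0_est, f0_ref, tolerance=0.2):
    """Percentage of voiced reference frames with a gross F0 error.

    Assumptions (not from the paper): a frame counts as a gross error when
    the estimate deviates from the reference by more than `tolerance`
    (relative), and reference frames with F0 <= 0 are treated as unvoiced
    and skipped.
    """
    if len(f0_est) != len(f0_ref):
        raise ValueError("F0 tracks must have the same number of frames")

    # Keep only frames where the reference says the speech is voiced.
    voiced = [(est, ref) for est, ref in zip(f0_est, f0_ref) if ref > 0]
    if not voiced:
        return 0.0

    # Count frames whose relative deviation exceeds the tolerance.
    gross = sum(1 for est, ref in voiced if abs(est - ref) / ref > tolerance)
    return 100.0 * gross / len(voiced)


# Hypothetical F0 values in Hz; the last reference frame is unvoiced.
reference = [100.0, 100.0, 100.0, 100.0, 0.0]
estimate = [100.0, 105.0, 200.0, 99.0, 50.0]
rate = gross_error_rate(estimate, reference)
print(f"gross error rate: {rate:.1f}%")
```

In this toy input only the octave jump (200 Hz against a 100 Hz reference) exceeds the assumed threshold, so one of the four voiced frames is counted as a gross error.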


doi: 10.21437/Interspeech.2005-335

Cite as: Kawahara, H., Cheveigné, A.d., Banno, H., Takahashi, T., Irino, T. (2005) Nearly defect-free F0 trajectory extraction for expressive speech modifications based on STRAIGHT. Proc. Interspeech 2005, 537-540, doi: 10.21437/Interspeech.2005-335

@inproceedings{kawahara05_interspeech,
  author={Hideki Kawahara and Alain de Cheveigné and Hideki Banno and Toru Takahashi and Toshio Irino},
  title={{Nearly defect-free F0 trajectory extraction for expressive speech modifications based on STRAIGHT}},
  year=2005,
  booktitle={Proc. Interspeech 2005},
  pages={537--540},
  doi={10.21437/Interspeech.2005-335}
}