This paper describes a new automatic pitch-marking method using wavelet transform. This method detects discontinuity in the speech waveform which occurs at the glottal closure instant (GCI). A time domain prosodic modification technique requires an appropriate determination of the synthesis pitch-marks. We evaluated the performance of the newly developed pitchmarking method by using our internal speech databases with an electroglottograph signal. We achieved 96 percent detection accuracy on the performance evaluation. We confirmed that the proposed pitch-marking method is suitable for waveform concatenation-based synthesis through a listening test using pitch modified speech.
Cite as: Sakamoto, M., Saitoh, T. (2000) An automatic pitch-marking method using wavelet transform. Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000), vol. 3, 650-653, doi: 10.21437/ICSLP.2000-619
@inproceedings{sakamoto00_icslp, author={Masaharu Sakamoto and Takashi Saitoh}, title={{An automatic pitch-marking method using wavelet transform}}, year=2000, booktitle={Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000)}, pages={vol. 3, 650-653}, doi={10.21437/ICSLP.2000-619} }