ISCA Archive Interspeech 2009
ISCA Archive Interspeech 2009

Robust F0 estimation based on log-time scale autocorrelation and its application to Mandarin tone recognition

Yusuke Kida, Masaru Sakai, Takashi Masuko, Akinori Kawamura

This paper proposes a novel F0 estimation method in which delta-logF0 is directly estimated based on autocorrelation function (ACF) on a logarithmic time scale. Since peaks of ACFs of periodic signals have a specific pattern on the log-time scale and the period only affects the position of the pattern, delta-logF0 can be estimated directly from the shift of the peaks of the log-time scale ACF (LTACF) without F0 estimation. Then logF0 is estimated from the sum of LTACFs shifted based on delta-logF0. Experimental results show that the proposed method is more robust against noise than the baseline ACF-based method. It is also shown that the proposed method significantly improves the Mandarin tone recognition accuracy.


doi: 10.21437/Interspeech.2009-752

Cite as: Kida, Y., Sakai, M., Masuko, T., Kawamura, A. (2009) Robust F0 estimation based on log-time scale autocorrelation and its application to Mandarin tone recognition. Proc. Interspeech 2009, 2971-2974, doi: 10.21437/Interspeech.2009-752

@inproceedings{kida09_interspeech,
  author={Yusuke Kida and Masaru Sakai and Takashi Masuko and Akinori Kawamura},
  title={{Robust F0 estimation based on log-time scale autocorrelation and its application to Mandarin tone recognition}},
  year=2009,
  booktitle={Proc. Interspeech 2009},
  pages={2971--2974},
  doi={10.21437/Interspeech.2009-752}
}