ISCA Archive Interspeech 2005
ISCA Archive Interspeech 2005

High-resolution noise-robust spectral-based pitch estimation

Marián Képesi, Luis Weruaga

This paper introduces a new spectral representation-based pitch estimation method. Since pitch is never stationary during real conversations, but often undergoes changes because of intonation, the spectral representation is derived from the Short-time Harmonic Chirp Transform. This lets our technique to perform very well in noisy conditions, and to extract pitch values with high confidence, even from segments with strong intonations. The paper discusses a new way of segment-vice pitch extraction and does not deal with continuous pitch tracking, which is a topic of our future work. However, the performance of the proposed method is demonstrated on real recordings and the noise-dependency of its accuracy is numerically analyzed.


doi: 10.21437/Interspeech.2005-172

Cite as: Képesi, M., Weruaga, L. (2005) High-resolution noise-robust spectral-based pitch estimation. Proc. Interspeech 2005, 313-316, doi: 10.21437/Interspeech.2005-172

@inproceedings{kepesi05_interspeech,
  author={Marián Képesi and Luis Weruaga},
  title={{High-resolution noise-robust spectral-based pitch estimation}},
  year=2005,
  booktitle={Proc. Interspeech 2005},
  pages={313--316},
  doi={10.21437/Interspeech.2005-172}
}