Estimation of Fundamental Frequency from Singing Voice Using Harmonics of Impulse-like Excitation Source

Sudarsana Reddy Kadiri, Bayya Yegnanarayana


This paper addresses the problem of estimating fundamental frequency from singing voice. Fundamental frequency estimation is a well-studied topic in the speech research community. Recent studies of fundamental frequency estimation from singing voice using state-of-the-art methods proposed for speech show a significant gap in accuracy for singing voice. This is mainly because pitch varies over a wider range and more rapidly in singing voice than in speech. To overcome this, we propose a method that derives the fundamental frequency from singing voice by exploiting the harmonics of the impulse-like excitation in a sequence of glottal cycles. The proposed method is compared with eight state-of-the-art methods (YIN, SWIPE, YAAPT, RAPT, SRH, SFF_CEP, PEFAC and SHRP) on the LYRICS singing database. The experimental results show that the proposed method estimates fundamental frequency more accurately than many state-of-the-art methods across various singing categories and laryngeal mechanisms.


DOI: 10.21437/Interspeech.2018-2495

Cite as: Kadiri, S.R., Yegnanarayana, B. (2018) Estimation of Fundamental Frequency from Singing Voice Using Harmonics of Impulse-like Excitation Source. Proc. Interspeech 2018, 2319-2323, DOI: 10.21437/Interspeech.2018-2495.


@inproceedings{Kadiri2018,
  author={Sudarsana Reddy Kadiri and Bayya Yegnanarayana},
  title={Estimation of Fundamental Frequency from Singing Voice Using Harmonics of Impulse-like Excitation Source},
  year=2018,
  booktitle={Proc. Interspeech 2018},
  pages={2319--2323},
  doi={10.21437/Interspeech.2018-2495},
  url={http://dx.doi.org/10.21437/Interspeech.2018-2495}
}