INTERSPEECH 2007
8th Annual Conference of the International Speech Communication Association

Antwerp, Belgium
August 27-31, 2007

Combining Rate and Place Information for Robust Pitch Extraction

Martin Heckmann, Frank Joublin, Christian Goerick

Honda Research Institute Europe GmbH, Germany

In this paper we propose an algorithm for the robust extraction of pitch combining both temporal (rate) and pattern matching (place) techniques. Following a transformation into the spectral domain via the application of a Gammatone filter bank the rate information is extracted in each band via the zero crossing distances in that band. Next a comb filter with teeth at the harmonics of the current fundamental frequency hypothesis is set up, reflecting the pattern matching aspect. The signals below the teeth of the comb filter are analyzed upon consistency. This yields an allocation pattern for the filter. The current allocation pattern is compared to prototypical ones allowing the suppression of side peaks at harmonics and sub-harmonics of the true fundamental. A comparison to a state of the art autocorrelation based algorithm is performed showing significantly better results for our algorithm.

Full Paper

Bibliographic reference.  Heckmann, Martin / Joublin, Frank / Goerick, Christian (2007): "Combining rate and place information for robust pitch extraction", In INTERSPEECH-2007, 2765-2768.