This paper describes a robust multi-phase algorithm for marking pitch pulses in speech using both the glottal and the speech signal. In the first phase, the glottal signal is used to estimate the fundamental frequency (f0) contour of the given sentence. In the second phase, pitch-mark candidates are generated from both the glottal and the speech signal. In the third phase, the best sequence of pitch marks is selected from the set of candidates. Finally, this pitch-mark sequence is post-processed. A notable feature of the new method is that every detected pitch mark is assigned a confidence value, so that problematic pitch-mark subsequences can be located. The algorithm was tested and compared with other pitch-mark detection methods.
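The abstract does not say how the best sequence is chosen in the third phase. As an illustration only, the following minimal sketch shows one common way such a selection could be done: dynamic programming over candidate times, rewarding strong candidates and penalising gaps that deviate from the expected pitch period (1/f0). The function name, the per-candidate scores, the single fixed period, the tolerance and the cost function are all assumptions made for this example and are not taken from the paper.

```python
from typing import List, Sequence


def best_pitch_mark_sequence(
    times: Sequence[float],
    scores: Sequence[float],
    period: float,
    tol: float = 0.3,
) -> List[int]:
    """Pick a chain of candidates whose spacing stays close to `period`.

    Hypothetical sketch: `times` are candidate positions in seconds
    (sorted ascending), `scores` are assumed per-candidate strengths,
    `period` is the expected pitch period (1 / f0), and `tol` is the
    largest relative deviation from `period` allowed between neighbours.
    Returns the indices of the selected candidates.
    """
    n = len(times)
    if n == 0:
        return []

    # best[i]: highest total score of any chain ending at candidate i.
    best = list(scores)
    # prev[i]: predecessor of i in that chain, or -1 if i starts the chain.
    prev = [-1] * n

    for i in range(n):
        for j in range(i):
            gap = times[i] - times[j]
            rel_dev = abs(gap - period) / period
            if rel_dev > tol:
                continue  # spacing too far from the expected period
            total = best[j] + scores[i] - rel_dev
            if total > best[i]:
                best[i] = total
                prev[i] = j

    # Backtrack from the chain end with the highest total score.
    end = max(range(n), key=best.__getitem__)
    path = []
    while end != -1:
        path.append(end)
        end = prev[end]
    return path[::-1]


# Candidate at 0.014 s is a weak spurious peak between two true marks.
candidates = [0.000, 0.010, 0.014, 0.020, 0.030]
strengths = [1.0, 0.9, 0.2, 1.0, 0.8]
print(best_pitch_mark_sequence(candidates, strengths, period=0.010))
# prints [0, 1, 3, 4]
```

In a real system the expected period would vary along the f0 contour estimated in the first phase rather than being a single constant, and the accumulated score could plausibly serve as a basis for a per-mark confidence value; both are left out here to keep the sketch short.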
Bibliographic reference. Legát, Milan / Matoušek, Jindřich / Tihelka, Daniel (2007): "A robust multi-phase pitch-mark detection algorithm", In INTERSPEECH-2007, 1641-1644.