9th Annual Conference of the International Speech Communication Association

Brisbane, Australia
September 22-26, 2008

A Hybrid Speech Signal Based Algorithm for Pitch Marking Using Finite State Machines

H. Hussein, M. Wolff, Oliver Jokisch, F. Duckhorn, G. Strecha, Rüdiger Hoffmann

Technische Universität Dresden, Germany

Pitch marking is a major task in speech processing. Thus, an accurate detection of pitch marks (PM) is required. In this paper, we propose a hybrid method for pitch marking that combines outputs of two different speech signal based pitch marking algorithms (PMA). We use a finite state machine (FSM) to represent and combine the pitch marks. The hybrid PMA is implemented in four stages: preprocessing, alignment, selection and postprocessing. In the alignment stage, the preprocessed pitch marks are shifted to a local minimum of the speech signal and the confidence score for every pitch mark is calculated. The confidence scores are used as transition weights for the FSM. The PMA outputs are combined into a single sequence of pitch marks. The more accurate pitch marks with the highest confidence score are chosen in the selection stage. A PM reference database contains 10 minutes speech including manually adjusted PM. The evaluation results indicate that the proposed hybrid method outperforms the single PMAs but also other current state-of-the-art algorithms which have been evaluated on a second reference database containing 44 speakers.

Full Paper

Bibliographic reference.  Hussein, H. / Wolff, M. / Jokisch, Oliver / Duckhorn, F. / Strecha, G. / Hoffmann, Rüdiger (2008): "A hybrid speech signal based algorithm for pitch marking using finite state machines", In INTERSPEECH-2008, 135-138.