ISCA Archive Interspeech 2008

A hybrid speech signal based algorithm for pitch marking using finite state machines

H. Hussein, M. Wolff, Oliver Jokisch, F. Duckhorn, G. Strecha, Rüdiger Hoffmann

Pitch marking is a major task in speech processing, so accurate detection of pitch marks (PM) is required. In this paper, we propose a hybrid method for pitch marking that combines the outputs of two different speech-signal-based pitch marking algorithms (PMA). We use a finite state machine (FSM) to represent and combine the pitch marks. The hybrid PMA is implemented in four stages: preprocessing, alignment, selection and postprocessing. In the alignment stage, the preprocessed pitch marks are shifted to a local minimum of the speech signal and a confidence score is calculated for every pitch mark. The confidence scores are used as transition weights for the FSM. The PMA outputs are combined into a single sequence of pitch marks. In the selection stage, the more accurate pitch marks, those with the highest confidence scores, are chosen. A PM reference database contains 10 minutes of speech with manually adjusted PM. The evaluation results indicate that the proposed hybrid method outperforms not only the single PMAs but also other current state-of-the-art algorithms, which have been evaluated on a second reference database containing 44 speakers.
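
The alignment and selection stages described in the abstract can be illustrated with a rough Python sketch. It is not the authors' implementation. The abstract does not define the confidence score or the FSM topology, so the function names, the `radius` and `merge_distance` parameters, and the score (depth of the local minimum relative to the largest absolute sample) are all assumptions made for illustration. The FSM-based combination is replaced here by a simple greedy merge that keeps the higher-scoring mark among close candidates.

```python
def align_to_local_minimum(signal, marks, radius):
    """Shift each mark to the smallest sample within +/- radius samples."""
    aligned = []
    for m in marks:
        lo = max(0, m - radius)
        hi = min(len(signal), m + radius + 1)
        window = signal[lo:hi]
        aligned.append(lo + window.index(min(window)))
    return aligned


def confidence(signal, mark):
    """Hypothetical score in [0, 1]: depth of the minimum relative to the
    largest absolute sample value. The paper's actual score is not given
    in the abstract."""
    peak = max(abs(s) for s in signal)
    if peak == 0:
        return 0.0
    return max(0.0, -signal[mark] / peak)


def combine(signal, marks_a, marks_b, radius, merge_distance):
    """Greedy stand-in for the FSM combination: align both mark sequences,
    pool them, and keep the higher-scoring mark whenever two candidates
    lie within merge_distance samples of each other."""
    pooled = set(align_to_local_minimum(signal, marks_a, radius))
    pooled.update(align_to_local_minimum(signal, marks_b, radius))
    candidates = sorted((m, confidence(signal, m)) for m in pooled)

    selected = []
    for mark, score in candidates:
        if selected and mark - selected[-1][0] <= merge_distance:
            if score > selected[-1][1]:
                selected[-1] = (mark, score)  # replace the weaker neighbour
        else:
            selected.append((mark, score))
    return [m for m, _ in selected]


if __name__ == "__main__":
    # Toy signal with minima at samples 1, 5 and 9.
    signal = [0, -1, 0, 2, 0, -3, 0, 1, 0, -2, 0]
    print(combine(signal, [2, 6], [1, 9], radius=1, merge_distance=2))
```

In this toy setting the two input sequences disagree by one sample in places; after alignment both snap to the same minima, so the pooled result contains each pitch mark once.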


doi: 10.21437/Interspeech.2008-31

Cite as: Hussein, H., Wolff, M., Jokisch, O., Duckhorn, F., Strecha, G., Hoffmann, R. (2008) A hybrid speech signal based algorithm for pitch marking using finite state machines. Proc. Interspeech 2008, 135-138, doi: 10.21437/Interspeech.2008-31

@inproceedings{hussein08_interspeech,
  author={H. Hussein and M. Wolff and Oliver Jokisch and F. Duckhorn and G. Strecha and Rüdiger Hoffmann},
  title={{A hybrid speech signal based algorithm for pitch marking using finite state machines}},
  year=2008,
  booktitle={Proc. Interspeech 2008},
  pages={135--138},
  doi={10.21437/Interspeech.2008-31}
}