ISCA Archive SpeechProsody 2010
ISCA Archive SpeechProsody 2010

Using prosodic features for predicting phrase boundaries

Caroline Kaufhold, Elmar Nöth

Spoken input of address data in modern GPS units is typically done by filling one information slot after another. To fill-in multiple slots at once, the particular slot information contained in the input utterance has to be extracted. We employ phrase boundaries to separate the speech signal into certain slots. In our evaluation, several types of input utterances differing in the number of slot information and their order are thoroughly examined. For each type, a set of twenty strong prosodic features is trained. By incorporating supporting a-priori features, an Fmeasure value of 93:0% is reached for a typical use case.

Index Terms: prosody, phrase boundary detection, multislot input modality

Cite as: Kaufhold, C., Nöth, E. (2010) Using prosodic features for predicting phrase boundaries. Proc. Speech Prosody 2010, paper 872

  author={Caroline Kaufhold and Elmar Nöth},
  title={{Using prosodic features for predicting phrase boundaries}},
  booktitle={Proc. Speech Prosody 2010},
  pages={paper 872}