ISCA Archive SpeechProsody 2010
ISCA Archive SpeechProsody 2010

Modelling filled pauses prosody to synthesise disfluent speech

Jordi Adell, Antonio Bonafonte, David Escudero-Mancebo

In the present paper we present a new approach to the synthesis of filled pauses since they are as frequent as most frequent words in conversational speech. The problem is tackled from the point of view of disfluent speech synthesis. Based on the synthetic disfluent speech model, we analyse the features that describe filled pauses and propose a model to predict them. The model was implemented and perceptually evaluated with successful results.

Index Terms: speech synthesis, disfluent speech, filled pauses


Cite as: Adell, J., Bonafonte, A., Escudero-Mancebo, D. (2010) Modelling filled pauses prosody to synthesise disfluent speech. Proc. Speech Prosody 2010, paper 624

@inproceedings{adell10_speechprosody,
  author={Jordi Adell and Antonio Bonafonte and David Escudero-Mancebo},
  title={{Modelling filled pauses prosody to synthesise disfluent speech}},
  year=2010,
  booktitle={Proc. Speech Prosody 2010},
  pages={paper 624}
}