ISCA Archive Interspeech 2015

Detecting repetitions in spoken dialogue systems using phonetic distances

José Lopes, Giampiero Salvi, Gabriel Skantze, Alberto Abad, Joakim Gustafson, Fernando Batista, Raveesh Meena, Isabel Trancoso

Repetitions in spoken dialogue systems can be a symptom of problematic communication. Such repetitions are often caused by speech recognition errors, which in turn make it harder to use the output of the speech recognizer to detect them. In this paper, we combine the alignment score obtained using phonetic distances with dialogue-related features to improve repetition detection. To evaluate the proposed method, we compare several alignment techniques, ranging from edit distance to DTW-based distances previously used in spoken term detection tasks. We also compare two different methods of computing the phonetic distance: the first uses the phoneme sequence, and the second uses the distance between the phone posterior vectors. Two different datasets were used in this evaluation: a bus-schedule information system (in English) and a call routing system (in Swedish). The results show that approaches using phoneme distances outperform approaches using Levenshtein distances between ASR outputs for repetition detection.
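
The two families of alignment mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the length normalisation, and the choice of the negative log inner product as the frame-level distance between posterior vectors (a common choice in posteriorgram-based spoken term detection) are all assumptions made for illustration.

```python
import math


def levenshtein(a, b):
    """Edit distance between two sequences, e.g. lists of phoneme labels."""
    prev = list(range(len(b) + 1))
    for i, x in enumerate(a, start=1):
        curr = [i]
        for j, y in enumerate(b, start=1):
            cost = 0 if x == y else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def neg_log_dot(p, q, eps=1e-10):
    """Assumed frame distance: -log of the inner product of two posterior vectors."""
    dot = sum(pi * qi for pi, qi in zip(p, q))
    return -math.log(max(dot, eps))


def dtw_distance(xs, ys, frame_dist=neg_log_dot):
    """DTW alignment cost between two sequences of posterior vectors.

    The accumulated cost is divided by len(xs) + len(ys); this
    normalisation is an assumption, not taken from the paper.
    """
    n, m = len(xs), len(ys)
    if n == 0 or m == 0:
        raise ValueError("both sequences must be non-empty")
    D = [[math.inf] * (m + 1) for _ in range(n + 1)]
    D[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = frame_dist(xs[i - 1], ys[j - 1])
            D[i][j] = d + min(D[i - 1][j], D[i][j - 1], D[i - 1][j - 1])
    return D[n][m] / (n + m)
```

Under these assumptions, a low alignment cost between the current and the previous user turn would be one cue that the turn is a repetition; in the paper this score is combined with dialogue-related features rather than used on its own.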

doi: 10.21437/Interspeech.2015-60

Cite as: Lopes, J., Salvi, G., Skantze, G., Abad, A., Gustafson, J., Batista, F., Meena, R., Trancoso, I. (2015) Detecting repetitions in spoken dialogue systems using phonetic distances. Proc. Interspeech 2015, 1805-1809, doi: 10.21437/Interspeech.2015-60

  author={José Lopes and Giampiero Salvi and Gabriel Skantze and Alberto Abad and Joakim Gustafson and Fernando Batista and Raveesh Meena and Isabel Trancoso},
  title={{Detecting repetitions in spoken dialogue systems using phonetic distances}},
  booktitle={Proc. Interspeech 2015},