INTERSPEECH 2008
9th Annual Conference of the International Speech Communication Association

Brisbane, Australia
September 22-26, 2008

Towards Measuring Continuous Acoustic Feature Convergence in Unconstrained Spoken Dialogues

Spyros Kousidis, David Dorran, Yi Wang, Brian Vaughan, Charlie Cullen, Dermot Campbell, Ciaran McDonnell, Eugene Coyle

Dublin Institute of Technology, Ireland

Acoustic/prosodic feature (a/p) convergence has been known to occur both in dialogues between humans, as well as in humancomputer interactions. Understanding the form and function of convergence is desirable for developing next generation conversational agents, as this will help increase speech recognition performance and naturalness of synthesized speech. Currently, the underlying mechanisms by which continuous and bi-directional convergence occurs are not well understood. In this study, a direct comparison between time-aligned frames shows significant similarity in acoustic feature variation between the two speakers. The method described (TAMA) constitutes a first step towards a quantitative analysis of a/p convergence.

Full Paper

Bibliographic reference.  Kousidis, Spyros / Dorran, David / Wang, Yi / Vaughan, Brian / Cullen, Charlie / Campbell, Dermot / McDonnell, Ciaran / Coyle, Eugene (2008): "Towards measuring continuous acoustic feature convergence in unconstrained spoken dialogues", In INTERSPEECH-2008, 1692-1695.