Current advances in spoken interface design point towards a shift towards more “human-like” interaction, as opposed to the traditional “push-to-talk” approach. However, human dialogue is characterized by synchrony and multi-modality, and these properties are not captured by traditional representation approaches, such as turn succession. This paper proposes an alternative representation schema for recorded (human) dialogues, which employs per frame averages of speaker turn distribution, in order to inform further analyses of temporal features (pauses and overlaps) in terms of inter-speaker accommodation. Preliminary results of such analyses are provided.
Bibliographic reference. Kousidis, Spyros / Dorran, David / McDonnell, Ciaran / Coyle, Eugene (2009): "Towards flexible representations for analysis of accommodation of temporal features in spontaneous dialogue speech", In INTERSPEECH-2009, 2163-2166.