12th Annual Conference of the International Speech Communication Association

Florence, Italy
August 27-31. 2011

Interactional Style Detection for Versatile Dialogue Response Using Prosodic and Semantic Features

Wei-Bin Liang, Chung-Hsien Wu, Chih-Hung Wang, Jhing-Fa Wang

National Cheng Kung University, Taiwan

This work presents an approach to interactional style (IS) detection for versatile responses in spoken dialogue systems (SDSs). Since speakers generally express their intents in different styles, the responses of an SDS should be versatile instead of invariable, planned responses. Moreover, the IS of dialogue turns can be affected by dialogue topics and speakers' emotional states. In this study, three base-level classifiers are employed for preliminary detection, including latent Dirichlet allocation for dialogue topic categorization, support vector machine for prosody-based emotional state identification and maximum entropy for semantic label-based emotional state identification. Finally, an artificial neural network is adopted for IS detection considering the scores estimated from the aforementioned classifiers. To evaluate the proposed approach, an SDS in a chatting domain was constructed for evaluation. The performance of IS detection can achieve 82.67%.

Full Paper

Bibliographic reference.  Liang, Wei-Bin / Wu, Chung-Hsien / Wang, Chih-Hung / Wang, Jhing-Fa (2011): "Interactional style detection for versatile dialogue response using prosodic and semantic features", In INTERSPEECH-2011, 1345-1348.