Acoustic and Visual Analysis of Expressive Speech: A Case Study of French Acted Speech

Slim Ouni, Vincent Colotte, Sara Dahmani, Soumaya Azzi


Within the framework of developing an expressive audiovisual speech synthesis, an acoustic and visual analysis of expressive acted speech is proposed in this paper. Our purpose is to identify the main characteristics of audiovisual expressions that need to be integrated during synthesis to provide believable emotions to the virtual 3D talking head. We conducted a case study of a semi-professional actor who uttered a set of sentences for 6 different emotions in addition to neutral speech. We have recorded concurrently audio and motion capture data. The acoustic and the visual data have been analyzed. The main finding is that although some expressions are not well identified, some expressions were well characterized and tied in both acoustic and visual space.


DOI: 10.21437/Interspeech.2016-730

Cite as

Ouni, S., Colotte, V., Dahmani, S., Azzi, S. (2016) Acoustic and Visual Analysis of Expressive Speech: A Case Study of French Acted Speech. Proc. Interspeech 2016, 580-584.

Bibtex
@inproceedings{Ouni+2016,
author={Slim Ouni and Vincent Colotte and Sara Dahmani and Soumaya Azzi},
title={Acoustic and Visual Analysis of Expressive Speech: A Case Study of French Acted Speech},
year=2016,
booktitle={Interspeech 2016},
doi={10.21437/Interspeech.2016-730},
url={http://dx.doi.org/10.21437/Interspeech.2016-730},
pages={580--584}
}