ISCA Archive SLTU 2012

Modeling the prosody of Vietnamese attitudes for expressive speech synthesis

Dang-Khoa Mac, Eric Castelli, Véronique Aubergé

Attitudes, or social affects, are strongly involved in spoken interaction and are closely tied to the socio-cultural aspects of a language. This paper presents a model of attitudes for expressive speech synthesis in Vietnamese, an under-resourced tonal language. The proposed prosodic model for Vietnamese attitudes is based on the concept of a “rendez-vous” between the linguistic levels and the prosodic functions of an utterance. The model is used to generate the prosody of attitudes in Vietnamese. A perceptual experiment on utterances synthesized with this model shows that the attitudes are well evaluated.

Index Terms: attitude, tone, prosodic modeling, expressive speech synthesis, Vietnamese


Cite as: Mac, D.-K., Castelli, E., Aubergé, V. (2012) Modeling the prosody of Vietnamese attitudes for expressive speech synthesis. Proc. 3rd Workshop on Spoken Language Technologies for Under-Resourced Languages (SLTU 2012), 114-118

@inproceedings{mac12_sltu,
  author={Dang-Khoa Mac and Eric Castelli and Véronique Aubergé},
  title={{Modeling the prosody of Vietnamese attitudes for expressive speech synthesis}},
  year=2012,
  booktitle={Proc. 3rd Workshop on Spoken Language Technologies for Under-Resourced Languages (SLTU 2012)},
  pages={114--118}
}