ISCA Archive Interspeech 2013
ISCA Archive Interspeech 2013

Production and perception of pseudo-V1CV2 outside the vowel triangle: speech illusion effects

Thi Anh Xuan Tran, Viet Son Nguyen, Eric Castelli, René Carré

Vowels are generally described with static articulatory configurations represented by targets in the acoustic space: typically, formant frequencies in the F1-F2 and F2-F3 planes. Plosive consonants can be described in terms of places of articulation, represented by locus or locus equations in an acoustic plane. But how are a given vowel and a given consonant identified, when produced with different acoustic characteristics and in different environments? To which extent do listeners use contextual information? To which extent do they use normalization, and of which kind? These questions lead to studying both vowels and consonants from a dynamic point of view. At this level, what are the respective roles of static targets and dynamics information? Previous studies reveal that synthesized transitions situated on a F1-F2 plane but beyond the values observed in natural speech can be perceived as V1V2: that is, vowel-to-vowel transitions can be characterized simply by the direction and rate of the transitions, even when absolute frequency values are outside of the vowel triangle. The present paper extends the investigation to consonants: it reports new experiments showing that perception of pseudo-V1CV2 can also be obtained with formant transitions situated outside the vowel triangle.


doi: 10.21437/Interspeech.2013-124

Cite as: Tran, T.A.X., Nguyen, V.S., Castelli, E., Carré, R. (2013) Production and perception of pseudo-V1CV2 outside the vowel triangle: speech illusion effects. Proc. Interspeech 2013, 407-411, doi: 10.21437/Interspeech.2013-124

@inproceedings{tran13_interspeech,
  author={Thi Anh Xuan Tran and Viet Son Nguyen and Eric Castelli and René Carré},
  title={{Production and perception of pseudo-V1CV2 outside the vowel triangle: speech illusion effects}},
  year=2013,
  booktitle={Proc. Interspeech 2013},
  pages={407--411},
  doi={10.21437/Interspeech.2013-124}
}