ISCA Archive Interspeech 2005

A timbre space for speech

Hiroko Terasawa, Malcolm Slaney, Jonathan Berger

We describe a perceptual space for timbre, define an objective metric that takes into account perceptual orthogonality, and measure the quality of timbre interpolation. We discuss two timbre representations and measure perceptual judgments. We determine that a timbre space based on Mel-frequency cepstral coefficients (MFCC) is a good model for perceptual timbre space.

doi: 10.21437/Interspeech.2005-285

