Interspeech'2005 - Eurospeech

Lisbon, Portugal
September 4-8, 2005

A Timbre Space for Speech

Hiroko Terasawa, Malcolm Slaney, Jonathan Berger

Stanford University, USA

We describe a perceptual space for timbre, define an objective metric that takes into account perceptual orthogonality and measure the quality of timbre interpolation. We discuss two timbre representations and measure perceptual judgments. We determine that a timbre space based on Mel-frequency cepstral coefficients (MFCC) is a good model for perceptual timbre space.

Full Paper

Bibliographic reference.  Terasawa, Hiroko / Slaney, Malcolm / Berger, Jonathan (2005): "A timbre space for speech", In INTERSPEECH-2005, 1729-1732.