10th Annual Conference of the International Speech Communication Association

Brighton, United Kingdom
September 6-10, 2009

Perceptual Cost Function for Cross-Fading Based Concatenation

Qi Miao, Alexander Kain, Jan P. H. van Santen

Oregon Health & Science University, USA

In earlier research, we applied a linear weighted cross-fading function to ensure smooth concatenation. However, this can cause unnaturally shaped spectral trajectories. We propose context-sensitive cross-fading. To train this system, a perceptually validated cost function is needed, which is the focus of this paper. A corpus was designed to generate a variety of formant trajectory shapes. A perceptual experiment was performed and a multiple linear regression model was applied to predict perceptual quality ratings from various distances between cross-faded and natural trajectories. Results show that perceptual quality could be predicted well from the proposed distance measures.

Full Paper

Bibliographic reference.  Miao, Qi / Kain, Alexander / Santen, Jan P. H. van (2009): "Perceptual cost function for cross-fading based concatenation", In INTERSPEECH-2009, 732-735.