Voice Quality: Functions, Analysis and Synthesis
August 27-29, 2003
We investigate how voice source aperiodicities are represented in the Multi-Band Excitation (MBE) speech model for the compression of narrowband speech. The MBE model is a fixed-frame analysis-synthesis algorithm that combines harmonic and stochastic components to reconstruct speech from estimated model parameters. Pitch cycle perturbations, such as jitter and shimmer, are not captured accurately by the frame-wise constant parameter estimates, which affects the reproduced voice quality. The dependence of MBE-reconstructed voice quality on the voice pitch and the type of perturbation is explored through objective measurements and subjective listening with synthetic and natural speech.
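As a rough illustration of why frame-wise constant parameters lose cycle-level perturbations, the sketch below generates a sequence of pitch cycles with random jitter and shimmer, then replaces each cycle's period and amplitude with the mean over its analysis frame. It is not the MBE analysis or the paper's measurement procedure: the function names, the uniform perturbation model, the 20 ms frame length, and the 200 Hz pitch are all assumptions chosen for illustration. The perturbation measure is the common "local" definition (mean absolute difference between consecutive cycles, relative to the mean).

```python
import random


def perturbed_cycles(f0_hz, n_cycles, jitter_pct, shimmer_pct, seed=0):
    """Return (period_s, amplitude) per cycle with uniform random perturbation.

    Hypothetical toy source model, assumed only for this illustration.
    """
    rng = random.Random(seed)
    t0 = 1.0 / f0_hz
    cycles = []
    for _ in range(n_cycles):
        period = t0 * (1.0 + rng.uniform(-1.0, 1.0) * jitter_pct / 100.0)
        amp = 1.0 + rng.uniform(-1.0, 1.0) * shimmer_pct / 100.0
        cycles.append((period, amp))
    return cycles


def local_perturbation_pct(values):
    """Mean absolute consecutive difference, relative to the mean, in percent."""
    diffs = [abs(a - b) for a, b in zip(values[1:], values[:-1])]
    mean = sum(values) / len(values)
    return 100.0 * (sum(diffs) / len(diffs)) / mean


def framewise_constant(cycles, frame_s):
    """Hold period and amplitude constant within each fixed-length frame.

    Each cycle is assigned to the frame containing its start time and is
    replaced by that frame's mean period and mean amplitude.
    """
    frames = {}
    frame_of_cycle = []
    t = 0.0
    for period, amp in cycles:
        k = int(t / frame_s)
        frames.setdefault(k, []).append((period, amp))
        frame_of_cycle.append(k)
        t += period
    means = {
        k: (sum(p for p, _ in v) / len(v), sum(a for _, a in v) / len(v))
        for k, v in frames.items()
    }
    return [means[k] for k in frame_of_cycle]


# Assumed settings: 200 Hz voice, 2% jitter, 5% shimmer, 20 ms frames.
cycles = perturbed_cycles(200.0, 400, jitter_pct=2.0, shimmer_pct=5.0)
held = framewise_constant(cycles, frame_s=0.020)

jitter_in = local_perturbation_pct([p for p, _ in cycles])
jitter_out = local_perturbation_pct([p for p, _ in held])
shimmer_in = local_perturbation_pct([a for _, a in cycles])
shimmer_out = local_perturbation_pct([a for _, a in held])

print(f"jitter  (local): {jitter_in:.2f}% -> {jitter_out:.2f}%")
print(f"shimmer (local): {shimmer_in:.2f}% -> {shimmer_out:.2f}%")
```

In this toy setup, a higher pitch puts more cycles in each frame, so more of the cycle-to-cycle variation is averaged away; that is one plausible reason the effect would depend on voice pitch, though the sketch says nothing about perceived quality.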
Bibliographic reference. Rao, Preeti / Patwardhan, Pushkar (2003): "On the representation of voice source aperiodicities in the MBE speech coding model", In VOQUAL'03, 121-126.