ISCA Archive Odyssey 2001
ISCA Archive Odyssey 2001

Feature warping for robust speaker verification

Jason Pelecanos, Sridha Sridharan

We propose a novel feature mapping approach that is robust to channel mismatch, additive noise and to some extent, non-linear effects attributed to handset transducers. These adverse effects can distort the short-term distribution of the speech features. Some methods have addressed this issue by conditioning the variance of the distribution, but not to the extent of conforming the speech statistics to a target distribution. The proposed target mapping method warps the distribution of a cepstral feature stream to a standardised distribution over a specified time interval. We evaluate a number of the enhancement methods for speaker verification, and compare them against a Gaussian target mapping implementation. Results indicate improvements of the warping technique over a number of methods such as Cepstral Mean Subtraction (CMS), modulation spectrum processing, and short-term windowed CMS and variance normalisation.

Cite as: Pelecanos, J., Sridharan, S. (2001) Feature warping for robust speaker verification. Proc. The Speaker and Language Recognition Workshop (Odyssey 2001), 213-218

  author={Jason Pelecanos and Sridha Sridharan},
  title={{Feature warping for robust speaker verification}},
  booktitle={Proc. The Speaker and Language Recognition Workshop (Odyssey 2001)},