Sixth International Conference on Spoken Language Processing
(ICSLP 2000)

Beijing, China
October 16-20, 2000

A Model-Based Transformational Approach to Robust Speaker Recognition

Remco Teunen, Ben Shahshahani, Larry Heck

Nuance Communications, Menlo Park, CA, USA

A novel statistical modeling and compensation method for robust speaker recognition is presented. The method specifically addresses the degradation in speaker verification performance due to the mismatch in channels (e.g., telephone handsets) between enrollment and testing sessions. In mismatched conditions, the new approach uses speaker-independent channel transformations to synthesize a speaker model that corresponds to the channel of the testing session. Effectively verification is always performed in matched channel conditions. Results on the 1998 NIST Speaker Recognition Evaluation corpus show that the new approach yields performance that matches the best reported results. Specifically, our approach yields similar improvements (19.9% reduction in EER compared to CMN alone) as the HNORM score-based compensation method, but with a fraction of the training time.

Full Paper

Bibliographic reference.  Teunen, Remco / Shahshahani, Ben / Heck, Larry (2000): "A model-based transformational approach to robust speaker recognition", In ICSLP-2000, vol.2, 495-498.