13th Annual Conference of the International Speech Communication Association

Portland, OR, USA
September 9-13, 2012

Investigating Performance of the Discriminative Methods For Long-Term Speaker Adaptation

Danning Jiang (1), Dimitri Kanevsky (2), Vaibhava Goel (2), Yong Qin (1)

(1) IBM China Research Lab, Beijing, China
(2) IBM Watson Research Center, New York, USA

Many of today's speech recognition applications can benefit from long-term speaker adaptation using speaker logs, and discriminative methods are a promising approach given their previous successes. This paper reports large-vocabulary speech recognition experiments that investigate the performance of feature-space and model-space discriminative adaptation methods for long-term speaker adaptation. The results suggest that although discriminative adaptation does not, on average, yield a large gain over maximum-likelihood (ML) adaptation, a number of test speakers still show significant improvements. Motivated by this observation, we further propose an efficient method to automatically select the speakers who are likely to obtain large improvements from discriminative adaptation. When 35% to 65% of the test population is selected for discriminative adaptation, the relative word error rate (WER) reduction over ML adaptation reaches 4% to 5%, measured on the selected speakers only.
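The closing figure of the abstract is a relative WER reduction measured on a selected subset of speakers. A minimal Python sketch of that arithmetic follows. All speaker records, error counts, and the `selected` flags are made-up illustrative values, not results from the paper, and pooling errors over words (rather than averaging per-speaker WERs) is an assumption. The selection method itself is not described in the abstract, so the flags are simply given as input here.

```python
def pooled_wer(speakers, error_key):
    """Word-weighted WER in percent over a list of speaker records."""
    total_errors = sum(s[error_key] for s in speakers)
    total_words = sum(s["words"] for s in speakers)
    return 100.0 * total_errors / total_words


def relative_wer_reduction(baseline_wer, adapted_wer):
    """Relative WER reduction in percent of the adapted system over the baseline."""
    if baseline_wer <= 0:
        raise ValueError("baseline WER must be positive")
    return 100.0 * (baseline_wer - adapted_wer) / baseline_wer


# Hypothetical per-speaker counts: reference words, errors after ML
# adaptation, errors after discriminative adaptation, and a flag saying
# whether the speaker was picked for discriminative adaptation.
speakers = [
    {"id": "spk1", "words": 1000, "ml_errors": 200, "disc_errors": 180, "selected": True},
    {"id": "spk2", "words": 1000, "ml_errors": 150, "disc_errors": 150, "selected": False},
    {"id": "spk3", "words": 2000, "ml_errors": 300, "disc_errors": 285, "selected": True},
    {"id": "spk4", "words": 1000, "ml_errors": 100, "disc_errors": 102, "selected": False},
]

selected = [s for s in speakers if s["selected"]]
selected_fraction = 100.0 * len(selected) / len(speakers)

# Both WERs are computed on the selected speakers only, as in the abstract.
ml_wer = pooled_wer(selected, "ml_errors")
disc_wer = pooled_wer(selected, "disc_errors")
gain = relative_wer_reduction(ml_wer, disc_wer)

print(f"selected {selected_fraction:.0f}% of speakers")
print(f"ML WER {ml_wer:.2f}%, discriminative WER {disc_wer:.2f}%")
print(f"relative WER reduction {gain:.1f}%")
```

Because the reduction is computed on the selected speakers only, it says nothing about the unselected speakers, who would simply keep their ML-adapted models under this reading.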

Index Terms: discriminative speaker adaptation, CDLT, DLT, DMAP, performance prediction


Bibliographic reference. Jiang, Danning / Kanevsky, Dimitri / Goel, Vaibhava / Qin, Yong (2012): "Investigating performance of the discriminative methods for long-term speaker adaptation", in INTERSPEECH-2012, 1768-1771.