INTERSPEECH 2012
13th Annual Conference of the International Speech Communication Association

Portland, OR, USA
September 9-13, 2012

A Feature Space Transformation Method for Personalization using Generalized i-Vector Clustering

Kaisheng Yao, Yifan Gong, Chaojun Liu

Microsoft Corporation, Redmond, WA, USA

We present a feature space transformation method for personalization. The method generalizes i-vector based clustering to allow parameter tying of sub-loading matrices. It trains i-vector parameters from the utterances of a device, uncovering a low-dimensional space for clustering the variability within that device. We show empirically the impact of the parameters of the generalized i-vector method. In recognition experiments on an internal large-vocabulary voice search system for gaming, the method achieved a significant word error rate reduction of 28% compared to a per-utterance adaptation system.

Index Terms: speech recognition, personalization, adaptation, i-vector

Bibliographic reference. Yao, Kaisheng / Gong, Yifan / Liu, Chaojun (2012): "A feature space transformation method for personalization using generalized i-vector clustering", In INTERSPEECH-2012, 1352-1355.