12th Annual Conference of the International Speech Communication Association

Florence, Italy
August 27-31. 2011

Personalizing Model M for Voice-Search

Geoffrey Zweig (1), Shuangyu Chang (2)

(1) Microsoft Research, USA
(2) Speech at Microsoft, USA

Model M is a recently proposed class based exponential n-gram language model. In this paper, we extend it with personalization features, address the scalability issues present with large data sets, and test its effectiveness on the Bing Mobile voice-search task. We find that Model M by itself reduces both perplexity and word error rate compared with a conventional model, and that the personalization features produce a further significant improvement. The personalization features provide a very large improvement when the history contains a relevant query; thus the overall effect is gated by the number of times a user re-queries a past request.

Full Paper

Bibliographic reference.  Zweig, Geoffrey / Chang, Shuangyu (2011): "Personalizing model M for voice-search", In INTERSPEECH-2011, 609-612.