In this paper we describe our efforts to build a Mandarin Chinese voice search system. We describe our strategies for data collection, language, lexicon and acoustic modeling, as well as issues related to text normalization that are an integral part of building voice search systems. We show excellent performance on typical spoken search queries under a variety of accents and acoustic conditions. The system has been in operation since October 2009 and has received very positive user reviews.
Bibliographic reference. Shan, Jiulong / Wu, Genqing / Hu, Zhihong / Tang, Xiliu / Jansche, Martin / Moreno, Pedro J. (2010): "Search by voice in Mandarin Chinese", In INTERSPEECH-2010, 354-357.