We describe recent progress in SRI's Mandarin speech-to-text system, developed for the 2008 evaluation in the DARPA GALE program. A data-driven lexicon expansion technique and language model adaptation methods contribute to the improvement in recognition performance. Our system yields an 8.3% character error rate on the GALE dev08 test set, and 7.5% after combination with RWTH systems. Compared to our 2007 evaluation system, this is a significant relative improvement of 13%.
Bibliographic reference. Lei, Xin / Wu, Wei / Wang, Wen / Mandal, Arindam / Stolcke, Andreas (2009): "Development of the 2008 SRI Mandarin speech-to-text system for broadcast news and conversation", In INTERSPEECH-2009, 2099-2102.