10th Annual Conference of the International Speech Communication Association

Brighton, United Kingdom
September 6-10, 2009

Development of the 2008 SRI Mandarin Speech-to-Text System for Broadcast News and Conversation

Xin Lei (1), Wei Wu (2), Wen Wang (1), Arindam Mandal (1), Andreas Stolcke (1)

(1) SRI International, USA
(2) University of Washington, USA

We describe recent progress in SRI's Mandarin speech-to-text system, developed for the 2008 evaluation in the DARPA GALE program. A data-driven lexicon expansion technique and language model adaptation methods contribute to the improvement in recognition performance. Our system yields an 8.3% character error rate on the GALE dev08 test set, and 7.5% after combination with RWTH systems. Compared to our 2007 evaluation system, this is a significant relative improvement of 13%.
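For readers unfamiliar with the metrics quoted above, the following is a minimal sketch of how a character error rate and a relative error reduction are commonly computed. It is not the scoring tool used in the evaluation; the function names and the toy strings are hypothetical, and the sketch assumes each element of the input string counts as one character, with no text normalization.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences (substitutions, insertions, deletions)."""
    prev = list(range(len(hyp) + 1))  # distances from the empty reference prefix
    for i, r in enumerate(ref, start=1):
        curr = [i]  # deleting i reference characters reaches the empty hypothesis
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # match or substitution
        prev = curr
    return prev[-1]


def character_error_rate(ref, hyp):
    """Edit distance divided by the number of reference characters."""
    if not ref:
        raise ValueError("reference must not be empty")
    return edit_distance(ref, hyp) / len(ref)


def relative_reduction(baseline, improved):
    """Relative error reduction of `improved` with respect to `baseline`."""
    return (baseline - improved) / baseline


# Toy example (hypothetical strings): one substituted character out of six.
toy_cer = character_error_rate("今天天气很好", "今天天汽很好")

# Using the two error rates quoted in the abstract (8.3% alone, 7.5% combined).
combination_gain = relative_reduction(8.3, 7.5)
```

With the figures from the abstract, `relative_reduction(8.3, 7.5)` comes to roughly 0.096, so system combination accounts for a relative reduction of about 9.6% over the standalone system. The 13% figure in the abstract is a separate comparison against the 2007 system, whose error rate is not given here.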


Bibliographic reference.  Lei, Xin / Wu, Wei / Wang, Wen / Mandal, Arindam / Stolcke, Andreas (2009): "Development of the 2008 SRI Mandarin speech-to-text system for broadcast news and conversation", In INTERSPEECH-2009, 2099-2102.