ISCA Archive Interspeech 2009
ISCA Archive Interspeech 2009

Maximum mutual information estimation via second order cone programming for large vocabulary continuous speech recognition

Dalei Wu, Baojie Li, Hui Jiang

In this paper, we have successfully extended our previous work of convex optimization methods to MMIE-based discriminative training for large vocabulary continuous speech recognition. Specifically, we have re-formulated the MMIE training into a second order cone programming (SOCP) program using some convex relaxation techniques that we have previously proposed. Moreover, the entire SOCP formulation has been developed for word graphs instead of N-best lists to handle large vocabulary tasks. The proposed method has been evaluated in the standard WSJ-5k task and experimental results show that the proposed SOCP method significantly outperforms the conventional EBW method in terms of recognition accuracy as well as convergence behavior. Our experiments also show that the proposed SOCP method is efficient enough to handle some relatively large HMM sets normally used in large vocabulary tasks.


doi: 10.21437/Interspeech.2009-234

Cite as: Wu, D., Li, B., Jiang, H. (2009) Maximum mutual information estimation via second order cone programming for large vocabulary continuous speech recognition. Proc. Interspeech 2009, 672-675, doi: 10.21437/Interspeech.2009-234

@inproceedings{wu09c_interspeech,
  author={Dalei Wu and Baojie Li and Hui Jiang},
  title={{Maximum mutual information estimation via second order cone programming for large vocabulary continuous speech recognition}},
  year=2009,
  booktitle={Proc. Interspeech 2009},
  pages={672--675},
  doi={10.21437/Interspeech.2009-234}
}