13th Annual Conference of the International Speech Communication Association

Portland, OR, USA
September 9-13, 2012

Accelerated Batch Learning of Convex Log-linear Models for LVCSR

Simon Wiesler, Ralf Schlüter, Hermann Ney

Human Language Technology and Pattern Recognition, Computer Science Department, RWTH Aachen University, Aachen, Germany

This paper describes a log-linear modeling framework suitable for large-scale speech recognition tasks. We introduce modifications to our training procedure that are required for extending our previous work on log-linear models to larger tasks. We give a detailed description of the training procedure with a focus on aspects that impact computational efficiency. The performance of our approach is evaluated on the English Quaero corpus, a challenging broadcast conversations task. The log-linear model consistenly outperforms the maximum likelihood baseline system. Comparable performance to a system with minimum-phone-error training is achieved.

Index Terms: acoustic modeling, discriminative models

Full Paper

Bibliographic reference.  Wiesler, Simon / Schlüter, Ralf / Ney, Hermann (2012): "Accelerated batch learning of convex log-linear models for LVCSR", In INTERSPEECH-2012, 1207-1210.