ISCA Archive Interspeech 2009
ISCA Archive Interspeech 2009

Investigations on convex optimization using log-linear HMMs for digit string recognition

Georg Heigold, David Rybach, Ralf Schlüter, Hermann Ney

Discriminative methods are an important technique to refine the acoustic model in speech recognition. Conventional discriminative training is initialized with some baseline model and the parameters are re-estimated in a separate step. This approach has proven to be successful, but it includes many heuristics, approximations, and parameters to be tuned. This tuning involves much engineering and makes it difficult to reproduce and compare experiments. In contrast to the conventional training, convex optimization techniques provide a sound approach to estimate all model parameters from scratch. Such a straight approach hopefully dispense with additional heuristics, e.g. scaling of posteriors. This paper addresses the question how well this concept using log-linear models carries over to practice. Experimental results are reported for a digit string recognition task, which allows for the investigation of this issue without approximations.


doi: 10.21437/Interspeech.2009-79

Cite as: Heigold, G., Rybach, D., Schlüter, R., Ney, H. (2009) Investigations on convex optimization using log-linear HMMs for digit string recognition. Proc. Interspeech 2009, 216-219, doi: 10.21437/Interspeech.2009-79

@inproceedings{heigold09_interspeech,
  author={Georg Heigold and David Rybach and Ralf Schlüter and Hermann Ney},
  title={{Investigations on convex optimization using log-linear HMMs for digit string recognition}},
  year=2009,
  booktitle={Proc. Interspeech 2009},
  pages={216--219},
  doi={10.21437/Interspeech.2009-79}
}