ISCA Archive Interspeech 2009
ISCA Archive Interspeech 2009

Decision tree acoustic models for ASR

Jitendra Ajmera, Masami Akamine

This paper presents a summary of our research progress using decision-tree acoustic models (DTAM) for large vocabulary speech recognition. Various configurations of training DTAMs are proposed and evaluated on wall-street journal (WSJ) task. A number of different acoustic and categorical features have been used for this purpose. Various ways of realizing a forest instead of a single tree have been presented and shown to improve recognition accuracy. Although the performance is not shown to be better than Gaussian mixture models (GMMs), several advantages of DTAMs have been highlighted and exploited. These include compactness, computational simplicity and ability to handle unordered information.


doi: 10.21437/Interspeech.2009-432

Cite as: Ajmera, J., Akamine, M. (2009) Decision tree acoustic models for ASR. Proc. Interspeech 2009, 1403-1406, doi: 10.21437/Interspeech.2009-432

@inproceedings{ajmera09_interspeech,
  author={Jitendra Ajmera and Masami Akamine},
  title={{Decision tree acoustic models for ASR}},
  year=2009,
  booktitle={Proc. Interspeech 2009},
  pages={1403--1406},
  doi={10.21437/Interspeech.2009-432}
}