An acoustic model is a simplified mathematical representation of acoustic-phonetic information. The simplifying assumptions inherent to each model entail that it may only be capable of capturing a certain aspect of the available information. An effective combination of different types of model should therefore permit a combined model that can utilize all the information captured by the individual models. This paper reports some preliminary research in combining certain types of acoustic model for speech recognition. In particular, we designed and implemented a single HMM framework, which combines a segment-based modeling technique with the standard HMM technique. The recognition experiments, based on a speaker-independent E-set database, have shown that the combined model has the potential of producing a significantly higher performance than the individual models considered in isolation.
Cite as: Ming, J., Hanna, P., Stewart, D., Vaseghi, S., Smith, F.J. (1998) Capturing discriminative information using multiple modeling techniques. Proc. 5th International Conference on Spoken Language Processing (ICSLP 1998), paper 0263, doi: 10.21437/ICSLP.1998-188
@inproceedings{ming98_icslp, author={Ji Ming and Philip Hanna and Darryl Stewart and Saeed Vaseghi and F. Jack Smith}, title={{Capturing discriminative information using multiple modeling techniques}}, year=1998, booktitle={Proc. 5th International Conference on Spoken Language Processing (ICSLP 1998)}, pages={paper 0263}, doi={10.21437/ICSLP.1998-188} }