This paper presents and compares two systems for speaker-independent isolated-digit speech recognition, one based on continuous-density HMMs and the other on a Time-Delay Neural Network (TDNN). The HMM system went through several optimization phases, from a system using 10 Mel-frequency cepstrum coefficients (MFCC) plus energy to one using 10 MFCC plus their 10 incremental (delta) parameters, plus energy and delta-energy. The TDNN system was optimized by shifting several input frames between successive activations of the first-layer neurons, rather than the single frame used in other experiments.
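As a rough illustration of the two ideas in the abstract, the sketch below builds a 22-parameter frame (10 MFCC, energy, and their deltas) and lists the window positions a first-layer TDNN unit would see for a one-frame and a multi-frame shift. The abstract does not give the delta formula, the window width, or the shift size, so the first-difference delta, the width of 3 frames, the shift of 2 frames, and the function names `add_deltas` and `window_starts` are all assumptions made for this example, and the input data is dummy.

```python
def add_deltas(frames):
    """Append delta values to each frame (assumed: simple first difference)."""
    out = []
    for t, cur in enumerate(frames):
        prev = frames[t - 1] if t > 0 else cur  # first frame gets zero deltas
        deltas = [c - p for c, p in zip(cur, prev)]
        out.append(list(cur) + deltas)
    return out


def window_starts(num_frames, width, shift):
    """Start indices of the input windows fed to a first-layer unit."""
    return list(range(0, num_frames - width + 1, shift))


# Hypothetical utterance: 20 frames, each with 10 MFCC + 1 energy value.
frames = [[0.1 * t + k for k in range(11)] for t in range(20)]
features = add_deltas(frames)

print(len(features[0]))                      # parameters per frame
print(window_starts(20, width=3, shift=1))   # one-frame shift
print(window_starts(20, width=3, shift=2))   # multi-frame shift (assumed value)
```

With these assumed values, the larger shift roughly halves the number of first-layer activations per utterance, which is the kind of trade-off a frame-shift setting controls.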
Cite as: Ferreiros, J., Castro, A., Pardo, J.M. (1991) Comparison between two different approaches in speaker - independent isolated digit recognition. Proc. 2nd European Conference on Speech Communication and Technology (Eurospeech 1991), 999-1002, doi: 10.21437/Eurospeech.1991-239
@inproceedings{ferreiros91_eurospeech,
  author={J. Ferreiros and A. Castro and J. M. Pardo},
  title={{Comparison between two different approaches in speaker - independent isolated digit recognition}},
  year=1991,
  booktitle={Proc. 2nd European Conference on Speech Communication and Technology (Eurospeech 1991)},
  pages={999--1002},
  doi={10.21437/Eurospeech.1991-239}
}