ISCA Archive ICSLP 2000

High performance connected digit recognition through gender-dependent acoustic modelling and vocal tract length normalisation

Ramalingam Hariharan, Olli Viikki

Large inter-speaker variability is one of the major factors degrading the performance of state-of-the-art speech recognition systems. In recent years, several methods, including gender-dependent acoustic modelling and vocal tract length normalisation, have been developed to reduce this variability. In this paper, we first investigate these two methods individually and propose how they should be implemented in real-world speech recognition systems. Secondly, we show that combining the two techniques further reduces the error rate in a connected digit recognition task in a realistic car noise environment. Experimental results justify the use of the combined approach: the joint system achieved a 44.1% decrease in string error rate compared with the gender-independent baseline system. The results were also better than those obtained when using either technique individually.


doi: 10.21437/ICSLP.2000-402

Cite as: Hariharan, R., Viikki, O. (2000) High performance connected digit recognition through gender-dependent acoustic modelling and vocal tract length normalisation. Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000), vol. 2, 847-850, doi: 10.21437/ICSLP.2000-402

@inproceedings{hariharan00_icslp,
  author={Ramalingam Hariharan and Olli Viikki},
  title={{High performance connected digit recognition through gender-dependent acoustic modelling and vocal tract length normalisation}},
  year=2000,
  booktitle={Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000)},
  volume={2},
  pages={847--850},
  doi={10.21437/ICSLP.2000-402}
}