ISCA Archive Eurospeech 1999
ISCA Archive Eurospeech 1999

Speaker and channel-normalized set of formant parameters for telephone speech recognition

Boris Lobanov, T. Levkovskaya, Igor E. Kheidorov

The speech parameters, most commonly used nowadays, are Cepstral coefficients derived from FFT or LPC Spectrum. An alternative approach that can potentially provide maximum speaker and channel independence is estimation of articulatory based features such as formant frequencies, amplitudes and voicing degree. A present report describes a new method and algorithm of robust estimation of F1(t), F2(t), F3(t), A1(t),A2(t), A3(t), V(t) from telephone speech signal, and also the procedures of their normalization against speaker and channel variability. The results obtained from the experiments confirm the efficiency of the suggested set of formant parameters in a view of speech signal speaker – and channel variability resistance. According to the experiments it gives significant improvement in the recognition performance as compared with cepstral parameters use.


doi: 10.21437/Eurospeech.1999-86

Cite as: Lobanov, B., Levkovskaya, T., Kheidorov, I.E. (1999) Speaker and channel-normalized set of formant parameters for telephone speech recognition. Proc. 6th European Conference on Speech Communication and Technology (Eurospeech 1999), 331-334, doi: 10.21437/Eurospeech.1999-86

@inproceedings{lobanov99_eurospeech,
  author={Boris Lobanov and T. Levkovskaya and Igor E. Kheidorov},
  title={{Speaker and channel-normalized set of formant parameters for telephone speech recognition}},
  year=1999,
  booktitle={Proc. 6th European Conference on Speech Communication and Technology (Eurospeech 1999)},
  pages={331--334},
  doi={10.21437/Eurospeech.1999-86}
}