ISCA Archive Eurospeech 1999

Spoken language identification utilizing fundamental frequency and cepstra

Shuichi Itahashi, Toshikazu Kiuchi, Mikio Yamamoto

This paper describes a combined method of spoken language identification that uses the speech fundamental frequency (F0) and mel-cepstral coefficients. In the first method, the F0 contour is used as prosodic information: its trajectory is approximated by polygonal lines or exponential functions, and the resulting parameters are used for discrimination. The second method is based on an ergodic HMM that uses cepstra as segmental information; the number of HMM states was varied from 4 to 64. The speech data consisted of 40-second spontaneous utterances spoken by 50 male speakers for each of the 10 languages considered in this study. The results show that both proposed methods are effective and that a better identification rate is obtained by combining them.
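As a rough illustration of the first method's idea of summarizing an F0 contour by line-segment parameters, the sketch below fits an independent least-squares line to each of several equal-length segments and returns the slopes and intercepts as features. It is not the authors' procedure: the function names `fit_line` and `piecewise_linear_features`, the equal-length segmentation, and the assumption that the input contains voiced frames only are all hypothetical choices made for this example, and continuity at the segment joins is not enforced.

```python
def fit_line(ys):
    """Least-squares line y = a*t + b over t = 0..len(ys)-1; returns (a, b)."""
    n = len(ys)
    if n == 1:
        # A single point has no slope; treat it as a flat segment.
        return 0.0, float(ys[0])
    t_mean = (n - 1) / 2.0
    y_mean = sum(ys) / n
    sxx = sum((t - t_mean) ** 2 for t in range(n))
    sxy = sum((t - t_mean) * (y - y_mean) for t, y in enumerate(ys))
    a = sxy / sxx
    return a, y_mean - a * t_mean


def piecewise_linear_features(f0, n_segments):
    """Split an F0 contour (Hz per voiced frame) into equal-length segments
    and return one (slope, intercept) pair per segment.

    Assumption for this sketch: segment boundaries are fixed and evenly
    spaced, and each segment is fitted independently of its neighbours.
    """
    if n_segments < 1 or len(f0) < n_segments:
        raise ValueError("need at least one frame per segment")
    bounds = [i * len(f0) // n_segments for i in range(n_segments + 1)]
    return [fit_line(f0[bounds[i]:bounds[i + 1]]) for i in range(n_segments)]


# Toy contour: a rise of 2 Hz per frame followed by a fall of 2 Hz per frame.
contour = [100, 102, 104, 106, 120, 118, 116, 114]
features = piecewise_linear_features(contour, 2)
```

The resulting list of (slope, intercept) pairs could then be fed to any discriminant classifier; which parameters and classifier the paper actually uses is not stated in the abstract.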


doi: 10.21437/Eurospeech.1999-99

Cite as: Itahashi, S., Kiuchi, T., Yamamoto, M. (1999) Spoken language identification utilizing fundamental frequency and cepstra. Proc. 6th European Conference on Speech Communication and Technology (Eurospeech 1999), 383-386, doi: 10.21437/Eurospeech.1999-99

@inproceedings{itahashi99_eurospeech,
  author={Shuichi Itahashi and Toshikazu Kiuchi and Mikio Yamamoto},
  title={{Spoken language identification utilizing fundamental frequency and cepstra}},
  year=1999,
  booktitle={Proc. 6th European Conference on Speech Communication and Technology (Eurospeech 1999)},
  pages={383--386},
  doi={10.21437/Eurospeech.1999-99}
}