EUROSPEECH '95
Fourth European Conference on Speech Communication and Technology

Madrid, Spain
September 18-21, 1995

Estimation of Speech Formant-Dynamics Using Neural Networks

P. Gomez, V. Rodellar, A. Alvarez, J. Bobadilla, J. Bernal, V. Nieto, M. Perez

Departamento de Arquitectura y Tecnologia de Sistemas Informaticos, Universidad Politecnica de Madrid, Madrid, Spain

Throughout the present paper, the possibility of using Neural Networks to produce x-y representations from speech in real time, such in vowel and vowel-like sounds, is theoretically shown and practically documented. A certain kind of Time-Delay Neural Network, is shown to be the most efficient operator to extract formant-dynamic information for these plottings. This opens the possibility for constructing Visual User Interfaces for Language Learning Systems using relatively simple hardware.

Full Paper

Bibliographic reference.  Gomez, P. / Rodellar, V. / Alvarez, A. / Bobadilla, J. / Bernal, J. / Nieto, V. / Perez, M. (1995): "Estimation of speech formant-dynamics using neural networks", In EUROSPEECH-1995, 2221-2224.