ISCA Archive ICSLP 1994
ISCA Archive ICSLP 1994

System of microphone arrays and neural networks for robust speech recognition in multimedia environments

Qiguang Lin, Ea-Ee Jan, Chi Wei Che, Bert de Vries

Hands-free operation of speech processing systems is sometimes desired to avoid encumbrance of the user by tethered microphone equipment. This paper explores the use of array microphones and neural networks (MANN) for robust speech recognition in real-world environments, such as large-group conferencing. Microphone arrays (MA) provide high-quality, hands-free sound pickup under severe acoustical conditions; and neural network (NN) processors "learn" the characteristics of environmental interference and transform features of MA-enhanced signal to those obtained under close-talking conditions. In this study, both realroom collected and computer-simulated reverberant speech signals are used to evaluate the power and advantages of MANN for direct deployment of speech recognition technology in adverse practical environments.


Cite as: Lin, Q., Jan, E.-E., Che, C.W., Vries, B.d. (1994) System of microphone arrays and neural networks for robust speech recognition in multimedia environments. Proc. 3rd International Conference on Spoken Language Processing (ICSLP 1994), 1247-1250

@inproceedings{lin94b_icslp,
  author={Qiguang Lin and Ea-Ee Jan and Chi Wei Che and Bert de Vries},
  title={{System of microphone arrays and neural networks for robust speech recognition in multimedia environments}},
  year=1994,
  booktitle={Proc. 3rd International Conference on Spoken Language Processing (ICSLP 1994)},
  pages={1247--1250}
}