Automatic speech recognition by computers can provide the most natural and efficient method of communication between humans and computers. While in recent years high performance speech recognition systems are beginning to emerge from research institutions, scientists unequivocally agree that the deployment of speech recognition systems into realistic operating environments will require many hours of speech data to help us model the inherent variability in the speech signal. This paper describes the experiences of researchers at MIT in the collection of two large speech databases.
Cite as: Zue, V., Seneff, S., Glass, J. (1989) Speech database development: TIMIT and beyond. Proc. Speech Input/Output Assessment and Speech Databases, Vol.2, 35-40
@inproceedings{zue89_sioa, author={Victor Zue and Stephanie Seneff and James Glass}, title={{Speech database development: TIMIT and beyond}}, year=1989, booktitle={Proc. Speech Input/Output Assessment and Speech Databases}, pages={Vol.2, 35-40} }