Automatic speech recognition of real-life conversational speech is a precondition for building natural, human-centered man-machine interfaces. Although we are able to extract speech utterances from real-life broadcast news audio streams and transcribe them with an overall word accuracy of 83%, we still face the problem of transcribing true conversational speech under real-life (i.e., bad) background conditions. The Switchboard task focuses on the latter problem. This paper summarizes a set of experimental investigations on the Switchboard corpus using the Philips LVCSR system.
Cite as: Beyerlein, P., Aubert, X., Harris, M., Meyer, C., Schramm, H. (2001) Investigations on conversational speech recognition. Proc. 7th European Conference on Speech Communication and Technology (Eurospeech 2001), 499-502, doi: 10.21437/Eurospeech.2001-131
@inproceedings{beyerlein01_eurospeech,
  author={Peter Beyerlein and X. Aubert and M. Harris and C. Meyer and Hauke Schramm},
  title={{Investigations on conversational speech recognition}},
  year=2001,
  booktitle={Proc. 7th European Conference on Speech Communication and Technology (Eurospeech 2001)},
  pages={499--502},
  doi={10.21437/Eurospeech.2001-131}
}