ISCA Archive ICSLP 2000

Large vocabulary continuous speech recognition of read speech over cellular and landline networks

Ashwin Rao, Bob Roth, Venkatesh Nagesha, Don McAllaster, Natalie Liberman, Larry Gillick

We report results of large vocabulary continuous speech recognition (LVCSR) experiments conducted using speech data read over cellular and landline phones. Specifically, we use stereo recordings to compare the speaker-independent and speaker-adapted word error rates (WERs) measured over cellular and landline networks with those measured using a close-talking noise-canceling headset microphone, which serves as a baseline. The test set consists of speech data recorded by 25 speakers, with each speaker providing both test and adaptation data. We use acoustic models trained on relatively high-quality training data and an interpolated trigram language model. We also examine the recognition error rates for bandlimited and coded microphone speech to provide some insight into the relative degradation in WER over telephone networks.


Cite as: Rao, A., Roth, B., Nagesha, V., McAllaster, D., Liberman, N., Gillick, L. (2000) Large vocabulary continuous speech recognition of read speech over cellular and landline networks. Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000), vol. 4, 402-405

@inproceedings{rao00_icslp,
  author={Ashwin Rao and Bob Roth and Venkatesh Nagesha and Don McAllaster and Natalie Liberman and Larry Gillick},
  title={{Large vocabulary continuous speech recognition of read speech over cellular and landline networks}},
  year=2000,
  booktitle={Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000)},
  volume={4},
  pages={402--405}
}