Spoken SQuAD: A Study of Mitigating the Impact of Speech Recognition Errors on Listening Comprehension

Chia-Hsuan Lee, Szu-Lin Wu, Chi-Liang Liu, Hung-yi Lee


Reading comprehension has been widely studied. One of the most representative reading comprehension tasks is Stanford Question Answering Dataset (SQuAD), on which machine is already comparable with human. On the other hand, accessing large collections of multimedia or spoken content is much more difficult and time-consuming than plain text content for humans. It's therefore highly attractive to develop machines which can automatically understand spoken content. In this paper, we propose a new listening comprehension task – Spoken SQuAD. On the new task, we found that speech recognition errors have catastrophic impact on machine comprehension and several approaches are proposed to mitigate the impact.


 DOI: 10.21437/Interspeech.2018-1714

Cite as: Lee, C., Wu, S., Liu, C., Lee, H. (2018) Spoken SQuAD: A Study of Mitigating the Impact of Speech Recognition Errors on Listening Comprehension. Proc. Interspeech 2018, 3459-3463, DOI: 10.21437/Interspeech.2018-1714.


@inproceedings{Lee2018,
  author={Chia-Hsuan Lee and Szu-Lin Wu and Chi-Liang Liu and Hung-yi Lee},
  title={Spoken SQuAD: A Study of Mitigating the Impact of Speech Recognition Errors on Listening Comprehension},
  year=2018,
  booktitle={Proc. Interspeech 2018},
  pages={3459--3463},
  doi={10.21437/Interspeech.2018-1714},
  url={http://dx.doi.org/10.21437/Interspeech.2018-1714}
}