On the Role of Style in Parsing Speech with Neural Models

Trang Tran, Jiahong Yuan, Yang Liu, Mari Ostendorf


The differences between written text and conversational speech are substantial; previous parsers trained on treebanked text have given very poor results on spontaneous speech. For spoken language, the mismatch in style also extends to prosodic cues, though this mismatch is less well understood. This paper re-examines the use of written text in parsing speech in the context of recent advances in neural language processing. We show that neural approaches facilitate using written text to improve parsing of spontaneous speech, and that adding prosody further improves on this state-of-the-art result. Further, we find an asymmetric degradation from read vs. spontaneous mismatch, with spontaneous speech more generally useful for training parsers.


DOI: 10.21437/Interspeech.2019-3122

Cite as: Tran, T., Yuan, J., Liu, Y., Ostendorf, M. (2019) On the Role of Style in Parsing Speech with Neural Models. Proc. Interspeech 2019, 4190-4194, DOI: 10.21437/Interspeech.2019-3122.


@inproceedings{Tran2019,
  author={Trang Tran and Jiahong Yuan and Yang Liu and Mari Ostendorf},
  title={{On the Role of Style in Parsing Speech with Neural Models}},
  year={2019},
  booktitle={Proc. Interspeech 2019},
  pages={4190--4194},
  doi={10.21437/Interspeech.2019-3122},
  url={http://dx.doi.org/10.21437/Interspeech.2019-3122}
}