Task Specific Sentence Embeddings for ASR Error Detection

Sahar Ghannay, Yannick Estève, Nathalie Camelin


This paper presents a study on the modeling of automatic speech recognition errors at the sentence level. We aim in this study to compensate certain phenomena highlighted by the analysis of the outputs generated by the ASR error detection system we previously proposed. We investigated three different approaches, that are based respectively on the use of sentence embeddings dedicated to ASR error detection task, a probabilistic contextual model and a bidirectional long short term memory (BLSTM) architecture. An approach to build task-specific sentence embeddings is proposed and compared to the Doc2vec approach. Experiments are performed on transcriptions generated by the LIUM ASR system applied to the ETAPE corpus. They show that the proposed sentence embeddings dedicated to ASR error detection achieve better results than generic sentence embeddings and that the integration of task-specific sentence embeddings in our system achieves better results than the probabilistic contextual model and BLSTM models.


 DOI: 10.21437/Interspeech.2018-2211

Cite as: Ghannay, S., Estève, Y., Camelin, N. (2018) Task Specific Sentence Embeddings for ASR Error Detection. Proc. Interspeech 2018, 1288-1292, DOI: 10.21437/Interspeech.2018-2211.


@inproceedings{Ghannay2018,
  author={Sahar Ghannay and Yannick Estève and Nathalie Camelin},
  title={Task Specific Sentence Embeddings for ASR Error Detection},
  year=2018,
  booktitle={Proc. Interspeech 2018},
  pages={1288--1292},
  doi={10.21437/Interspeech.2018-2211},
  url={http://dx.doi.org/10.21437/Interspeech.2018-2211}
}