Vagyojaka is an open-source post-editing and annotation tool ¯ for automatic speech recognition (ASR) that aims to reduce the human effort required to correct the ASR results. We adopt a dictionary-based lookup method to highlight the incorrect words in the ASR transcript and give suggestions by generat ing the closest valid words. For curating the speech corpus, we provide a rich list of tagset that captures various spoken audio features. Further, we conducted a user study to evaluate the ef fectiveness of our tool and observed that post-editing requires 1/3 lesser time than editing without using our tool. The user study can be found on our website 1.
Cite as: Kumar, R., Adiga, D., Kothari, M., Dalal, J., Ramakrishnan, G., Jyothi, P. (2022) VAgyojaka: An Annotating and Post-Editing Tool for Automatic Speech Recognition. Proc. Interspeech 2022, 857-858
@inproceedings{kumar22d_interspeech, author={Rishabh Kumar and Devaraja Adiga and Mayank Kothari and Jatin Dalal and Ganesh Ramakrishnan and Preethi Jyothi}, title={{VAgyojaka: An Annotating and Post-Editing Tool for Automatic Speech Recognition}}, year=2022, booktitle={Proc. Interspeech 2022}, pages={857--858} }