ISCA Archive Interspeech 2008
ISCA Archive Interspeech 2008

Efficient handwriting correction of speech recognition errors with template constrained posterior (TCP)

Lijuan Wang, Tao Hu, Peng Liu, Frank K. Soong

More mobile devices are starting to use automatic speech recognition for command or text input. However, correcting recognition errors in a small compact mobile device is usually inconvenient and it may take several finger operations on a small keypad to correct errors. In this paper, we propose a new multimodal input method and a novel confidence measure - template constrained posterior (TCP) to simplify the correction process. The method works by interactively integrating a handwriting recognizer with a speech recognizer. Information obtained in pen-based error marking, like error location, error type, etc., is fed back to the speech recognizer, and speech recognition errors are automatically corrected using the TCP confidence measure. Experimental results on Aurora2, Wall Street Journal, Switchboard, and two Chinese databases show that compared with speech recognition baseline, the proposed method achieves relative error reduction of 64.9%, 43.9%, 26.1%, 39.0%, 31.4%, respectively, after the auto correction.


doi: 10.21437/Interspeech.2008-659

Cite as: Wang, L., Hu, T., Liu, P., Soong, F.K. (2008) Efficient handwriting correction of speech recognition errors with template constrained posterior (TCP). Proc. Interspeech 2008, 2659-2662, doi: 10.21437/Interspeech.2008-659

@inproceedings{wang08n_interspeech,
  author={Lijuan Wang and Tao Hu and Peng Liu and Frank K. Soong},
  title={{Efficient handwriting correction of speech recognition errors with template constrained posterior (TCP)}},
  year=2008,
  booktitle={Proc. Interspeech 2008},
  pages={2659--2662},
  doi={10.21437/Interspeech.2008-659}
}