This paper explores techniques for using pools of untranscribed data to increase the training data available to automatic speech recognition systems. It is well established that current speech recognition technology, especially in Large Vocabulary Conversational Speech Recognition (LVCSR), is largely language independent, and that the dominant factor in performance on a given language is the amount of available training data. The paper addresses this need by presenting ways to use untranscribed acoustic data to increase the training set size and thus improve speech recognition performance.
Cite as: Zavaliagkos, G., Siu, M.-H., Colthurst, T., Billa, J. (1998) Using untranscribed training data to improve performance. Proc. 5th International Conference on Spoken Language Processing (ICSLP 1998), paper 1007, doi: 10.21437/ICSLP.1998-679
@inproceedings{zavaliagkos98_icslp,
  author={George Zavaliagkos and Man-Hung Siu and Thomas Colthurst and Jayadev Billa},
  title={{Using untranscribed training data to improve performance}},
  year=1998,
  booktitle={Proc. 5th International Conference on Spoken Language Processing (ICSLP 1998)},
  pages={paper 1007},
  doi={10.21437/ICSLP.1998-679}
}