Improving Large Vocabulary Urdu Speech Recognition System Using Deep Neural Networks

Muhammad Umar Farooq, Farah Adeeba, Sahar Rauf, Sarmad Hussain


Development of Large Vocabulary Continuous Speech Recognition (LVCSR) system is a cumbersome task, especially for low resource languages. Urdu is the national language and lingua franca of Pakistan, with 100 million speakers worldwide. Due to resource scarcity, limited work has been done in the domain of Urdu speech recognition. In this paper, collection of Urdu speech corpus and development of Urdu speech recognition system is presented. Urdu LVCSR is developed using 300 hours of read speech data with a vocabulary size of 199K words. Microphone speech is recorded from 1671 Urdu and Punjabi speakers in both indoor and outdoor environments. Different acoustic modeling techniques such as Gaussian Mixture Models based Hidden Markov Models (GMM-HMM), Time Delay Neural Networks (TDNN), Long-Short Term Memory (LSTM) and Bidirectional Long-Short Term Memory (BLSTM) networks are investigated. Cross entropy and Lattice Free Maximum Mutual Information (LF-MMI) objective functions are employed during acoustic modeling. In addition, Recurrent Neural Network Language Model (RNNLM) is also being used for re-scoring. Developed speech recognition system has been evaluated on 9.5 hours of collected test data and a minimum Word Error Rate (%WER) of 13.50% is achieved.


 DOI: 10.21437/Interspeech.2019-2629

Cite as: Farooq, M.U., Adeeba, F., Rauf, S., Hussain, S. (2019) Improving Large Vocabulary Urdu Speech Recognition System Using Deep Neural Networks. Proc. Interspeech 2019, 2978-2982, DOI: 10.21437/Interspeech.2019-2629.


@inproceedings{Farooq2019,
  author={Muhammad Umar Farooq and Farah Adeeba and Sahar Rauf and Sarmad Hussain},
  title={{Improving Large Vocabulary Urdu Speech Recognition System Using Deep Neural Networks}},
  year=2019,
  booktitle={Proc. Interspeech 2019},
  pages={2978--2982},
  doi={10.21437/Interspeech.2019-2629},
  url={http://dx.doi.org/10.21437/Interspeech.2019-2629}
}