Speech recognition in Persian (Farsi) has recently been addressed by a few native-speaking researchers, and some approaches to isolated word and phoneme recognition have been reported. A main bottleneck in this research field is the lack of a recognition-specific speech corpus. In this work, a phonetically balanced speech database of Persian has been modified and used in continuous speech recognition. A basic HMM-based continuous speech recognizer has been designed for this language and recognition tests have been performed. Using mixture-Gaussian monophone models, a word recognition rate of about 68% was obtained in no-grammar tests, while word-pair grammar tests increased this rate to an unexpectedly high value of 99.5%. The reason is found to be the low grammar perplexity of the database, which makes it unsuitable for recognition applications. This highlights the need for a Persian speech corpus specifically designed for such tasks.
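To illustrate why a low-perplexity word-pair grammar can push recognition rates so high, the following is a minimal sketch that builds a word-pair grammar from a sentence list and compares its perplexity with the no-grammar case. The sentences are hypothetical English placeholders, not taken from the Persian database, and the function names and the assumption of uniform probabilities over allowed successors are illustrative only, not the procedure used in the paper.

```python
import math
from collections import defaultdict

def build_word_pair_grammar(sentences):
    """Map each word to the set of words allowed to follow it."""
    successors = defaultdict(set)
    for sentence in sentences:
        tokens = ["<s>"] + sentence.split() + ["</s>"]
        for prev, nxt in zip(tokens, tokens[1:]):
            successors[prev].add(nxt)
    return successors

def word_pair_perplexity(successors, sentences):
    """Perplexity assuming each allowed successor is equally likely.

    This is the geometric mean of the branching factor over all
    transitions in the given sentences. The sentences must be covered
    by the grammar (here, the same ones used to build it).
    """
    log_sum = 0.0
    count = 0
    for sentence in sentences:
        tokens = ["<s>"] + sentence.split() + ["</s>"]
        for prev in tokens[:-1]:
            log_sum += math.log(len(successors[prev]))
            count += 1
    return math.exp(log_sum / count)

def no_grammar_branching(sentences):
    """Without a grammar, any vocabulary word may follow any other."""
    return len({word for s in sentences for word in s.split()})

# Hypothetical toy corpus (assumption: stands in for a small, repetitive database).
toy_sentences = [
    "open the door",
    "close the door",
    "open the window",
    "close the window",
]

grammar = build_word_pair_grammar(toy_sentences)
pair_pp = word_pair_perplexity(grammar, toy_sentences)
vocab_size = no_grammar_branching(toy_sentences)
print(f"word-pair perplexity: {pair_pp:.3f}, no-grammar branching: {vocab_size}")
```

In this toy case the word-pair grammar leaves fewer than two choices per word on average, against five without a grammar. A database whose sentences constrain each other this tightly makes the recognizer's task much easier than it would be on unconstrained speech.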
Cite as: Ahadi, S.M. (1999) Recognition of continuous Persian speech using a medium-sized vocabulary speech corpus. Proc. 6th European Conference on Speech Communication and Technology (Eurospeech 1999), 863-866, doi: 10.21437/Eurospeech.1999-210
@inproceedings{ahadi99_eurospeech,
  author={S. M. Ahadi},
  title={{Recognition of continuous Persian speech using a medium-sized vocabulary speech corpus}},
  year=1999,
  booktitle={Proc. 6th European Conference on Speech Communication and Technology (Eurospeech 1999)},
  pages={863--866},
  doi={10.21437/Eurospeech.1999-210}
}