ISCA Archive Eurospeech 1999

Recognition of continuous Persian speech using a medium-sized vocabulary speech corpus

S. M. Ahadi

Speech recognition in Persian (Farsi) has recently been addressed by a few native-speaking researchers, and some approaches to isolated-word and phoneme recognition have been reported. A main bottleneck in this research field is the lack of a recognition-specific speech corpus. In this work, a phonetically balanced speech database of Persian has been modified and used for continuous speech recognition. A basic HMM-based continuous speech recognizer has been designed for this language and recognition tests have been performed. Using mixture-Gaussian monophone models, a word recognition rate of about 68% was obtained in no-grammar tests, while word-pair grammar tests increased this rate to an unexpectedly high value of 99.5%. The reason is found to be the low grammar perplexity of the database, which is therefore not well suited to recognition applications. This underscores the need for a Persian speech corpus specifically designed for such tasks.
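To illustrate why a low-perplexity word-pair grammar can inflate recognition rates, the following is a minimal Python sketch, not the paper's procedure. It assumes a word-pair grammar is simply the set of word successions observed in the corpus, with every allowed successor equally likely, so that perplexity reduces to the geometric mean of the branching factor. The function names, the `<s>`/`</s>` boundary markers, and the toy sentences are all hypothetical.

```python
import math
from collections import defaultdict

# Hypothetical sentence-boundary markers (an assumption of this sketch).
START, END = "<s>", "</s>"


def build_word_pair_grammar(sentences):
    """Map each word to the set of words seen directly after it."""
    successors = defaultdict(set)
    for sentence in sentences:
        tokens = [START] + sentence.split() + [END]
        for prev, nxt in zip(tokens, tokens[1:]):
            successors[prev].add(nxt)
    # Return a plain dict so that unseen words raise KeyError later.
    return dict(successors)


def word_pair_perplexity(successors, sentences):
    """Geometric mean of the branching factor over all transitions.

    Assumes a uniform distribution over the allowed successors of
    each word, and that every sentence is covered by the grammar.
    """
    log_sum = 0.0
    transitions = 0
    for sentence in sentences:
        tokens = [START] + sentence.split() + [END]
        for prev in tokens[:-1]:
            log_sum += math.log(len(successors[prev]))
            transitions += 1
    return math.exp(log_sum / transitions)


# Toy corpus standing in for a real database (illustrative only).
toy_sentences = ["a b c", "a b d", "e b c"]
grammar = build_word_pair_grammar(toy_sentences)
perplexity = word_pair_perplexity(grammar, toy_sentences)
vocabulary_size = len({w for s in toy_sentences for w in s.split()})

print(f"vocabulary size: {vocabulary_size}")
print(f"word-pair perplexity: {perplexity:.2f}")
```

With no grammar, the recognizer must choose among the whole vocabulary at every position. When the word-pair perplexity is far smaller than the vocabulary size, as in this toy corpus, the grammar leaves only a few candidates at each step, which is consistent with the large gap the abstract reports between the no-grammar and word-pair results.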


doi: 10.21437/Eurospeech.1999-210

Cite as: Ahadi, S.M. (1999) Recognition of continuous Persian speech using a medium-sized vocabulary speech corpus. Proc. 6th European Conference on Speech Communication and Technology (Eurospeech 1999), 863-866, doi: 10.21437/Eurospeech.1999-210

@inproceedings{ahadi99_eurospeech,
  author={S. M. Ahadi},
  title={{Recognition of continuous Persian speech using a medium-sized vocabulary speech corpus}},
  year=1999,
  booktitle={Proc. 6th European Conference on Speech Communication and Technology (Eurospeech 1999)},
  pages={863--866},
  doi={10.21437/Eurospeech.1999-210}
}