This paper describes the implementation of a text-to-speech (TTS) system for the Farsi language. The proposed synthesizer concatenates Farsi syllables using TD-PSOLA. The paper focuses on pitch variations in Farsi sentences and presents several novel rules for modeling them. A primary pitch curve is first obtained for each word from the location of its stressed syllable. The final pitch contour is then determined by applying the effects of prosodic grouping and sentence type. Subjective listening tests confirmed the high intelligibility and acceptable naturalness of the synthesized speech.
Cite as: Abutalebi, H.R., Bijankhan, M. (2000) Implementation of a text-to-speech system for Farsi language. Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000), vol. 1, 661-664, doi: 10.21437/ICSLP.2000-164
@inproceedings{abutalebi00_icslp,
  author={Hamid Reza Abutalebi and Mahmood Bijankhan},
  title={{Implementation of a text-to-speech system for Farsi language}},
  year=2000,
  booktitle={Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000)},
  volume={1},
  pages={661--664},
  doi={10.21437/ICSLP.2000-164}
}