ISCA Archive SPECOM 2004
ISCA Archive SPECOM 2004

Phonetic-acoustical problems of personal voice cloning by TTS

Boris M. Lobanov, Lilia I. Tsirulnik

The report describes a recent development of a TTSsystem for Russian based on allophonic natural speech signal elements (about 1500 in all) with the maximal possible imitation of individual male and female voices. In distinction to the biological task of cloning, the target is not a copy of the human being as a whole but of only one of its functions, particularly, that of reading aloud an orthographically unrestricted text preserving thereby the individual acoustic characteristics of a speaker’s voice, as well as his/her phonetic (segmental and prosodic) peculiarities. A successful solution of the task outlined above presupposes that the following two requirements should be unequivocally satisfied: (1) The fullest possible use of a complex of acoustic characteristics carrying information about the individual voice and pronunciation properties of the speaker being imitated; (2) The minimal possible distortions of the elements of concatenation at all stages of their 'production'; (3) The maximal possible accuracy of prosodic modifications in the process of speech synthesis.


Cite as: Lobanov, B.M., Tsirulnik, L.I. (2004) Phonetic-acoustical problems of personal voice cloning by TTS. Proc. 9th Conference on Speech and Computer (SPECOM 2004), 17-21

@inproceedings{lobanov04_specom,
  author={Boris M. Lobanov and Lilia I. Tsirulnik},
  title={{Phonetic-acoustical problems of personal voice cloning by TTS}},
  year=2004,
  booktitle={Proc. 9th Conference on Speech and Computer (SPECOM 2004)},
  pages={17--21}
}