This paper is dedicated to several aspects of creation process of Russian speech databases. The problems of phonetic notation are discussed. The process of selection of text material with expected phonetic characteristics is described. The description of two Russian speech corpuses is given. While doing speech research and/or development of components in Speech Technologies such as text-to-speech or Speech Recognition systems, the researcher needs access to large sets of annotated and labeled speech data. The quality of speech recognition systems based on modern statistical algorithms depends directly on capacity and phonetic portliness of such sets. If the researcher develops so-called engineering approach in speech research he/she needs to study fine structure of speech signal using large amount of labeled speech that contains various speech state-events. Modern approach to building text-to-speech systems based on concatenating of speech fragments demands availability of large speech corpus.
Cite as: Arlazarov, V.L., Bogdanov, D.S., Krivnova, O.F., Podrabinovitch, A.Y. (2004) Creation of Russian speech databases: design, processing, development tools. Proc. 9th Conference on Speech and Computer (SPECOM 2004), 650-656
@inproceedings{arlazarov04_specom, author={Vladimir L. Arlazarov and Dimitri S. Bogdanov and Olga F. Krivnova and Aleksandr Ya. Podrabinovitch}, title={{Creation of Russian speech databases: design, processing, development tools}}, year=2004, booktitle={Proc. 9th Conference on Speech and Computer (SPECOM 2004)}, pages={650--656} }