11th Annual Conference of the International Speech Communication Association

Makuhari, Chiba, Japan
September 26-30, 2010

Russian Infants and Children's Sounds and Speech Corpuses for Language Acquisition Studies

Elena E. Lyakso, Olga V. Frolova, Anna V. Kurazhova, Julia S. Gaikova

St. Petersburg State University, Russia

«INFANTRU» and «CHILDRU» are the first Russian child speech databases. The «INFANTRU» corpus contains longitudinal recordings of vocalizations and speech (n=2967) from 99 children aged 3 to 36 months, organized as long utterance sequences and separate utterances produced in different psychoemotional states of the child. The «CHILDRU» database contains speech recordings (n=28079, 13956 MB) of 150 children aged 4 to 7 years. The speech material covers the following situations: spontaneous speech, answers to questions, reading, reciting poetry or retelling a tale, counting and the alphabet, and play. The speech files are in Windows PCM format, 22050 Hz, 16 bit.
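
As an illustration of the stated file format, the following is a minimal sketch of how a recording could be checked against it using Python's standard `wave` module. The function names `matches_corpus_format` and `duration_seconds` are hypothetical and are not part of the corpora. The abstract does not state the number of channels, so the sketch does not check it.

```python
import wave

# Format stated in the abstract: PCM, 22050 Hz, 16 bit (2 bytes per sample).
EXPECTED_RATE_HZ = 22050
EXPECTED_SAMPLE_WIDTH_BYTES = 2


def matches_corpus_format(path):
    """Return True if the WAV file is uncompressed PCM at 22050 Hz, 16 bit.

    Hypothetical helper: the channel count is not checked because the
    abstract does not specify it.
    """
    with wave.open(path, "rb") as wav:
        return (
            wav.getcomptype() == "NONE"  # "NONE" means uncompressed PCM
            and wav.getframerate() == EXPECTED_RATE_HZ
            and wav.getsampwidth() == EXPECTED_SAMPLE_WIDTH_BYTES
        )


def duration_seconds(path):
    """Return the duration of the WAV file in seconds (hypothetical helper)."""
    with wave.open(path, "rb") as wav:
        return wav.getnframes() / wav.getframerate()
```

A check of this kind could be run over a directory of recordings before acoustic analysis, so that files with a different sampling rate or bit depth are flagged early.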


Bibliographic reference. Lyakso, Elena E. / Frolova, Olga V. / Kurazhova, Anna V. / Gaikova, Julia S. (2010): "Russian infants and children's sounds and speech corpuses for language acquisition studies", in INTERSPEECH-2010, 1878-1881.