Sixth International Conference on Spoken Language Processing
(ICSLP 2000)

Beijing, China
October 16-20, 2000

TeraSpeech’2000 : A 10,000 Speakers Database

Marie-José Caraty, Claude Montacié

LIP6 - Université Pierre et Marie Curie, Paris, France

TeraSpeech is a bilingual database (i.e., English and French) developed in partnership with a French museum, le Musée des Sciences et de l’Industrie in Paris. A demonstration of vocal signature is the support of this data collection. Aiming at the validation of a quality plan, a scenario of the demonstration has been designed, and various protocols have been developed. The quality plan is presented as well as the solutions we found for its validation (i.e., scenario and protocols). The statistics of TeraSpeech are given. Three trends are examined for the perspectives : the validation, the exploitation and the research. Over a single year of the vocal signature exhibition, TeraSpeech’2000 is a collection of more than 30,000 sentences recorded from more than 10,000 visitors. The exposition on acoustics of the museum is planned for ten years. TeraSpeech is expected to be a collection of more than 100,000 speakers recorded over the same sound acquisition channel.


Full Paper

Acoustic Example #1    Acoustic Example #2

Bibliographic reference.  Caraty, Marie-José / Montacié, Claude (2000): "Teraspeech’2000 : a 10,000 speakers database", In ICSLP-2000, vol.3, 973-976.