Interspeech'2005 - Eurospeech

Lisbon, Portugal
September 4-8, 2005

Design and Collection of Czech Lombard Speech Database

Hynek Boril, Petr Pollak

Czech Technical University in Prague, Czech Republic

This paper presents the design, collection, and parameters of the newly proposed Czech Lombard Speech Database (CLSD). The database is intended for the analysis and modeling of the Lombard effect, with the aim of improving the robustness of speech recognition. The CLSD consists of neutral speech and speech produced in various types of simulated noisy backgrounds. In contrast to available databases dealing with the Lombard effect, an extensive set of utterances containing phonetically rich words and sentences was chosen to cover the whole phoneme inventory of the language. For recording Lombard speech, the usual 'noisy headphones' configuration was extended with an operator who judges utterance intelligibility while hearing the same noise mixed with the speaker's voice, whose intensity is lowered according to a selected virtual distance. This scenario motivated speakers to react more strongly to the background noise. The CLSD currently contains recordings from 26 speakers.

Bibliographic reference. Boril, Hynek / Pollak, Petr (2005): "Design and collection of Czech Lombard speech database", in INTERSPEECH-2005, 1577-1580.