WikiSpeech - a content management system for speech databases

Christoph Draxler, Klaus Jänsch

In this paper we describe WikiSpeech, a content management system for the web-based creation of speech databases used in spoken language technology development and basic research. Its main features are full support for the typical recording, annotation and project administration workflow, easy editing of the speech content, and a fully localizable user interface.

To create a new speech database, one only needs to open a new project within WikiSpeech, provide a link to any static project information pages, and upload the prompt material to be presented to the speakers. Recording and annotation are performed via the WWW in a platform-independent manner on any Java-compatible computer.

WikiSpeech has so far been localized into four languages: German, English, Romanian and Russian. It is now used for production recordings at the Bavarian Archive for Speech Signals in Munich, Germany.

doi: 10.21437/Interspeech.2008-457

