EUROSPEECH 2003 - INTERSPEECH 2003
This paper describes a preparation of the first large Czech prosodic database which should be useful both in automatic speech recognition (ASR) and text-to-speech (TTS) synthesis. In the area of ASR we intend to use it for an automatic punctuation annotation, in the area of TTS for building a prosodic module for the Czech high-quality synthesis. The database is based on the Czech Radio&TV Broadcast News Corpus (UWB B02) recorded at the University of West Bohemia. The configuration of the database includes recorded speech, raw and stylized F_0 values, frame level energy values, a word- and phoneme-level time alignment, and a linguistically motivated description of the prosodic data. A technique of prosodic data acquisition and stylization is described. A new tagset for a linguistical annotation of the Czech prosody is proposed and used.
Bibliographic reference. Kolar, Jachym / Romportl, Jan / Psutka, Josef (2003): "The czech speech and prosody database both for ASR and TTS purposes", In EUROSPEECH-2003, 1577-1580.