9th Annual Conference of the International Speech Communication Association

Brisbane, Australia
September 22-26, 2008

Development and Evaluation of Polish Speech Corpus for Unit Selection Speech Synthesis Systems

Grazyna Demenko (1), J. Bachan (1), Bernd Möbius (2), K. Klessa (1), M. Szymański (3), Stefan Grocholewski (3)

(1) Adam Mickiewicz University, Poland; (2) University of Stuttgart, Germany; (3) Poznań University of Technology, Poland

This paper presents the results of a set of experiments assessing the perceived quality of the Polish version of the BOSS unit selection synthesis system. The experiments evaluated how much synthesis quality could be improved by three factors pertaining to corpus structure, coverage, and level of annotation: (i) manual vs. automatic corpus annotation, (ii) coverage of CVC triphones in rich intonational patterns, and (iii) coverage of complex consonant clusters. Results indicate that manual correction of automatic annotations enhances synthesis quality. Increased coverage of CVC sequences and consonant clusters also improved perceived synthesis quality, but the effect was smaller than anticipated.


Bibliographic reference. Demenko, Grazyna / Bachan, J. / Möbius, Bernd / Klessa, K. / Szymański, M. / Grocholewski, Stefan (2008): "Development and evaluation of Polish speech corpus for unit selection speech synthesis systems", in INTERSPEECH-2008, 1650-1653.