Second Workshop on Child, Computer and Interaction (WOCCI 2009)
Cambridge, MA, USA
This contribution describes the robustness evaluation and optimization steps for a speech interface which is suitable for embedded language tutoring with special focus on childrens speech. The baseline algorithms are derived from the pronunciation tutoring system AzAR directed to adult learners of German. The first prototype LiSA (2008) - directed to young children starting at 3 years - is currently evaluated and optimized, mainly addressing following issues: (a) the challenge of ASR-based pronunciation assessment for childrens speech, (b) the handling of noise and reverberation in an embedded application scenario, and (c) the extraction of additional information such as age or gender. The article summarizes evaluation results of the speech recognizer in laboratory and real-world room environment.
Bibliographic reference. Jokisch, Oliver / Hain, Horst-Udo / Petrick, Rico / Hoffmann, Rüdiger (2009): "Robustness optimization of a speech interface for child-directed embedded language tutoring", In WOCCI-2009, 113-116.