10th Annual Conference of the International Speech Communication Association

Brighton, United Kingdom
September 6-10, 2009

Speech-Based and Multimodal Media Center for Different User Groups

Markku Turunen (1), Jaakko Hakulinen (1), Aleksi Melto (1), Juho Hella (1), Juha-Pekka Rajaniemi (1), Erno Mäkinen (1), Jussi Rantala (1), Tomi Heimonen (1), Tuuli Laivo (1), Hannu Soronen (2), Mervi Hansen (2), Pellervo Valkama (1), Toni Miettinen (1), Roope Raisamo (1)

(1) University of Tampere, Finland
(2) Tampere University of Technology, Finland

We present a multimodal media center interface based on speech input, gestures, and haptic feedback. For special user groups, including visually and physically impaired users, the application features a zoomable context + focus GUI tightly combined with speech output and full speech-based control. These features have been developed in cooperation with representatives of the user groups. The system has been evaluated with regular users. Results from a study in which subjective evaluations were collected show that the performance and user experience of speech input were very good, similar to the results from a ten-month public pilot use.

Bibliographic reference. Turunen, Markku / Hakulinen, Jaakko / Melto, Aleksi / Hella, Juho / Rajaniemi, Juha-Pekka / Mäkinen, Erno / Rantala, Jussi / Heimonen, Tomi / Laivo, Tuuli / Soronen, Hannu / Hansen, Mervi / Valkama, Pellervo / Miettinen, Toni / Raisamo, Roope (2009): "Speech-based and multimodal media center for different user groups", in INTERSPEECH-2009, 1439-1442.