Sixth European Conference on Speech Communication and Technology
Spoken language processing has created value in multiple application areas such as document transcription, database entry, and command and control. Recently, scientists have been focusing on a new class of application that promises on-demand access to multimedia information such as radio and broadcast news. In separate research, augmenting traditional graphical interfaces with additional modalities of interaction, such as spoken language, gesture, or eye tracking, promises to enhance human-computer interaction. In this address I discuss the synergy of speech, language, and image processing, introduce a new idea for corpus-based multimedia interfaces, and identify some remaining research challenges.
Bibliographic reference. Maybury, Mark T. (1999): "Multimedia interaction for the new millennium", In EUROSPEECH'99, keynote paper 3.