7th International Conference on Spoken Language Processing
September 16-20, 2002
This paper presents our experiments on continuous audiovisual speech recognition. A number of bimodal systems using feature fusion or fusion within Hidden Markov Models are implemented. Experiments with different fusion techniques and their results are presented. Further the performance levels of the bimodal system and a unimodal speech recognizer under noisy conditions are compared.
Bibliographic reference. Wiggers, Pascal / Wojdel, Jacek C. / Rothkrantz, Leon J.M. (2002): "Medium vocabulary continuous audio-visual speech recognition", In ICSLP-2002, 1921-1924.