INTERSPEECH 2012
13th Annual Conference of the International Speech Communication Association

Portland, OR, USA
September 9-13, 2012

HMM Based Continuous EOG Recognition for Eye-input Speech Interface

Fuming Fang (1), Takahiro Shinozaki (1), Yasuo Horiuchi (1), Shingo Kuroiwa (1), Sadaoki Furui (2), Toshimitsu Musha (3)

(1) Division of Information Sciences, Chiba University, Chiba, Japan
(2) Department of Computer Science, Tokyo Institute of Technology, Tokyo, Japan
(3) Brain Functions Laboratory Inc., Kanagawa, Japan

To provide an efficient means of communication for those who cannot move muscles of the whole body except eyes due to amyotrophic lateral sclerosis (ALS), we are developing a speech synthesis interface that is based on electrooculogram (EOG) input. EOG is an electrical signal that is observed through electrodes attached on the skin around eyes and reflects eye position. A key component of the system is a continuous recognizer for the EOG signal. In this paper, we propose and investigate a hidden Markov model (HMM) based EOG recognizer applying continuous speech recognition techniques. In the experiments, we evaluate the recognition system both in user dependent and independent conditions. It is shown that 96.1% of recognition accuracy is obtained for five classes of eye actions by a user dependent system using six channels. While it is difficult to obtain good performance by a user independent system, it is shown that maximum likelihood linear regression (MLLR) adaptation helps for EOG recognition.

Index Terms: electrooculogram, hidden Markov model, amyotrophic lateral sclerosis, continuous speech recognition, maximum likelihood linear regression

Full Paper

Bibliographic reference.  Fang, Fuming / Shinozaki, Takahiro / Horiuchi, Yasuo / Kuroiwa, Shingo / Furui, Sadaoki / Musha, Toshimitsu (2012): "HMM based continuous EOG recognition for eye-input speech interface", In INTERSPEECH-2012, 735-738.