Sixth International Conference on Spoken Language Processing
(ICSLP 2000)

Beijing, China
October 16-20, 2000

Segmental Optical Phonetics for Human and Machine Speech Processing

Lynne E. Bernstein

Department of Communication Neuroscience, House Ear Institute, Los Angeles, CA, USA

That talkers produce optical as well as acoustic speech signals, and that perceivers process both types of signals has become well known. Although perceptual effects due to audiovisual speech integration have been a focus of research involving the visual speech stimulus, relatively little is known about visual-only speech perception and optical phonetic signals. This knowledge is needed to exploit optical signals for applications such as synthetic artificial talking heads and audiovisual ASR. One important practical concern is the wide variation in performance among individual visual perceivers and talkers. This paper focuses on variation in visual phonetic perception, phoneme distinctiveness and word recognition. The paper also introduces a project linking optical phonetics, speech kinematics, and perception.


Full Paper

Bibliographic reference.  Bernstein, Lynne E. (2000): "Segmental optical phonetics for human and machine speech processing", In ICSLP-2000, vol.3, 43-46.