ISCA Archive Odyssey 2001
ISCA Archive Odyssey 2001

Using lip features for multimodal speaker verification

Xiaozheng Zhang, Charles C. Broun

With the prevalence of the information age, privacy and personalization are forefront in today's society. As such, biometrics is viewed as an essential component of current and evolving technological systems. Consumers demand unobtrusive and non-invasive approaches. In our previous work, we have demonstrated a speaker verification system that meets these criteria. However, there are additional constraints for fielded systems. The required recognition transactions are often performed in adverse environments and across diverse populations, necessitating robust solutions.

We propose a multimodal approach that builds on our current state-of-the-art speaker verification technology. In order to maintain the transparent nature of the speech interface, we focus on optical sensing technology to provide the additional modality - giving us an audio-visual person recognition system. For the audio domain, we use our existing speaker verification system. For the visual domain, we focus on lip motion.

Cite as: Zhang, X., Broun, C.C. (2001) Using lip features for multimodal speaker verification. Proc. The Speaker and Language Recognition Workshop (Odyssey 2001), 231-236

  author={Xiaozheng Zhang and Charles C. Broun},
  title={{Using lip features for multimodal speaker verification}},
  booktitle={Proc. The Speaker and Language Recognition Workshop (Odyssey 2001)},