ISCA Archive SPECOM 2004

LPFAV2: a new multi-modal database for developing speech recognition systems for an assistive technology application

Vitor Pera, Antonio Moura, Diamantino Freitas

In this paper we report on the acquisition and content of a new database intended for developing audio-visual speech recognition systems. The database supports a speaker-dependent, small-vocabulary continuous speech recognition task and was recorded in European Portuguese. Along with the collected multi-modal speech materials, the corresponding orthographic transcriptions and time-alignment files are supplied. The package also includes data on stochastic language models and the generative grammar associated with the collected spoken sentences. The application addressed by this database, voice control of a basic scientific calculator, is designed for a person with a specific motor impairment, namely muscular dystrophy. This specificity is a notable characteristic, given the lack of such data resources for developing assistive systems based on audio-visual speech recognition technology.


Cite as: Pera, V., Moura, A., Freitas, D. (2004) LPFAV2: a new multi-modal database for developing speech recognition systems for an assistive technology application. Proc. 9th Conference on Speech and Computer (SPECOM 2004), 73-76

@inproceedings{pera04_specom,
  author={Vitor Pera and Antonio Moura and Diamantino Freitas},
  title={{LPFAV2: a new multi-modal database for developing speech recognition systems for an assistive technology application}},
  year=2004,
  booktitle={Proc. 9th Conference on Speech and Computer (SPECOM 2004)},
  pages={73--76}
}