ISCA Archive SIOA 1989
ISCA Archive SIOA 1989

Speech database development: design and analysis of the acoustic-phonetic corpus

Lori F. Lamel, Robert H. Kassel, Stephanie Seneff

The need for a comprehensive, standardized speech database is threefold: first, to acquire acoustic-phonetic knowledge for phonetic recognition; second, to provide speech for training recognizers; and third, to provide a common test base for the evaluation of recognizers. There are many factors to consider in corpus design, making it impossible to provide a complete database for all potential users. It is possible, however, to provide an acceptable database that can be extended to meet future needs. After much discussion among several sites, a consensus was reached that the initial acoustic-phonetic corpus should consist of calibration sentences, a set of phonetically compact sentences, and a large number of randomly selected sentences to provide contextual variation. The database design has been a joint effort including MIT, SRI, and TL This paper describes MIT's role in corpus development and analyses of the phonetic coverage of the complete database. We also include a description of the phonetic transcription and alignment procedure.


Cite as: Lamel, L.F., Kassel, R.H., Seneff, S. (1989) Speech database development: design and analysis of the acoustic-phonetic corpus. Proc. Speech Input/Output Assessment and Speech Databases, Vol.2, 161-170

@inproceedings{lamel89b_sioa,
  author={Lori F. Lamel and Robert H. Kassel and Stephanie Seneff},
  title={{Speech database development: design and analysis of the acoustic-phonetic corpus}},
  year=1989,
  booktitle={Proc. Speech Input/Output Assessment and Speech Databases},
  pages={Vol.2, 161-170}
}