8th International Conference on Spoken Language Processing

Jeju Island, Korea
October 4-8, 2004

Towards a New Level of Annotation Detail of Multilingual Speech Corpora

Anja Geumann

University College Dublin, Ireland

This paper highlights the need for corpora annotated on the basis of acoustic information. This information should be coded as features or properties, and it is needed to inform further processing systems, i.e. to provide a basis for a speech recognition system that uses linguistic information. Feature annotation of existing corpora, combined with segmental annotation, can provide powerful training material for speech recognition systems, but it will also pose challenges for the further processing of features into segments and syllables. We present the theoretical preliminaries for the multilingual feature extraction system that we are currently developing.

Bibliographic reference. Geumann, Anja (2004): "Towards a new level of annotation detail of multilingual speech corpora", in INTERSPEECH-2004, 2785-2788.