ISCA Archive Eurospeech 2001
ISCA Archive Eurospeech 2001

Three-dimensional modelling of speech corpora: added value through visualisation

Toomas Altosaar, Matti Karjalainen, Martti Vainio

Collections of annotated spoken language have formed an important basis for the development of speech technology. Their existence has promoted speech analysis research as well as enabled robust synthesis and recognition methods to be developed. However, many complex relationships remain unspecified within a corpus due to a lack of metadata that describes the raw information in sufficient detail as well as the inter-relationships between signals, recording conditions, talkers, etc. A deficit of standards and formats, needed to express complex relationships, has also hindered the potential use and value of available corpora. This paper presents a novel three-dimensional model for exploring temporal as well as atemporal information existing in speech corpora. Examined are the potential benefits that are gained through corpus visualisation during the phases of creation, editing, verification, use, and exploration. The paper suggests that by providing a threedimensional model of speech data, more of the inherent and potential value of a corpus can be utilised.


doi: 10.21437/Eurospeech.2001-380

Cite as: Altosaar, T., Karjalainen, M., Vainio, M. (2001) Three-dimensional modelling of speech corpora: added value through visualisation. Proc. 7th European Conference on Speech Communication and Technology (Eurospeech 2001), 1537-1540, doi: 10.21437/Eurospeech.2001-380

@inproceedings{altosaar01_eurospeech,
  author={Toomas Altosaar and Matti Karjalainen and Martti Vainio},
  title={{Three-dimensional modelling of speech corpora: added value through visualisation}},
  year=2001,
  booktitle={Proc. 7th European Conference on Speech Communication and Technology (Eurospeech 2001)},
  pages={1537--1540},
  doi={10.21437/Eurospeech.2001-380}
}