![]() |
ESCA Tutorial and Research Workshop on
|
![]() |
In current acoustic-phonetic research, there is a need for large databases. There are considerable problems in administering such databases, both to transcribe and segment the speech and to easily access stored material. We have created a speech analysis system to attempt to alleviate these problems. Speech data are stored in sentence sized files. These files are segmented and transcribed semi-automatically given a phonetic transcription of the utterance. This transcription is generated by the text-to-phonetic component of our synthesis system. The same rule structure, similar to the notation used in generative phonology, is used for accessing the data. By a brief rule statement, speech segments meeting the specified contextual conditions can be identified. Durational data can be collected directly during the database search. Spectral analysis programs operating with a variety of spectral representations have also been created that display the result, typically as a mean/SD spectrum or as a contour histogram spectrum.
Bibliographic reference. Carlson, Rolf / Granström, Björn / Nord, Lennart (1989): "The KTH speech database", In SIOA-1989, Vol.2, 75-78.