In current acoustic-phonetic research, there is a need for large databases. There are considerable problems in administering such databases, both to transcribe and segment the speech and to easily access stored material. We have created a speech analysis system to attempt to alleviate these problems. Speech data are stored in sentence sized files. These files are segmented and transcribed semi-automatically given a phonetic transcription of the utterance. This transcription is generated by the text-to-phonetic component of our synthesis system. The same rule structure, similar to the notation used in generative phonology, is used for accessing the data. By a brief rule statement, speech segments meeting the specified contextual conditions can be identified. Durational data can be collected directly during the database search. Spectral analysis programs operating with a variety of spectral representations have also been created that display the result, typically as a mean/SD spectrum or as a contour histogram spectrum.
Cite as: Carlson, R., Granström, B., Nord, L. (1989) The KTH speech database. Proc. Speech Input/Output Assessment and Speech Databases, Vol.2, 75-78
@inproceedings{carlson89b_sioa, author={Rolf Carlson and Björn Granström and Lennart Nord}, title={{The KTH speech database}}, year=1989, booktitle={Proc. Speech Input/Output Assessment and Speech Databases}, pages={Vol.2, 75-78} }