ISCA Archive ICSLP 1994
ISCA Archive ICSLP 1994

Labeler agreement in phonetic labeling of continuous speech

Ronald Cole, Beatrice T. Oshika, Mike Noel, Terri Lander, Mark Fanty

This paper analyzes inter-labeler agreement of label choice and boundary placement for human phonetic transcriptions of continuous telephone speech in different languages. In experiment one, English, German, Mandarin and Spanish are labeled by fluent speakers of the languages. In experiment two, German and Hindi are labeled by linguists who do not speak the languages. Experiment two uses a somewhat finer phonetic transcription set than experiment one. We compare the transcriptions of the utterances in terms of the minimum number of substitutions, insertions and deletions needed to map one transcription to the other. Native speakers agree on the average 67.52% of the time at the finest level of labeling, including diacritics. Non-native linguists agree 34.41% of the time. The implications of the results are discussed for evaluation of phonetic recognition algorithms.


Cite as: Cole, R., Oshika, B.T., Noel, M., Lander, T., Fanty, M. (1994) Labeler agreement in phonetic labeling of continuous speech. Proc. 3rd International Conference on Spoken Language Processing (ICSLP 1994), 2131-2134

@inproceedings{cole94c_icslp,
  author={Ronald Cole and Beatrice T. Oshika and Mike Noel and Terri Lander and Mark Fanty},
  title={{Labeler agreement in phonetic labeling of continuous speech}},
  year=1994,
  booktitle={Proc. 3rd International Conference on Spoken Language Processing (ICSLP 1994)},
  pages={2131--2134}
}