Automatic detection of Brazil’s prosodic tone unit

David Johnson, Okim Kang


This research focuses on the automatic detection of the tone unit, one of the fundamental elements of Brazil’s prosody model. We compared two ways of delimiting tone units: using silent pause duration alone, and using pitch resets and slow pace (or post-boundary lengthening) together with silent pause duration. The corpus used for the comparison consists of academic lectures given by 18 highly proficient speakers in six varieties of English, which represent the inner (American and British), outer (Indian and South African), and expanding (Chinese and Spanish) concentric circles of Kachru’s World Englishes. Performance was compared by computing Pearson’s correlation between the number of tone units in a trained linguist’s transcription of the corpus and the number automatically detected by the computer. The computer detected the tone units from phone sequences identified in the audio files by a large vocabulary continuous speech recognition (LVCSR) program. We found that including pitch resets and slow pace along with silent pause duration in the algorithm improved this correlation from 0.935 to 0.959.
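
As a rough illustration of the pause-only baseline and the evaluation metric described above, the following Python sketch counts tone units by splitting a phone sequence at long silent pauses and computes Pearson’s correlation between two lists of counts. It is not the authors’ implementation: the `(label, start, end)` phone format, the `"sil"` silence label, the 0.2-second pause threshold, and the function names are all assumptions made for this example, and the pitch-reset and slow-pace cues are not modeled.

```python
from math import sqrt


def count_tone_units(phones, min_pause=0.2):
    """Count tone units by splitting at silent pauses of at least min_pause seconds.

    phones: list of (label, start_sec, end_sec) tuples. The "sil" label and the
    0.2 s default threshold are assumptions, not values taken from the paper.
    """
    count = 0
    in_unit = False
    for label, start, end in phones:
        if label == "sil":
            # Only a sufficiently long silence closes the current tone unit.
            if end - start >= min_pause:
                in_unit = False
        elif not in_unit:
            # First speech phone after a boundary opens a new tone unit.
            count += 1
            in_unit = True
    return count


def pearson(xs, ys):
    """Pearson's correlation coefficient between two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sd_x = sqrt(sum((x - mean_x) ** 2 for x in xs))
    sd_y = sqrt(sum((y - mean_y) ** 2 for y in ys))
    return cov / (sd_x * sd_y)


# Hypothetical phone sequence: one long pause (0.4 s) and one short pause (0.05 s).
example_phones = [
    ("dh", 0.00, 0.10),
    ("ax", 0.10, 0.20),
    ("sil", 0.20, 0.60),
    ("k", 0.60, 0.70),
    ("sil", 0.70, 0.75),
    ("ae", 0.75, 0.90),
]
example_count = count_tone_units(example_phones)
```

In an evaluation of the kind the abstract describes, `pearson` would be given one list of tone-unit counts from the linguist’s transcription and one list of automatically detected counts, presumably with one entry per speaker.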


DOI: 10.21437/SpeechProsody.2016-59

Cite as

Johnson, D., Kang, O. (2016) Automatic detection of Brazil’s prosodic tone unit. Proc. Speech Prosody 2016, 287-291.

BibTeX
@inproceedings{Johnson+2016,
author={David Johnson and Okim Kang},
title={Automatic detection of Brazil’s prosodic tone unit},
year=2016,
booktitle={Speech Prosody 2016},
doi={10.21437/SpeechProsody.2016-59},
url={http://dx.doi.org/10.21437/SpeechProsody.2016-59},
pages={287--291}
}