Interspeech'2005 - Eurospeech
This paper reports on the evaluation of automatic prosodic phrase break assignment. We utilize two tree-structured predictors, the commonly used CART and a C4.5, to predict break placement from sequences of easily to extract shallow textual features. We are experimenting with two 500-utterance prosodic corpora developed by two Greek universities that originate from different domains in order to focus on the differences in prediction between generic and limited domain datasets. The evaluation shows that while the limited dataset achieves better accuracy than the generic one in the CART case, this difference is lowered with the introduction of C4.5. Minor breaks proved to be the most difficult class to predict in CART case, while we achieved a 50% improvement with C4.5.
Bibliographic reference. Xydas, Gerasimos / Zervas, Panagiotis / Kouroupetroglou, Georgios / Fakotakis, Nikolaos / Kokkinakis, George (2005): "Tree-based prediction of prosodic phrase breaks on top of shallow textual features", In INTERSPEECH-2005, 3237-3240.