ISCA Archive SSW 2010

Automatic prosodic labeling of accent information for Japanese spoken sentences

Asami Yamamoto, Kazuhiro Suzuki, Kook Cho, Yoichi Yamashita

This paper describes a method for automatically labeling prosodic information in Japanese spoken sentences, focusing on accent types and accent phrase boundaries. Both are predicted with Conditional Random Fields (CRFs) using linguistic information and F0 contour information. For accent type prediction, we propose a method that uses a provisional accent type derived from linguistic information and accentuation rules. The actual accent type is then predicted from F0 information and linguistic information, which includes the provisional accent type as one of its features, under the condition that the speech content and accent phrase boundaries are given. Evaluation experiments show that introducing accentuation rules improves the accuracy of accent type prediction by 6.1%, and that the prediction rate is 59.6% for spontaneous Japanese speech data. For accent phrase boundary prediction, we propose a method that uses linguistic and prosodic probability models under the condition that the speech content and word labels are given. The prediction accuracy for accent phrase boundaries is 76.5%.
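As a rough illustration of the accent type setup described above, the sketch below builds one feature dictionary per accent phrase, with the provisional accent type included as an ordinary feature alongside linguistic and F0-derived features. It is not the authors' implementation: the field names (`pos`, `n_morae`, `provisional_accent`, `log_f0`), the coarse F0 movement feature, its 0.05 threshold, and the neighbor-context features are all assumptions made for illustration.

```python
from statistics import mean


def f0_movement(log_f0, threshold=0.05):
    """Coarse F0 movement within a phrase: 'rise', 'fall', or 'flat'.

    Hypothetical feature: compares the mean log-F0 of the second half
    of the phrase with that of the first half.
    """
    if len(log_f0) < 2:
        return "flat"
    half = len(log_f0) // 2
    diff = mean(log_f0[half:]) - mean(log_f0[:half])
    if diff > threshold:
        return "rise"
    if diff < -threshold:
        return "fall"
    return "flat"


def phrase_features(phrase):
    """Features for one accent phrase (all field names are assumed)."""
    return {
        "pos": phrase["pos"],
        "n_morae": str(phrase["n_morae"]),
        # Provisional accent type, e.g. from accentuation rules.
        "provisional_accent": str(phrase["provisional_accent"]),
        "f0_move": f0_movement(phrase["log_f0"]),
    }


def sentence_features(phrases):
    """Feature sequence for a sentence, with neighbor context added."""
    feats = [phrase_features(p) for p in phrases]
    rows = []
    for i, cur in enumerate(feats):
        row = dict(cur)
        row["prev_provisional_accent"] = (
            feats[i - 1]["provisional_accent"] if i > 0 else "BOS"
        )
        row["next_provisional_accent"] = (
            feats[i + 1]["provisional_accent"] if i < len(feats) - 1 else "EOS"
        )
        rows.append(row)
    return rows


# Toy input: two accent phrases with made-up values.
example = [
    {"pos": "noun", "n_morae": 4, "provisional_accent": 0,
     "log_f0": [5.0, 5.1, 5.2, 5.3]},
    {"pos": "verb", "n_morae": 3, "provisional_accent": 1,
     "log_f0": [5.3, 5.2, 5.0, 4.9]},
]
features = sentence_features(example)
```

A sequence of dictionaries like `features`, paired with the actual accent type labels, is the kind of input that a CRF toolkit accepting per-item feature dictionaries could be trained on.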

Index Terms: Prosodic labeling, Accent type, Accent Phrase Boundary, F0 pattern, Conditional Random Fields, Accentuation rule


Cite as: Yamamoto, A., Suzuki, K., Cho, K., Yamashita, Y. (2010) Automatic prosodic labeling of accent information for Japanese spoken sentences. Proc. 7th ISCA Workshop on Speech Synthesis (SSW 7), 300-305

@inproceedings{yamamoto10_ssw,
  author={Asami Yamamoto and Kazuhiro Suzuki and Kook Cho and Yoichi Yamashita},
  title={{Automatic prosodic labeling of accent information for Japanese spoken sentences}},
  year=2010,
  booktitle={Proc. 7th ISCA Workshop on Speech Synthesis (SSW 7)},
  pages={300--305}
}