EUROSPEECH 2003 - INTERSPEECH 2003
8th European Conference on Speech Communication and Technology

Geneva, Switzerland
September 1-4, 2003

        

Use of Linguistic Information for Automatic Extraction of F_0 Contour Generation Process Model Parameters

Keikichi Hirose, Yusuke Furuyama, Shuichi Narusawa, Nobuaki Minematsu, Hiroya Fujisaki

University of Tokyo, Japan

A method was developed to utilize linguistic information (lexical accent types and syntactic boundaries) to improve the performance of the automatic extraction of the F_0 contour generation process model commands. The extraction scheme is first to smooth the observed F_0 contour by a piecewise 3rd order polynomial function and to locate accent command positions by taking the derivative of the function. If the results of automatic extraction differ from those estimated from the linguistic information, they are modified according to the several rules. The results showed that some errors could be corrected by the use of linguistic information, especially when the initial word of an accent phrase is type 0 (flat) accent. As a whole, the correct extraction rate (recall rate) was increased from 79.8% to 82.3% for phrase commands and from 81.6% to 85.9% for accent commands.

Full Paper

Bibliographic reference.  Hirose, Keikichi / Furuyama, Yusuke / Narusawa, Shuichi / Minematsu, Nobuaki / Fujisaki, Hiroya (2003): "Use of linguistic information for automatic extraction of f_0 contour generation process model parameters", In EUROSPEECH-2003, 141-144.