5th European Conference on Speech Communication and Technology

Rhodes, Greece
September 22-25, 1997

On Representation of Fundamental Frequency of Speech for Prosody Analysis Using Reliability Function

Mitsuru Nakai, Hiroshi Shimodaira

Japan Advanced Institute of Science and Technology, Asahidai, Tatsunokuchi, Nomi, Ishikawa, Japan

This paper presents a method that provides a new prosodic feature, called the 'F0 reliability field', based on a reliability function of the fundamental frequency (F0). The proposed method does not employ any correction process for the F0 estimation errors that occur during automatic F0 extraction. When this feature is applied as a score function in prosodic analyses such as prosodic structure estimation or superpositional modeling of prosodic commands, this prosodic information could be acquired with higher accuracy. The feature has been applied to the 'F0 template matching method', which detects accent phrase boundaries in Japanese continuous speech. The experimental results show that, compared with the conventional F0 contour, the proposed feature overcomes the harmful influence of F0 errors.

