ISCA Archive Interspeech 2008
ISCA Archive Interspeech 2008

Adaptive filter based prosody modification approach

Qingcai Chen, Shusen Zhou, Dandan Wang, Xiaohong Yang

This paper proposes an adaptive filter based prosody modification approach. It consists of three components, i.e., a frequency estimator, a pitch tracker and a prosody modifier. Firstly, the frequency estimator is applied to segment the speech signal into sentences and the maximum frequency level of each sentence is estimated. Then, the adaptive filter that is adapted according to the maximum frequency level of each sentence is constructed. Its output is applied to estimate the pitch of speech. The prosody of the speech, which includes its pitch and duration, is then modified according to estimated pitch and the given pitch modification target. Be compared with existing techniques, the advantage of the proposed approach is its capability of adapting with the frequency of given speech. This advantage makes the approach especially suitable for the large range pitch modification. Finally, the performance of the proposed approach is evaluated and is compared with three existing prosody modification approach and the result is quit encouraged.


doi: 10.21437/Interspeech.2008-242

Cite as: Chen, Q., Zhou, S., Wang, D., Yang, X. (2008) Adaptive filter based prosody modification approach. Proc. Interspeech 2008, 789-792, doi: 10.21437/Interspeech.2008-242

@inproceedings{chen08_interspeech,
  author={Qingcai Chen and Shusen Zhou and Dandan Wang and Xiaohong Yang},
  title={{Adaptive filter based prosody modification approach}},
  year=2008,
  booktitle={Proc. Interspeech 2008},
  pages={789--792},
  doi={10.21437/Interspeech.2008-242}
}