This paper presents a class of methods for automatically extracting formant parameters from speech. The methods rely on an iterative optimization algorithm. Formant parameter data derived with these methods was found to be less prone to discontinuity errors than data derived with conventional methods. Experiments also demonstrated that these methods are capable of better accuracy in formant estimation than LPC, especially for the first formant. In some cases, an analytic (non-iterative) solution has been derived, making real-time applications feasible. The main target application that we have been pursuing is text-to-speech (TTS) conversion. These methods are being used to automatically analyze a concatenation database, without the need for a tuning phase to fix errors. In addition, they are instrumental in realizing high-quality pitch tracking and pitch epoch marking.
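For context on the LPC baseline named in the abstract, the following is a minimal sketch of conventional LPC formant estimation: autocorrelation method, Levinson-Durbin recursion, then peak picking on the all-pole spectral envelope. It is not the method proposed in the paper, whose details the abstract does not give. All function names, the grid size, and the synthetic two-resonance test signal are illustrative assumptions.

```python
import cmath
import math


def autocorrelation(frame, max_lag):
    """Return the unnormalized autocorrelation r[0..max_lag] of a frame."""
    n = len(frame)
    return [sum(frame[i] * frame[i + k] for i in range(n - k))
            for k in range(max_lag + 1)]


def levinson_durbin(r, order):
    """Solve the LPC normal equations; returns [1, a1, ..., ap] of A(z)."""
    a = [1.0] + [0.0] * order
    err = r[0]
    for i in range(1, order + 1):
        acc = r[i] + sum(a[j] * r[i - j] for j in range(1, i))
        k = -acc / err                      # reflection coefficient
        prev = a[:]
        for j in range(1, i):
            a[j] = prev[j] + k * prev[i - j]
        a[i] = k
        err *= (1.0 - k * k)                # updated prediction error
    return a


def lpc_formants(frame, fs, order, n_grid=1024):
    """Estimate formant frequencies (Hz) as peaks of the envelope 1/|A|."""
    a = levinson_durbin(autocorrelation(frame, order), order)
    mag = []
    for m in range(n_grid + 1):
        w = math.pi * m / n_grid            # 0 .. pi maps to 0 .. fs/2
        val = sum(a[j] * cmath.exp(-1j * w * j) for j in range(order + 1))
        mag.append(abs(val))
    # A local minimum of |A| is a local maximum of the all-pole envelope.
    peaks = [m for m in range(1, n_grid)
             if mag[m] < mag[m - 1] and mag[m] <= mag[m + 1]]
    return [0.5 * fs * m / n_grid for m in peaks]


def synth_resonances(formants, bandwidths, fs, n):
    """Hypothetical test signal: impulse response of cascaded resonators."""
    a = [1.0]
    for f, b in zip(formants, bandwidths):
        rad = math.exp(-math.pi * b / fs)   # pole radius from bandwidth
        sec = [1.0, -2.0 * rad * math.cos(2.0 * math.pi * f / fs), rad * rad]
        new = [0.0] * (len(a) + 2)
        for i, ai in enumerate(a):          # polynomial multiplication
            for j, sj in enumerate(sec):
                new[i + j] += ai * sj
        a = new
    x = []
    for t in range(n):
        y = 1.0 if t == 0 else 0.0
        for j in range(1, len(a)):
            if t - j >= 0:
                y -= a[j] * x[t - j]
        x.append(y)
    return x


# Usage: resonances placed at 500 Hz and 1500 Hz, sampled at 8 kHz.
signal = synth_resonances([500.0, 1500.0], [80.0, 100.0], 8000, 800)
print(lpc_formants(signal, 8000, 4))
```

On real speech, a frame would normally be pre-emphasized and windowed first, and a higher model order would be used. Peak picking of this kind can merge or drop closely spaced formants, which is one source of the frame-to-frame discontinuities that conventional formant trackers must repair.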
Cite as: Pearson, S. (1998) A novel method of formant analysis and glottal inverse filtering. Proc. 5th International Conference on Spoken Language Processing (ICSLP 1998), paper 0647, doi: 10.21437/ICSLP.1998-543
@inproceedings{pearson98b_icslp,
  author={Steve Pearson},
  title={{A novel method of formant analysis and glottal inverse filtering}},
  year=1998,
  booktitle={Proc. 5th International Conference on Spoken Language Processing (ICSLP 1998)},
  pages={paper 0647},
  doi={10.21437/ICSLP.1998-543}
}