This paper describes a new method for estimating formant frequencies. It operates in two phases. The first phase, which is similar to a technique developed by Talkin [JASA, vol. 82, S1], finds optimal formant track estimates by imposing frequency continuity constraints using Dynamic Programming (DP). DP is used to select a mapping of candidate frequencies to formant frequencies in oral sonorant regions based on the minimum cost from all possible mappings. The second phase performs a series of postprocessing steps to make formant estimates more robust and accurate and extends the formant estimates into nasal and obstruent regions. Performance statistics comparing the formants obtained with this technique with a set of reference formants using 34 sentences randomly selected from the TIMIT database shows our algorithm gives excellent results when the formants are among the candidate frequencies.
Cite as: Xia, K., Espy-Wilson, C. (2000) A new strategy of formant tracking based on dynamic programming. Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000), vol. 3, 55-58, doi: 10.21437/ICSLP.2000-476
@inproceedings{xia00_icslp, author={Kun Xia and Carol Espy-Wilson}, title={{A new strategy of formant tracking based on dynamic programming}}, year=2000, booktitle={Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000)}, pages={vol. 3, 55-58}, doi={10.21437/ICSLP.2000-476} }