This paper deals with the application of information extracted from AM and FM time-frequency representations of speech to the task of determining speech quality. The representations are introduced and then the procedure for data extraction is outlined. The experimental setup for the assessment of objective quality covers distortions typically found in speech communication systems. To determine how well these quality measures perform, regression analysis is used to evaluate how well they estimate the results of subjective testing. Considering each class of distortions individually the objective measures demonstrate good performance, however, this level does not seem to hold as well in the aggregate case. This leads to suggestions as to where possible improvements can be made to the procedure.
Cite as: Timoney, J., Foley, J.B. (2000) Speech quality evaluation based on AM-FM time-frequency representations. Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000), vol. 4, 472-475, doi: 10.21437/ICSLP.2000-851
@inproceedings{timoney00_icslp, author={Joe Timoney and J. Brian Foley}, title={{Speech quality evaluation based on AM-FM time-frequency representations}}, year=2000, booktitle={Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000)}, pages={vol. 4, 472-475}, doi={10.21437/ICSLP.2000-851} }