ISCA Archive ICSLP 1998
ISCA Archive ICSLP 1998

Comparison of spectral estimation techniques for low bit-rate speech coding

D. J. Molyneux, C. I. Parris, X. Q. Sun, B. M. G. Cheetham

Many low bit-rate speech coders represent the spectral envelope by an all-pole digital filter whose coefficients are calculated by a form of linear prediction (LP) analysis. The lower the bit-rate, the more critical will be the accuracy of the spectral analysis for achieving good quality speech. This paper compares four known techniques: a technique based on cubic spline interpolation, DAP, MVDR, and iterative all-pole modelling. First, the accuracy obtained for artificial and real speech spectra is assessed for each technique by calculating the degree of spectral distortion with reference to the spectral envelope sampled at the pitch-harmonics. Then, each technique is used to characterise the spectral amplitudes generated by a 2.4 kb/s multi-band excitation (MBE) coder. Results show that significantly better spectral accuracy is obtained using DAP. However listening tests on MBE encoded speech indicate that the advantage of DAP over the other techniques is not strongly perceptible.


doi: 10.21437/ICSLP.1998-390

Cite as: Molyneux, D.J., Parris, C.I., Sun, X.Q., Cheetham, B.M.G. (1998) Comparison of spectral estimation techniques for low bit-rate speech coding. Proc. 5th International Conference on Spoken Language Processing (ICSLP 1998), paper 0946, doi: 10.21437/ICSLP.1998-390

@inproceedings{molyneux98_icslp,
  author={D. J. Molyneux and C. I. Parris and X. Q. Sun and B. M. G. Cheetham},
  title={{Comparison of spectral estimation techniques for low bit-rate speech coding}},
  year=1998,
  booktitle={Proc. 5th International Conference on Spoken Language Processing (ICSLP 1998)},
  pages={paper 0946},
  doi={10.21437/ICSLP.1998-390}
}