In this paper we investigate an alternative approach to the design of low-bit rate (LBR) quantisation. This approach incorporates phonetic information into the structure of Line Spectral Frequency (LSF) codebooks. In prior work vector quantisation (VQ) has been used to quantise stochastic processes. Speech signals can, however, be described in terms of phonetic segments and linguistic rules. A trained LSF codebook, like the phonetic inventory of a language, is a static description of spectral behaviour of speech. As clear relationships exist between phonetic segments and LSFs the structure of an LSF codebook can be analysed in terms of the phonetic segments. The investigation leads to the conclusion that phonetic information can be usefully employed in codebook training in terms of perceptual performance and bit-rate reductions.
Cite as: Parry, J.J., Burnett, I.S., Chicharo, J.F. (1998) Using linguistic knowledge to improve the design of low-bit rate LSF quantisation. Proc. 5th International Conference on Spoken Language Processing (ICSLP 1998), paper 0137, doi: 10.21437/ICSLP.1998-393
@inproceedings{parry98_icslp, author={John J. Parry and Ian S. Burnett and Joe F. Chicharo}, title={{Using linguistic knowledge to improve the design of low-bit rate LSF quantisation}}, year=1998, booktitle={Proc. 5th International Conference on Spoken Language Processing (ICSLP 1998)}, pages={paper 0137}, doi={10.21437/ICSLP.1998-393} }