The efficiency of phonetic vocoders stems from the fact that the only transmitted information is the index of the recognised units and the corresponding prosodic parameters. Hence, speaker recognisability is one of the main issues in this class of coders. Our approach to minimise this drawback was to include some speaker adaptation capability. The purpose of this paper is two-folded: on one hand, to describe the recognisability and intelligibility tests that were performed with our phonetic vocoder with and without speaker adaptation; on the other hand, to present our recent developments of this coder, using the SpeechDat corpus for Portuguese, that includes telephone calls from 5000 speakers. This allowed us to generate improved HMM models, codebooks, and quantization tables, and to investigate the performance of the coder in non-clean environments and with a much wider speaker population.
Cite as: Ribeiro, C.M., Trancoso, I.M., Caseiro, D.A. (2000) Phonetic vocoder assessment. Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000), vol. 3, 830-833, doi: 10.21437/ICSLP.2000-663
@inproceedings{ribeiro00_icslp, author={Carlos M. Ribeiro and Isabel M. Trancoso and Diamantino A. Caseiro}, title={{Phonetic vocoder assessment}}, year=2000, booktitle={Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000)}, pages={vol. 3, 830-833}, doi={10.21437/ICSLP.2000-663} }