ISCA Archive ICSLP 2000
ISCA Archive ICSLP 2000

High performance "general purpose" phonetic recognition for Italian

Piero Cosi, John-Paul Hosom

The development of a speaker independent "general purpose" phonetic recognizer for Italian is described. The CSLU Toolkit was used to develop and implement the system. The recognizer, based on a frame-based hybrid HMM/ANN architecture trained on context-dependent categories to account for coarticulatory variation, recognizes 38 different phonemes (not including silence or closures), and can distinguish between stressed and unstressed vowels as well as open and closed vowels. The APASCI corpus, containing nearly 2500 sentences read by 100 speakers, where the sentences have been designed to maximize the number of phonemes occurring in different contexts, was used for training and testing. As of the time of this writing, a phoneme-level accuracy of 82.90% on the development set and of 80.53% on the test set has been obtained. This level of accuracy is much greater than on a similar English-language corpus (with state-of-the-art performance of slightly better than 70%) and it represents the best performance obtained so far on this corpus.


Cite as: Cosi, P., Hosom, J.-P. (2000) High performance "general purpose" phonetic recognition for Italian. Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000), vol. 2, 527-530

@inproceedings{cosi00_icslp,
  author={Piero Cosi and John-Paul Hosom},
  title={{High performance "general purpose" phonetic recognition for Italian}},
  year=2000,
  booktitle={Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000)},
  pages={vol. 2, 527-530}
}