In this paper a comparative study between One-Class-One-Network (OCON) and Multi-Layered Perceptron (MLP) neural networks for vowel phoneme recognition is presented. The OCON architecture, first proposed by I.C.Jou et al, is similar in design to a conventional feed-forward MLP, only each class had its own dedicated sub-network containing a single output node. Conventional MLPs usually consist of fully-connected nodes which not only result in a large number of weighted connections but also create the problem of cross-class interference. Using vowel phoneme data from the DARPA TIMIT corpus of read speech, MLP and OCON architectures were trained and the relative effects of recognition and convergence rates during both intra and inter-class adaptation tested. The OCON showed an increase in the convergence rate of 273% and an improvement of adapted recognition rates against the MLP of over 12%.
Cite as: Haskey, S.J., Datta, S. (1998) A comparative study of OCON and MLP architectures for phoneme recognition. Proc. 5th International Conference on Spoken Language Processing (ICSLP 1998), paper 0568, doi: 10.21437/ICSLP.1998-400
@inproceedings{haskey98_icslp, author={Stephen J. Haskey and Sekharajit Datta}, title={{A comparative study of OCON and MLP architectures for phoneme recognition}}, year=1998, booktitle={Proc. 5th International Conference on Spoken Language Processing (ICSLP 1998)}, pages={paper 0568}, doi={10.21437/ICSLP.1998-400} }