The orthographic surface structure of Swedish words has been used for predicting parts-of-speech information using a connectionist approach. This technique can be used to aid syntactic processing within a text-to-speech system. The error back-propagation technique has been used for the connectionist learning. A corpus of the 10 000 most frequent Swedish words have been used for training and testing the system. The results indicate that around 80% of the words can be correctly classified by using the last part of each word. The system is compared to a rule based system that makes the same sort of predictions from word endings. Both systems give comparable results for the lexicon used.
Cite as: Elenius, K., Carlson, R. (1989) Assigning parts-of-speech to words from their orthography using a connectionist model. Proc. First European Conference on Speech Communication and Technology (Eurospeech 1989), 1534-1537, doi: 10.21437/Eurospeech.1989-117
@inproceedings{elenius89_eurospeech, author={Kjell Elenius and Rolf Carlson}, title={{Assigning parts-of-speech to words from their orthography using a connectionist model}}, year=1989, booktitle={Proc. First European Conference on Speech Communication and Technology (Eurospeech 1989)}, pages={1534--1537}, doi={10.21437/Eurospeech.1989-117} }