8th Annual Conference of the International Speech Communication Association

Antwerp, Belgium
August 27-31, 2007

Bootstrapping Morphological Analysis of Gĩkũyũ Using Unsupervised Maximum Entropy Learning

Guy De Pauw (1), Peter Waiganjo Wagacha (2)

(1) University of Antwerp, Belgium
(2) University of Nairobi, Kenya

This paper describes a proof-of-the-principle experiment in which maximum entropy learning is used for the automatic induction of shallow morphological features for the resource-scarce Bantu language of Gĩkũyũ. This novel approach circumvents the limitations of typical unsupervised morphological induction methods that employ minimum-edit distance metrics to establish morphological similarity between words. The experimental results show that the unsupervised maximum entropy learning approach compares favorably to those of the established AutoMorphology method.

Full Paper

Bibliographic reference.  Pauw, Guy De / Wagacha, Peter Waiganjo (2007): "Bootstrapping morphological analysis of gĩkũyũ using unsupervised maximum entropy learning", In INTERSPEECH-2007, 1517-1520.