5th International Conference on Spoken Language Processing

Sydney, Australia
November 30 - December 4, 1998

A Comparative Study Between Polyclass and Multiclass Language Models

Imed Zitouni (1), Kamel Smaili (1), Jean-Paul Haton (1), Sabine Deligne (2), Frédéric Bimbot (3)

(1) LORIA / INRIA-Lorraine, France
(2) ATR-ITL, Japan
(3) IRISA-CNRS/INRIA, France

In this paper, we introduce the concept of Multiclass for language modeling and compare it to the Polyclass model. The originality of the Multiclass model is its ability to parse a string of classes/tags into variable-length independent sequences. A few experiments were carried out on a class corpus extracted from the automatically labeled French "Le Monde" word corpus, which contains 43 million words. In our experiments, Multiclass models outperform first-order Polyclass models but are slightly outperformed by second-order Polyclass models.
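
As a rough illustration of what parsing a tag string into variable-length independent sequences could look like, the Python sketch below searches for the most likely segmentation by dynamic programming. It is not the authors' implementation. It assumes that the probability of a tag string under one segmentation is the product of the probabilities of its sequences; the tag names, the toy probabilities, the `max_len` limit, and the function name `best_segmentation` are all hypothetical.

```python
import math


def best_segmentation(tags, seq_logprob, max_len=3):
    """Return (segments, log_prob) for the most likely split of `tags`
    into sequences of at most `max_len` tags, treating the sequences
    as independent (log-probabilities add). Returns (None, -inf) if
    no segmentation is covered by `seq_logprob`."""
    n = len(tags)
    best = [-math.inf] * (n + 1)  # best[i]: best log-prob of tags[:i]
    back = [0] * (n + 1)          # back[i]: start of the last sequence
    best[0] = 0.0
    for end in range(1, n + 1):
        for start in range(max(0, end - max_len), end):
            chunk = tuple(tags[start:end])
            if chunk not in seq_logprob:
                continue  # unseen sequence: not allowed in this toy model
            score = best[start] + seq_logprob[chunk]
            if score > best[end]:
                best[end] = score
                back[end] = start
    if best[n] == -math.inf:
        return None, -math.inf
    # Walk the back-pointers to recover the chosen sequences.
    segments = []
    i = n
    while i > 0:
        segments.append(tuple(tags[back[i]:i]))
        i = back[i]
    segments.reverse()
    return segments, best[n]


# Hypothetical sequence probabilities (illustrative values only).
toy_prob = {
    ("DET",): 0.2,
    ("NOUN",): 0.2,
    ("VERB",): 0.2,
    ("DET", "NOUN"): 0.3,
    ("NOUN", "VERB"): 0.1,
}
toy_logprob = {seq: math.log(p) for seq, p in toy_prob.items()}

tags = ["DET", "NOUN", "VERB", "DET", "NOUN"]
segments, score = best_segmentation(tags, toy_logprob)
print(segments)
```

With these toy values the search groups each "DET NOUN" pair into one sequence and leaves "VERB" on its own, since that split has a higher product of probabilities than any other covered segmentation.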


Bibliographic reference. Zitouni, Imed / Smaili, Kamel / Haton, Jean-Paul / Deligne, Sabine / Bimbot, Frédéric (1998): "A comparative study between polyclass and multiclass language models", in ICSLP-1998, paper 0498.