Sixth European Conference on Speech Communication and Technology
In this paper, a distance-related unit association maximum entropy (DUAME) language modeling is proposed. This approach can model an event (unit subsequence) using the co-occurrence of full distance unit association (UA) features so that it is able to pursue a functional approximation to higher order N-gram with significantly less memory requirement. A smoothing strategy related to this modeling will also be discussed. Preliminary experimental results have shown that DUAME modeling is comparable to conventional N-gram modeling in perplexity with significantly small number of parameters.
Full Paper (PDF) Gnu-Zipped Postscript
Bibliographic reference. Zhang, Shuwu / Singer, Harald / Wu, Dekai / Sagisaka, Yoshinori (1999): "Improving n-gram modeling using distance-related unit association maximum entropy language modeling", In EUROSPEECH'99, 1611-1614.