5th International Conference on Spoken Language Processing
The use of cache memories and symmetric Kullback-Leibler distances is proposed for topic classification and topic-shift detection. Experiments with a large corpus of articles from the French newspaper "Le Monde" show tangible advantages when different models are combined with a suitable strategy. Experimental results show that different strategies for topic-shift detection have to be used depending on whether high recall or high precision is sought. Furthermore, methods based on topic-independent distributions provide complementary candidates with respect to the use of topic-dependent distributions, leading to an increase in recall with a minor loss in precision.
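As a rough illustration of the idea named in the abstract, the sketch below compares the word distribution of a small cache of recent words with topic-dependent unigram distributions using a symmetric Kullback-Leibler distance, and reports a candidate shift when the nearest topic changes. It is only a minimal sketch under stated assumptions, not the method of the paper: the function names, the additive smoothing constant `alpha`, the fixed cache size, the toy vocabulary, and the "nearest topic changed" decision rule are all hypothetical choices made for this example.

```python
import math
from collections import Counter, deque


def unigram(words, vocab, alpha=1.0):
    """Additively smoothed unigram distribution over vocab (alpha is an assumed constant)."""
    counts = Counter(w for w in words if w in vocab)
    total = sum(counts.values()) + alpha * len(vocab)
    return {w: (counts[w] + alpha) / total for w in vocab}


def symmetric_kl(p, q):
    """Symmetric KL distance: KL(p||q) + KL(q||p) = sum_w (p_w - q_w) * log(p_w / q_w)."""
    return sum((p[w] - q[w]) * math.log(p[w] / q[w]) for w in p)


def classify(cache, topic_models, vocab):
    """Return the topic whose distribution is closest to the cache distribution."""
    p_cache = unigram(cache, vocab)
    return min(topic_models, key=lambda t: symmetric_kl(p_cache, topic_models[t]))


def detect_shifts(words, topic_models, vocab, cache_size=3):
    """Slide a fixed-size cache over the text; record (position, old_topic, new_topic)."""
    cache = deque(maxlen=cache_size)  # oldest word is dropped automatically
    shifts, current = [], None
    for i, word in enumerate(words):
        cache.append(word)
        if len(cache) < cache_size:
            continue  # wait until the cache is full
        topic = classify(cache, topic_models, vocab)
        if current is not None and topic != current:
            shifts.append((i, current, topic))
        current = topic
    return shifts


# Toy data, invented for this example only.
vocab = {"goal", "match", "team", "vote", "law", "party"}
topic_models = {
    "sport": unigram(["goal", "match", "team"], vocab),
    "politics": unigram(["vote", "law", "party"], vocab),
}
text = ["goal", "match", "team", "vote", "law", "party"]
shifts = detect_shifts(text, topic_models, vocab)
print(shifts)
```

In this toy run the cache is assigned to "sport" until most of its words come from the second topic, at which point a single candidate shift is recorded. A larger cache would smooth the decision at the cost of detecting the shift later, which is one simple way to trade recall against precision.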
Bibliographic reference. Bigi, Brigitte / Mori, Renato De / El-Beze, Marc / Spriet, Thierry (1998): "Detecting topic shifts using a cache memory", In ICSLP-1998, paper 0077.