Classifying document titles based on information inference

Dawei Song, Peter Bruza, Zi Huang, Raymond Lau

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution > peer-review

11 Citations (Scopus)

Abstract

We propose an intelligent document title classification agent based on a theory of information inference. Information is represented as vector spaces computed by a cognitively motivated model, Hyperspace Analogue to Language (HAL). A combination heuristic merges a group of concepts into a single combination vector. Information inference is performed on the HAL spaces by computing information flow between vectors or combination vectors. Under this theory, a document title is treated as a combination vector obtained by applying the combination heuristic to all the non-stop-word terms in the title. Two methodologies for learning and assigning categories to document titles are addressed. Experimental results on the Reuters-21578 corpus show that the framework is promising: its performance reaches 71% of the upper bound, which is approximated by using whole documents.
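The pipeline described in the abstract (HAL vectors, a combined title vector, and an information-flow score) might be sketched roughly as follows. This is only an illustrative sketch under stated assumptions, not the authors' implementation: the abstract does not specify the combination heuristic or the information-flow formula, so `combine` (a normalized sum) and `information_flow` (a weighted degree of inclusion) are hypothetical stand-ins, and the symmetric window weighting in `build_hal` is likewise an assumption. All function names and parameters are invented for illustration.

```python
import math
from collections import defaultdict


def build_hal(tokens, window=5):
    """Build a symmetric HAL-style space: {term: {context_term: weight}}.

    Assumption: a co-occurrence at distance d inside the window gets
    weight (window - d + 1), and both directions are merged.
    """
    hal = defaultdict(lambda: defaultdict(float))
    for i, term in enumerate(tokens):
        for d in range(1, window + 1):
            j = i - d
            if j < 0:
                break
            weight = window - d + 1  # closer neighbours weigh more
            hal[term][tokens[j]] += weight
            hal[tokens[j]][term] += weight
    return {t: dict(ctx) for t, ctx in hal.items()}


def normalize(vec):
    """Scale a sparse vector to unit length (empty if it has no mass)."""
    norm = math.sqrt(sum(w * w for w in vec.values()))
    return {k: w / norm for k, w in vec.items()} if norm else {}


def combine(vectors):
    """Hypothetical stand-in for the combination heuristic:
    sum the unit-length concept vectors and renormalize."""
    combined = defaultdict(float)
    for vec in vectors:
        for k, w in normalize(vec).items():
            combined[k] += w
    return normalize(combined)


def information_flow(source, target, threshold=0.0):
    """Hypothetical degree of inclusion: the share of the source's
    property weight (above threshold) that also appears in the target."""
    props = {k: w for k, w in source.items() if w > threshold}
    total = sum(props.values())
    if total == 0:
        return 0.0
    shared = sum(w for k, w in props.items() if target.get(k, 0.0) > 0.0)
    return shared / total
```

Under these assumptions, a title would be scored against a category term by combining the HAL vectors of its non-stop-word terms with `combine` and passing the result, together with the category term's vector, to `information_flow`; a score close to 1 would suggest the category is strongly implied by the title.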

Original language: English
Title of host publication: Foundations of Intelligent Systems - 14th International Symposium, ISMIS 2003, Proceedings
Editors: Ning Zhong, Zbigniew W. Ras, Shusaku Tsumoto, Einoshin Suzuki
Publisher: Springer Verlag
Pages: 297-306
Number of pages: 10
ISBN (Print): 3540202560, 9783540202561
Publication status: Published - 2003
Externally published: Yes
Event: 14th International Symposium on Methodologies for Intelligent Systems, ISMIS 2003 - Maebashi City, Japan
Duration: 28 Oct 2003 → 31 Oct 2003

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 2871
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349

Conference

Conference: 14th International Symposium on Methodologies for Intelligent Systems, ISMIS 2003
Country/Territory: Japan
City: Maebashi City
Period: 28/10/03 → 31/10/03
