Itembank redundancy checking based on multi-instance learning

Shi Ping Tang*, Xiao Zhong Fan

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

2 Citations (Scopus)

Abstract

A method based on multi-instance learning to improve the itembank redundancy checking algorithm is proposed. Redundancy checking for items with multiple questions is addressed through transforming it into a multi-instance learning problem. High-frequency words extracting algorithm based on suffix tree is used to extract content features of items and the use of thesaurus can be avoided. Combined with metadata features of items, a method to compute item similarity is proposed. Experiments on the realworld itembank dataset show that the proposed method is an effective and feasible solution to the itembank redundancy checking problem, and achieves 91.3% precision and 92.3% recall. It laid groundwork for future work on the integration of itembank systems.

Original languageEnglish
Pages (from-to)1071-1074
Number of pages4
JournalBeijing Ligong Daxue Xuebao/Transaction of Beijing Institute of Technology
Volume25
Issue number12
Publication statusPublished - Dec 2005

Keywords

  • Itembank redundancy checking
  • Minimum Hausdorff distance
  • Multi-instance learning

Fingerprint

Dive into the research topics of 'Itembank redundancy checking based on multi-instance learning'. Together they form a unique fingerprint.

Cite this