Extracting fine-grained entities based on coordinate graph

Qing Yang, Peng Jiang, Chunxia Zhang*, Zhendong Niu

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Most previous entity extraction studies focus on a small set of coarse-grained classes, such as person etc. However, the distribution of entities within query logs of search engine indicates that users are more interested in a wider range of fine-grained entities, such as GRAMMY winner and Ivy League member etc. In this paper, we present a semi-supervised method to extract fine-grained entities from an open-domain corpus. We build a graph based on entities in coordinate lists, which are html nodes with the same tag path of the DOM trees. Then class labels are propagated over the graph from known entities to unknowns. Experiments on a large corpus from ClueWeb09a dataset show that our proposed approach achieves the promising results.

Original languageEnglish
Title of host publicationNatural Language Processing and Information Systems - 18th International Conference on Applications of Natural Language to Information Systems, NLDB 2013, Proceedings
Pages367-371
Number of pages5
DOIs
Publication statusPublished - 2013
Event18th International Conference on Application of Natural Language to Information Systems, NLDB 2013 - Salford, United Kingdom
Duration: 19 Jun 201321 Jun 2013

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume7934 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference18th International Conference on Application of Natural Language to Information Systems, NLDB 2013
Country/TerritoryUnited Kingdom
CitySalford
Period19/06/1321/06/13

Keywords

  • Coordinate Graph
  • Fine-Grained Entity Extraction
  • Label Propagation

Fingerprint

Dive into the research topics of 'Extracting fine-grained entities based on coordinate graph'. Together they form a unique fingerprint.

Cite this