Capturing semantic hierarchies to perform meaningful integration in HTML tables

Shijun Li*, Mengchi Liu, Guoren Wang, Zhiyong Peng

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

11 Citations (Scopus)

Abstract

We present a new approach that automatically captures the semantic hierarchies in HTML tables, and semi-automatically integrates HTML tables belonging to a domain. It first automatically captures the attribute-value pairs in HTML tables by normalization and recognizing their headings. After generating global schema manually, it learns the lexical semantic sets and contexts, by which it then eliminates the conflicts and solves the nondeterministic problems in mapping each source schema to the global schema to integrate the data in HTML tables.

Original languageEnglish
Pages (from-to)899-902
Number of pages4
JournalLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume3007
DOIs
Publication statusPublished - 2004
Externally publishedYes

Fingerprint

Dive into the research topics of 'Capturing semantic hierarchies to perform meaningful integration in HTML tables'. Together they form a unique fingerprint.

Cite this