A holistic approach to aligning geospatial data with multidimensional similarity measuring

Li Yu; Peiyuan Qiu; Xiliang Liu; Feng Lu; Bo Wan

doi:10.1080/17538947.2017.1359688

A holistic approach to aligning geospatial data with multidimensional similarity measuring

Li Yu, Peiyuan Qiu, Xiliang Liu, Feng Lu^*, Bo Wan

^*Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

23 Citations (Scopus)

Abstract

Semantically aligning the heterogeneous geospatial datasets (GDs) produced by different organizations demands efficient similarity matching methods. However, the strategies employed to align the schema (concept and property) and instances are usually not reusable, and the effects of unbalanced information tend to be neglected in GD alignment. To solve this problem, a holistic approach is presented in this paper to integrally align the geospatial entities (concepts, properties and instances) simultaneously. Spatial, lexical, structural and extensional similarity metrics are designed and automatically aggregated by means of approval voting. The presented approach is validated with real geographical semantic webs, Geonames and OpenStreetMap. Compared with the well-known extensional-based aligning system, the presented approach not only considers more information involved in GD alignment, but also avoids the artificial parameter setting in metric aggregation. It reduces the dependency on specific information, and makes the alignment more robust under the unbalanced distribution of various information.

Original language	English
Pages (from-to)	845-862
Number of pages	18
Journal	International Journal of Digital Earth
Volume	11
Issue number	8
DOIs	https://doi.org/10.1080/17538947.2017.1359688
Publication status	Published - 3 Aug 2018
Externally published	Yes

Keywords

Geospatial data
data alignment
semantic web
similarity matching

Access to Document

10.1080/17538947.2017.1359688

Cite this

Yu, L., Qiu, P., Liu, X., Lu, F., & Wan, B. (2018). A holistic approach to aligning geospatial data with multidimensional similarity measuring. International Journal of Digital Earth, 11(8), 845-862. https://doi.org/10.1080/17538947.2017.1359688

@article{f5629ba4d5b5453793e395770db7c9b9,

title = "A holistic approach to aligning geospatial data with multidimensional similarity measuring",

abstract = "Semantically aligning the heterogeneous geospatial datasets (GDs) produced by different organizations demands efficient similarity matching methods. However, the strategies employed to align the schema (concept and property) and instances are usually not reusable, and the effects of unbalanced information tend to be neglected in GD alignment. To solve this problem, a holistic approach is presented in this paper to integrally align the geospatial entities (concepts, properties and instances) simultaneously. Spatial, lexical, structural and extensional similarity metrics are designed and automatically aggregated by means of approval voting. The presented approach is validated with real geographical semantic webs, Geonames and OpenStreetMap. Compared with the well-known extensional-based aligning system, the presented approach not only considers more information involved in GD alignment, but also avoids the artificial parameter setting in metric aggregation. It reduces the dependency on specific information, and makes the alignment more robust under the unbalanced distribution of various information.",

keywords = "Geospatial data, data alignment, semantic web, similarity matching",

author = "Li Yu and Peiyuan Qiu and Xiliang Liu and Feng Lu and Bo Wan",

note = "Publisher Copyright: {\textcopyright} 2017, {\textcopyright} 2017 Informa UK Limited, trading as Taylor & Francis Group.",

year = "2018",

month = aug,

day = "3",

doi = "10.1080/17538947.2017.1359688",

language = "English",

volume = "11",

pages = "845--862",

journal = "International Journal of Digital Earth",

issn = "1753-8947",

publisher = "Taylor and Francis Ltd.",

number = "8",

}

TY - JOUR

T1 - A holistic approach to aligning geospatial data with multidimensional similarity measuring

AU - Yu, Li

AU - Qiu, Peiyuan

AU - Liu, Xiliang

AU - Lu, Feng

AU - Wan, Bo

PY - 2018/8/3

Y1 - 2018/8/3

N2 - Semantically aligning the heterogeneous geospatial datasets (GDs) produced by different organizations demands efficient similarity matching methods. However, the strategies employed to align the schema (concept and property) and instances are usually not reusable, and the effects of unbalanced information tend to be neglected in GD alignment. To solve this problem, a holistic approach is presented in this paper to integrally align the geospatial entities (concepts, properties and instances) simultaneously. Spatial, lexical, structural and extensional similarity metrics are designed and automatically aggregated by means of approval voting. The presented approach is validated with real geographical semantic webs, Geonames and OpenStreetMap. Compared with the well-known extensional-based aligning system, the presented approach not only considers more information involved in GD alignment, but also avoids the artificial parameter setting in metric aggregation. It reduces the dependency on specific information, and makes the alignment more robust under the unbalanced distribution of various information.

AB - Semantically aligning the heterogeneous geospatial datasets (GDs) produced by different organizations demands efficient similarity matching methods. However, the strategies employed to align the schema (concept and property) and instances are usually not reusable, and the effects of unbalanced information tend to be neglected in GD alignment. To solve this problem, a holistic approach is presented in this paper to integrally align the geospatial entities (concepts, properties and instances) simultaneously. Spatial, lexical, structural and extensional similarity metrics are designed and automatically aggregated by means of approval voting. The presented approach is validated with real geographical semantic webs, Geonames and OpenStreetMap. Compared with the well-known extensional-based aligning system, the presented approach not only considers more information involved in GD alignment, but also avoids the artificial parameter setting in metric aggregation. It reduces the dependency on specific information, and makes the alignment more robust under the unbalanced distribution of various information.

KW - Geospatial data

KW - data alignment

KW - semantic web

KW - similarity matching

UR - http://www.scopus.com/inward/record.url?scp=85026753553&partnerID=8YFLogxK

U2 - 10.1080/17538947.2017.1359688

DO - 10.1080/17538947.2017.1359688

M3 - Article

AN - SCOPUS:85026753553

SN - 1753-8947

VL - 11

SP - 845

EP - 862

JO - International Journal of Digital Earth

JF - International Journal of Digital Earth

IS - 8

ER -

A holistic approach to aligning geospatial data with multidimensional similarity measuring

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this