Abstract
Semantically aligning the heterogeneous geospatial datasets (GDs) produced by different organizations demands efficient similarity matching methods. However, the strategies employed to align the schema (concept and property) and instances are usually not reusable, and the effects of unbalanced information tend to be neglected in GD alignment. To solve this problem, a holistic approach is presented in this paper to integrally align the geospatial entities (concepts, properties and instances) simultaneously. Spatial, lexical, structural and extensional similarity metrics are designed and automatically aggregated by means of approval voting. The presented approach is validated with real geographical semantic webs, Geonames and OpenStreetMap. Compared with the well-known extensional-based aligning system, the presented approach not only considers more information involved in GD alignment, but also avoids the artificial parameter setting in metric aggregation. It reduces the dependency on specific information, and makes the alignment more robust under the unbalanced distribution of various information.
Original language | English |
---|---|
Pages (from-to) | 845-862 |
Number of pages | 18 |
Journal | International Journal of Digital Earth |
Volume | 11 |
Issue number | 8 |
DOIs | |
Publication status | Published - 3 Aug 2018 |
Externally published | Yes |
Keywords
- Geospatial data
- data alignment
- semantic web
- similarity matching