TY - GEN
T1 - Structured Poi data extraction from internet news
AU - Zhang, Hua Ping
PY - 2010
Y1 - 2010
N2 - POI (Point of Interest) database is key resources for GIS (Geographic Information System) application. POI manual gathering is expensive and time consuming. This paper presents a state-of-the-art solution that automatically extracts structured POI data from Internet news. The procedure includes making lexical analysis news Internet, and then identifying time expression, location and organization entities, extracting an event scenario based on POI heuristic features. With POI data extraction, consistency between event and entity, result optimization and filtering with heuristics was taken into account. Open testing with experiment conducted on 1,000 news, the precision is 93.60% and recall is 75.48%. The method within POI oriented event extraction is effective and has been applied in industrial POI collection.
AB - POI (Point of Interest) database is key resources for GIS (Geographic Information System) application. POI manual gathering is expensive and time consuming. This paper presents a state-of-the-art solution that automatically extracts structured POI data from Internet news. The procedure includes making lexical analysis news Internet, and then identifying time expression, location and organization entities, extracting an event scenario based on POI heuristic features. With POI data extraction, consistency between event and entity, result optimization and filtering with heuristics was taken into account. Open testing with experiment conducted on 1,000 news, the precision is 93.60% and recall is 75.48%. The method within POI oriented event extraction is effective and has been applied in industrial POI collection.
UR - http://www.scopus.com/inward/record.url?scp=78651431168&partnerID=8YFLogxK
U2 - 10.1109/IUCS.2010.5666648
DO - 10.1109/IUCS.2010.5666648
M3 - Conference contribution
AN - SCOPUS:78651431168
SN - 9781424478200
T3 - 2010 4th International Universal Communication Symposium, IUCS 2010 - Proceedings
SP - 116
EP - 122
BT - 2010 4th International Universal Communication Symposium, IUCS 2010 - Proceedings
T2 - 2010 4th International Universal Communication Symposium, IUCS 2010
Y2 - 18 October 2010 through 19 October 2010
ER -