TY - GEN
T1 - Pay "attention" to Chart Images for What You Read on Text
AU - Yang, Chenyu
AU - Fan, Ruixue
AU - Tang, Nan
AU - Zhang, Meihui
AU - Zhao, Xiaoman
AU - Fan, Ju
AU - Du, Xiaoyong
N1 - Publisher Copyright:
© 2023 ACM.
PY - 2023/6/4
Y1 - 2023/6/4
N2 - Data visualization is changing how we understand data, by showing why's, how's, and what's behind important patterns/trends in almost every corner of the world, such as in academic papers, news articles, financial reports, etc. However, along with the increasing complexity and richness of data visualizations, given a text description (e.g., "fewer teens say they attended school completely online (8%)"), it becomes harder for users to pinpoint where to pay attention to on a chart (e.g., a grouped bar chart). In this demonstration paper, we present a system HiChart for text-chart image highlighting: when a user selects a span of text, HiChart automatically analyzes the chart image (e.g., a jpeg or a png file) and highlights the parts that are relevant to the span. From a technical perspective, HiChart devises the following techniques. Reverse-engineering visualizations: given a chart image, HiChart uses computer vision techniques to generate a visualization specification using Vega-Lite language, as well as the underlying dataset; Visualization calibration by data tuning: HiChart calibrates the re-generated chart by tuning the recovered dataset through value perturbation; and Chart highlighting for a span: HiChart maps the span to corresponding data cells and uses the built-in highlighting functions of Vega-Lite to highlight the chart.
AB - Data visualization is changing how we understand data, by showing why's, how's, and what's behind important patterns/trends in almost every corner of the world, such as in academic papers, news articles, financial reports, etc. However, along with the increasing complexity and richness of data visualizations, given a text description (e.g., "fewer teens say they attended school completely online (8%)"), it becomes harder for users to pinpoint where to pay attention to on a chart (e.g., a grouped bar chart). In this demonstration paper, we present a system HiChart for text-chart image highlighting: when a user selects a span of text, HiChart automatically analyzes the chart image (e.g., a jpeg or a png file) and highlights the parts that are relevant to the span. From a technical perspective, HiChart devises the following techniques. Reverse-engineering visualizations: given a chart image, HiChart uses computer vision techniques to generate a visualization specification using Vega-Lite language, as well as the underlying dataset; Visualization calibration by data tuning: HiChart calibrates the re-generated chart by tuning the recovered dataset through value perturbation; and Chart highlighting for a span: HiChart maps the span to corresponding data cells and uses the built-in highlighting functions of Vega-Lite to highlight the chart.
KW - chart highlighting
KW - data extraction
KW - data visualization
UR - http://www.scopus.com/inward/record.url?scp=85162926321&partnerID=8YFLogxK
U2 - 10.1145/3555041.3589714
DO - 10.1145/3555041.3589714
M3 - Conference contribution
AN - SCOPUS:85162926321
T3 - Proceedings of the ACM SIGMOD International Conference on Management of Data
SP - 111
EP - 114
BT - SIGMOD 2023 - Companion of the 2023 ACM/SIGMOD International Conference on Management of Data
PB - Association for Computing Machinery
T2 - 2023 ACM/SIGMOD International Conference on Management of Data, SIGMOD 2023
Y2 - 18 June 2023 through 23 June 2023
ER -