Pay "attention" to Chart Images for What You Read on Text

Chenyu Yang; Ruixue Fan; Nan Tang; Meihui Zhang; Xiaoman Zhao; Ju Fan; Xiaoyong Du

doi:10.1145/3555041.3589714

Chenyu Yang, Ruixue Fan, Nan Tang, Meihui Zhang, Xiaoman Zhao*, Ju Fan, Xiaoyong Du

*Corresponding author for this work

School of Computer Science and Technology

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

Abstract

Data visualization is changing how we understand data, by showing why's, how's, and what's behind important patterns/trends in almost every corner of the world, such as in academic papers, news articles, financial reports, etc. However, along with the increasing complexity and richness of data visualizations, given a text description (e.g., "fewer teens say they attended school completely online (8%)"), it becomes harder for users to pinpoint where to pay attention on a chart (e.g., a grouped bar chart). In this demonstration paper, we present HiChart, a system for text-chart image highlighting: when a user selects a span of text, HiChart automatically analyzes the chart image (e.g., a JPEG or PNG file) and highlights the parts that are relevant to the span. From a technical perspective, HiChart devises the following techniques. (1) Reverse-engineering visualizations: given a chart image, HiChart uses computer vision techniques to generate a visualization specification in the Vega-Lite language, as well as the underlying dataset; (2) Visualization calibration by data tuning: HiChart calibrates the re-generated chart by tuning the recovered dataset through value perturbation; and (3) Chart highlighting for a span: HiChart maps the span to the corresponding data cells and uses the built-in highlighting functions of Vega-Lite to highlight the chart.
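
The last technique named in the abstract, highlighting through Vega-Lite, can be illustrated with a minimal sketch. It is not HiChart's implementation, which this record does not describe; it only assumes that the recovered dataset is a list of rows and that the selected text span has already been mapped to one field/value pair. The function name `highlight_spec`, the field names, and every data value other than the 8% figure quoted in the abstract are hypothetical. The sketch relies on Vega-Lite's conditional encoding (a `condition` with a `test` expression) to color the matching bars and mute the rest.

```python
import json


def highlight_spec(rows, x_field, y_field, match_field, match_value,
                   highlight_color="#e45756", muted_color="#d3d3d3"):
    """Build a Vega-Lite bar-chart spec that highlights matching rows.

    Rows whose `match_field` equals `match_value` get `highlight_color`;
    all other rows get `muted_color`.
    """
    # json.dumps quotes and escapes both operands for the Vega expression.
    test = f"datum[{json.dumps(match_field)}] === {json.dumps(match_value)}"
    return {
        "$schema": "https://vega.github.io/schema/vega-lite/v5.json",
        "data": {"values": rows},
        "mark": "bar",
        "encoding": {
            "x": {"field": x_field, "type": "nominal"},
            "y": {"field": y_field, "type": "quantitative"},
            "color": {
                "condition": {"test": test, "value": highlight_color},
                "value": muted_color,
            },
        },
    }


# Hypothetical recovered dataset: only the 8% value comes from the abstract;
# the other rows are placeholders made up for illustration.
rows = [
    {"mode": "Completely online", "share": 8},
    {"mode": "Placeholder A", "share": 30},
    {"mode": "Placeholder B", "share": 62},
]

spec = highlight_spec(rows, "mode", "share", "mode", "Completely online")
print(json.dumps(spec, indent=2))
```

The printed JSON is a complete Vega-Lite specification, so it can be pasted into any Vega-Lite renderer to check the result visually. Expressing the highlight as a condition inside the spec, rather than editing the image, matches the abstract's point that a reverse-engineered specification makes highlighting a matter of selecting data cells.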

Original language	English
Title of host publication	SIGMOD 2023 - Companion of the 2023 ACM/SIGMOD International Conference on Management of Data
Publisher	Association for Computing Machinery
Pages	111-114
Number of pages	4
ISBN (Electronic)	9781450395076
DOIs	https://doi.org/10.1145/3555041.3589714
Publication status	Published - 4 Jun 2023
Event	2023 ACM/SIGMOD International Conference on Management of Data, SIGMOD 2023 - Seattle, United States Duration: 18 Jun 2023 → 23 Jun 2023

Publication series

Name	Proceedings of the ACM SIGMOD International Conference on Management of Data
ISSN (Print)	0730-8078

Conference

Conference	2023 ACM/SIGMOD International Conference on Management of Data, SIGMOD 2023
Country/Territory	United States
City	Seattle
Period	18/06/23 → 23/06/23

Keywords

chart highlighting
data extraction
data visualization

Cite this

Yang, C., Fan, R., Tang, N., Zhang, M., Zhao, X., Fan, J., & Du, X. (2023). Pay "attention" to Chart Images for What You Read on Text. In SIGMOD 2023 - Companion of the 2023 ACM/SIGMOD International Conference on Management of Data (pp. 111-114). (Proceedings of the ACM SIGMOD International Conference on Management of Data). Association for Computing Machinery. https://doi.org/10.1145/3555041.3589714

@inproceedings{c7de912cbb5645d4bc922293ce3c3cd9,

title = "Pay {"}attention{"} to Chart Images for What You Read on Text",

abstract = "Data visualization is changing how we understand data, by showing why's, how's, and what's behind important patterns/trends in almost every corner of the world, such as in academic papers, news articles, financial reports, etc. However, along with the increasing complexity and richness of data visualizations, given a text description (e.g., {"}fewer teens say they attended school completely online (8\%){"}), it becomes harder for users to pinpoint where to pay attention to on a chart (e.g., a grouped bar chart). In this demonstration paper, we present a system HiChart for text-chart image highlighting: when a user selects a span of text, HiChart automatically analyzes the chart image (e.g., a jpeg or a png file) and highlights the parts that are relevant to the span. From a technical perspective, HiChart devises the following techniques. Reverse-engineering visualizations: given a chart image, HiChart uses computer vision techniques to generate a visualization specification using Vega-Lite language, as well as the underlying dataset; Visualization calibration by data tuning: HiChart calibrates the re-generated chart by tuning the recovered dataset through value perturbation; and Chart highlighting for a span: HiChart maps the span to corresponding data cells and uses the built-in highlighting functions of Vega-Lite to highlight the chart.",

keywords = "chart highlighting, data extraction, data visualization",

author = "Chenyu Yang and Ruixue Fan and Nan Tang and Meihui Zhang and Xiaoman Zhao and Ju Fan and Xiaoyong Du",

note = "Publisher Copyright: {\textcopyright} 2023 ACM.; 2023 ACM/SIGMOD International Conference on Management of Data, SIGMOD 2023 ; Conference date: 18-06-2023 Through 23-06-2023",

year = "2023",

month = jun,

day = "4",

doi = "10.1145/3555041.3589714",

language = "English",

series = "Proceedings of the ACM SIGMOD International Conference on Management of Data",

publisher = "Association for Computing Machinery",

pages = "111--114",

booktitle = "SIGMOD 2023 - Companion of the 2023 ACM/SIGMOD International Conference on Management of Data",

}

Yang, C, Fan, R, Tang, N, Zhang, M, Zhao, X, Fan, J & Du, X 2023, Pay "attention" to Chart Images for What You Read on Text. in SIGMOD 2023 - Companion of the 2023 ACM/SIGMOD International Conference on Management of Data. Proceedings of the ACM SIGMOD International Conference on Management of Data, Association for Computing Machinery, pp. 111-114, 2023 ACM/SIGMOD International Conference on Management of Data, SIGMOD 2023, Seattle, United States, 18/06/23. https://doi.org/10.1145/3555041.3589714

Pay "attention" to Chart Images for What You Read on Text. / Yang, Chenyu; Fan, Ruixue; Tang, Nan et al.
SIGMOD 2023 - Companion of the 2023 ACM/SIGMOD International Conference on Management of Data. Association for Computing Machinery, 2023. p. 111-114 (Proceedings of the ACM SIGMOD International Conference on Management of Data).

TY - GEN

T1 - Pay "attention" to Chart Images for What You Read on Text

AU - Yang, Chenyu

AU - Fan, Ruixue

AU - Tang, Nan

AU - Zhang, Meihui

AU - Zhao, Xiaoman

AU - Fan, Ju

AU - Du, Xiaoyong

PY - 2023/6/4

Y1 - 2023/6/4

N2 - Data visualization is changing how we understand data, by showing why's, how's, and what's behind important patterns/trends in almost every corner of the world, such as in academic papers, news articles, financial reports, etc. However, along with the increasing complexity and richness of data visualizations, given a text description (e.g., "fewer teens say they attended school completely online (8%)"), it becomes harder for users to pinpoint where to pay attention to on a chart (e.g., a grouped bar chart). In this demonstration paper, we present a system HiChart for text-chart image highlighting: when a user selects a span of text, HiChart automatically analyzes the chart image (e.g., a jpeg or a png file) and highlights the parts that are relevant to the span. From a technical perspective, HiChart devises the following techniques. Reverse-engineering visualizations: given a chart image, HiChart uses computer vision techniques to generate a visualization specification using Vega-Lite language, as well as the underlying dataset; Visualization calibration by data tuning: HiChart calibrates the re-generated chart by tuning the recovered dataset through value perturbation; and Chart highlighting for a span: HiChart maps the span to corresponding data cells and uses the built-in highlighting functions of Vega-Lite to highlight the chart.

KW - chart highlighting

KW - data extraction

KW - data visualization

UR - http://www.scopus.com/inward/record.url?scp=85162926321&partnerID=8YFLogxK

U2 - 10.1145/3555041.3589714

DO - 10.1145/3555041.3589714

M3 - Conference contribution

AN - SCOPUS:85162926321

T3 - Proceedings of the ACM SIGMOD International Conference on Management of Data

SP - 111

EP - 114

BT - SIGMOD 2023 - Companion of the 2023 ACM/SIGMOD International Conference on Management of Data

PB - Association for Computing Machinery

T2 - 2023 ACM/SIGMOD International Conference on Management of Data, SIGMOD 2023

Y2 - 18 June 2023 through 23 June 2023

ER -

Yang C, Fan R, Tang N, Zhang M, Zhao X, Fan J et al. Pay "attention" to Chart Images for What You Read on Text. In SIGMOD 2023 - Companion of the 2023 ACM/SIGMOD International Conference on Management of Data. Association for Computing Machinery. 2023. p. 111-114. (Proceedings of the ACM SIGMOD International Conference on Management of Data). doi: 10.1145/3555041.3589714
