Natural Language to Visualization by Neural Machine Translation

Yuyu Luo; Nan Tang; Guoliang Li; Jiawei Tang; Chengliang Chai; Xuedi Qin

doi:10.1109/TVCG.2021.3114848

Natural Language to Visualization by Neural Machine Translation

Yuyu Luo, Nan Tang, Guoliang Li^*, Jiawei Tang, Chengliang Chai, Xuedi Qin

^*此作品的通讯作者

科研成果: 期刊稿件 › 文章 › 同行评审

80 引用（Scopus）

摘要

Supporting the translation from natural language (NL) query to visualization (NL2VIS) can simplify the creation of data visualizations because if successful, anyone can generate visualizations by their natural language from the tabular data. The state-of-the-art NL2VIS approaches (e.g., NL4DV and FlowSense) are based on semantic parsers and heuristic algorithms, which are not end-to-end and are not designed for supporting (possibly) complex data transformations. Deep neural network powered neural machine translation models have made great strides in many machine translation tasks, which suggests that they might be viable for NL2VIS as well. In this paper, we present ncNet, a Transformer-based sequence-to-sequence model for supporting NL2VIS, with several novel visualization-aware optimizations, including using attention-forcing to optimize the learning process, and visualization-aware rendering to produce better visualization results. To enhance the capability of machine to comprehend natural language queries, ncNet is also designed to take an optional chart template (e.g., a pie chart or a scatter plot) as an additional input, where the chart template will be served as a constraint to limit what could be visualized. We conducted both quantitative evaluation and user study, showing that ncNet achieves good accuracy in the nvBench benchmark and is easy-to-use.

源语言	英语
页（从-至）	217-226
页数	10
期刊	IEEE Transactions on Visualization and Computer Graphics
卷	28
期	1
DOI	https://doi.org/10.1109/TVCG.2021.3114848
出版状态	已出版 - 1 1月 2022
已对外发布	是

访问文件

10.1109/TVCG.2021.3114848

其它文件与链接

链接到 Scopus 的出版物

引用此

Luo, Y., Tang, N., Li, G., Tang, J., Chai, C., & Qin, X. (2022). Natural Language to Visualization by Neural Machine Translation. IEEE Transactions on Visualization and Computer Graphics, 28(1), 217-226. https://doi.org/10.1109/TVCG.2021.3114848

@article{f5a99a4f1b7c4357abace46f80ec43ce,

title = "Natural Language to Visualization by Neural Machine Translation",

abstract = "Supporting the translation from natural language (NL) query to visualization (NL2VIS) can simplify the creation of data visualizations because if successful, anyone can generate visualizations by their natural language from the tabular data. The state-of-the-art NL2VIS approaches (e.g., NL4DV and FlowSense) are based on semantic parsers and heuristic algorithms, which are not end-to-end and are not designed for supporting (possibly) complex data transformations. Deep neural network powered neural machine translation models have made great strides in many machine translation tasks, which suggests that they might be viable for NL2VIS as well. In this paper, we present ncNet, a Transformer-based sequence-to-sequence model for supporting NL2VIS, with several novel visualization-aware optimizations, including using attention-forcing to optimize the learning process, and visualization-aware rendering to produce better visualization results. To enhance the capability of machine to comprehend natural language queries, ncNet is also designed to take an optional chart template (e.g., a pie chart or a scatter plot) as an additional input, where the chart template will be served as a constraint to limit what could be visualized. We conducted both quantitative evaluation and user study, showing that ncNet achieves good accuracy in the nvBench benchmark and is easy-to-use.",

keywords = "Natural language interface, chart template, data visualization, neural machine translation",

author = "Yuyu Luo and Nan Tang and Guoliang Li and Jiawei Tang and Chengliang Chai and Xuedi Qin",

note = "Publisher Copyright: {\textcopyright} 1995-2012 IEEE.",

year = "2022",

month = jan,

day = "1",

doi = "10.1109/TVCG.2021.3114848",

language = "English",

volume = "28",

pages = "217--226",

journal = "IEEE Transactions on Visualization and Computer Graphics",

issn = "1077-2626",

publisher = "IEEE Computer Society",

number = "1",

}

TY - JOUR

T1 - Natural Language to Visualization by Neural Machine Translation

AU - Luo, Yuyu

AU - Tang, Nan

AU - Li, Guoliang

AU - Tang, Jiawei

AU - Chai, Chengliang

AU - Qin, Xuedi

PY - 2022/1/1

Y1 - 2022/1/1

N2 - Supporting the translation from natural language (NL) query to visualization (NL2VIS) can simplify the creation of data visualizations because if successful, anyone can generate visualizations by their natural language from the tabular data. The state-of-the-art NL2VIS approaches (e.g., NL4DV and FlowSense) are based on semantic parsers and heuristic algorithms, which are not end-to-end and are not designed for supporting (possibly) complex data transformations. Deep neural network powered neural machine translation models have made great strides in many machine translation tasks, which suggests that they might be viable for NL2VIS as well. In this paper, we present ncNet, a Transformer-based sequence-to-sequence model for supporting NL2VIS, with several novel visualization-aware optimizations, including using attention-forcing to optimize the learning process, and visualization-aware rendering to produce better visualization results. To enhance the capability of machine to comprehend natural language queries, ncNet is also designed to take an optional chart template (e.g., a pie chart or a scatter plot) as an additional input, where the chart template will be served as a constraint to limit what could be visualized. We conducted both quantitative evaluation and user study, showing that ncNet achieves good accuracy in the nvBench benchmark and is easy-to-use.

AB - Supporting the translation from natural language (NL) query to visualization (NL2VIS) can simplify the creation of data visualizations because if successful, anyone can generate visualizations by their natural language from the tabular data. The state-of-the-art NL2VIS approaches (e.g., NL4DV and FlowSense) are based on semantic parsers and heuristic algorithms, which are not end-to-end and are not designed for supporting (possibly) complex data transformations. Deep neural network powered neural machine translation models have made great strides in many machine translation tasks, which suggests that they might be viable for NL2VIS as well. In this paper, we present ncNet, a Transformer-based sequence-to-sequence model for supporting NL2VIS, with several novel visualization-aware optimizations, including using attention-forcing to optimize the learning process, and visualization-aware rendering to produce better visualization results. To enhance the capability of machine to comprehend natural language queries, ncNet is also designed to take an optional chart template (e.g., a pie chart or a scatter plot) as an additional input, where the chart template will be served as a constraint to limit what could be visualized. We conducted both quantitative evaluation and user study, showing that ncNet achieves good accuracy in the nvBench benchmark and is easy-to-use.

KW - Natural language interface

KW - chart template

KW - data visualization

KW - neural machine translation

UR - http://www.scopus.com/inward/record.url?scp=85122122541&partnerID=8YFLogxK

U2 - 10.1109/TVCG.2021.3114848

DO - 10.1109/TVCG.2021.3114848

M3 - Article

C2 - 34784276

AN - SCOPUS:85122122541

SN - 1077-2626

VL - 28

SP - 217

EP - 226

JO - IEEE Transactions on Visualization and Computer Graphics

JF - IEEE Transactions on Visualization and Computer Graphics

IS - 1

ER -

Natural Language to Visualization by Neural Machine Translation

摘要

访问文件

其它文件与链接

指纹

引用此