TY - GEN
T1 - A Topic-Aware Data Generation Framework for Math Word Problems
AU - Zhao, Tianyu
AU - Chai, Chengliang
AU - Liu, Jiabin
AU - Li, Guoliang
AU - Feng, Jianhua
AU - Liu, Zitao
N1 - Publisher Copyright:
© 2023, The Author(s), under exclusive license to Springer Nature Switzerland AG.
PY - 2023
Y1 - 2023
N2 - In educational practice, math word problems (MWPs) are important material to train pupils’ to solve real-world math problems. In this paper, our focus is on the learning-based topic-aware MWP generation problem. To summarize, there are several challenges with respect to the problem. The first one is how to design the generation model considering the structural characteristics of math equations as well as the topic information. Second, how to generate multiple MWPs covering diverse topics is another challenge. To address the above challenges, we propose a novel framework, GenMWP, to automatically generate high-quality MWPs. First, we use the Tree-LSTM model to capture the structure of the equation. Second, we design a topic embedding model to consider the topic information, allowing the user to input any words or sentences to describe the topic. Third, given only an equation, we also support generating multiple MWPs covering different topics. The automatic evaluation result shows that GenMWP outperforms other baseline methods, and the human evaluation result shows that the MWPs generated by GenMWP have a higher rating in fluency, consistency, equation relevance, and topic relevance.
AB - In educational practice, math word problems (MWPs) are important material to train pupils’ to solve real-world math problems. In this paper, our focus is on the learning-based topic-aware MWP generation problem. To summarize, there are several challenges with respect to the problem. The first one is how to design the generation model considering the structural characteristics of math equations as well as the topic information. Second, how to generate multiple MWPs covering diverse topics is another challenge. To address the above challenges, we propose a novel framework, GenMWP, to automatically generate high-quality MWPs. First, we use the Tree-LSTM model to capture the structure of the equation. Second, we design a topic embedding model to consider the topic information, allowing the user to input any words or sentences to describe the topic. Third, given only an equation, we also support generating multiple MWPs covering different topics. The automatic evaluation result shows that GenMWP outperforms other baseline methods, and the human evaluation result shows that the MWPs generated by GenMWP have a higher rating in fluency, consistency, equation relevance, and topic relevance.
KW - Math Word Problem
KW - Natural Language Generation
KW - Topic model
UR - http://www.scopus.com/inward/record.url?scp=85161629747&partnerID=8YFLogxK
U2 - 10.1007/978-3-031-30678-5_22
DO - 10.1007/978-3-031-30678-5_22
M3 - Conference contribution
AN - SCOPUS:85161629747
SN - 9783031306778
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 286
EP - 302
BT - Database Systems for Advanced Applications - 28th International Conference, DASFAA 2023, Proceedings
A2 - Wang, Xin
A2 - Sapino, Maria Luisa
A2 - Han, Wook-Shin
A2 - El Abbadi, Amr
A2 - Dobbie, Gill
A2 - Feng, Zhiyong
A2 - Shao, Yingxiao
A2 - Yin, Hongzhi
PB - Springer Science and Business Media Deutschland GmbH
T2 - 28th International Conference on Database Systems for Advanced Applications, DASFAA 2023
Y2 - 17 April 2023 through 20 April 2023
ER -