A Topic-Aware Data Generation Framework for Math Word Problems

Tianyu Zhao, Chengliang Chai, Jiabin Liu, Guoliang Li*, Jianhua Feng, Zitao Liu

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

In educational practice, math word problems (MWPs) are important material to train pupils’ to solve real-world math problems. In this paper, our focus is on the learning-based topic-aware MWP generation problem. To summarize, there are several challenges with respect to the problem. The first one is how to design the generation model considering the structural characteristics of math equations as well as the topic information. Second, how to generate multiple MWPs covering diverse topics is another challenge. To address the above challenges, we propose a novel framework, GenMWP, to automatically generate high-quality MWPs. First, we use the Tree-LSTM model to capture the structure of the equation. Second, we design a topic embedding model to consider the topic information, allowing the user to input any words or sentences to describe the topic. Third, given only an equation, we also support generating multiple MWPs covering different topics. The automatic evaluation result shows that GenMWP outperforms other baseline methods, and the human evaluation result shows that the MWPs generated by GenMWP have a higher rating in fluency, consistency, equation relevance, and topic relevance.

Original languageEnglish
Title of host publicationDatabase Systems for Advanced Applications - 28th International Conference, DASFAA 2023, Proceedings
EditorsXin Wang, Maria Luisa Sapino, Wook-Shin Han, Amr El Abbadi, Gill Dobbie, Zhiyong Feng, Yingxiao Shao, Hongzhi Yin
PublisherSpringer Science and Business Media Deutschland GmbH
Pages286-302
Number of pages17
ISBN (Print)9783031306778
DOIs
Publication statusPublished - 2023
Event28th International Conference on Database Systems for Advanced Applications, DASFAA 2023 - Tianjin, China
Duration: 17 Apr 202320 Apr 2023

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume13946 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference28th International Conference on Database Systems for Advanced Applications, DASFAA 2023
Country/TerritoryChina
CityTianjin
Period17/04/2320/04/23

Keywords

  • Math Word Problem
  • Natural Language Generation
  • Topic model

Fingerprint

Dive into the research topics of 'A Topic-Aware Data Generation Framework for Math Word Problems'. Together they form a unique fingerprint.

Cite this