Expression Syntax Information Bottleneck for Math Word Problems

Jing Xiong, Chengming Li*, Min Yang, Xiping Hu, Bin Hu

*此作品的通讯作者

科研成果: 书/报告/会议事项章节会议稿件同行评审

5 引用 (Scopus)

摘要

Math Word Problems (MWP) aims to automatically solve mathematical questions given in texts. Previous studies tend to design complex models to capture additional information in the original text so as to enable the model to gain more comprehensive features. In this paper, we turn our attention in the opposite direction, and work on how to discard redundant features containing spurious correlations for MWP. To this end, we design an Expression Syntax Information Bottleneck method for MWP (called ESIB) based on variational information bottleneck, which extracts essential features of the expression syntax tree while filtering latent-specific redundancy containing syntax-irrelevant features. The key idea of ESIB is to encourage multiple models to predict the same expression syntax tree for different problem representations of the same problem by mutual learning so as to capture consistent information of expression syntax tree and discard latent-specific redundancy. To improve the generalization ability of the model and generate more diverse expressions, we design a self-distillation loss to encourage the model to rely more on the expression syntax information in the latent space. Experimental results on two large-scale benchmarks show that our model not only achieves state-of-the-art results but also generates more diverse solutions.

源语言英语
主期刊名SIGIR 2022 - Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval
出版商Association for Computing Machinery, Inc
2166-2171
页数6
ISBN(电子版)9781450387323
DOI
出版状态已出版 - 6 7月 2022
已对外发布
活动45th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2022 - Madrid, 西班牙
期限: 11 7月 202215 7月 2022

出版系列

姓名SIGIR 2022 - Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval

会议

会议45th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2022
国家/地区西班牙
Madrid
时期11/07/2215/07/22

指纹

探究 'Expression Syntax Information Bottleneck for Math Word Problems' 的科研主题。它们共同构成独一无二的指纹。

引用此