TY - GEN
T1 - Expression Syntax Information Bottleneck for Math Word Problems
AU - Xiong, Jing
AU - Li, Chengming
AU - Yang, Min
AU - Hu, Xiping
AU - Hu, Bin
N1 - Publisher Copyright:
© 2022 ACM.
PY - 2022/7/6
Y1 - 2022/7/6
N2 - Math Word Problem (MWP) solving aims to automatically answer mathematical questions given in text. Previous studies tend to design complex models that capture additional information from the original text so that the model gains more comprehensive features. In this paper, we turn our attention in the opposite direction and work on how to discard redundant features containing spurious correlations for MWP. To this end, we design an Expression Syntax Information Bottleneck method for MWP (called ESIB) based on the variational information bottleneck, which extracts essential features of the expression syntax tree while filtering out latent-specific redundancy containing syntax-irrelevant features. The key idea of ESIB is to encourage multiple models to predict the same expression syntax tree for different representations of the same problem via mutual learning, so as to capture consistent information about the expression syntax tree and discard latent-specific redundancy. To improve the generalization ability of the model and generate more diverse expressions, we design a self-distillation loss that encourages the model to rely more on the expression syntax information in the latent space. Experimental results on two large-scale benchmarks show that our model not only achieves state-of-the-art results but also generates more diverse solutions.
AB - Math Word Problem (MWP) solving aims to automatically answer mathematical questions given in text. Previous studies tend to design complex models that capture additional information from the original text so that the model gains more comprehensive features. In this paper, we turn our attention in the opposite direction and work on how to discard redundant features containing spurious correlations for MWP. To this end, we design an Expression Syntax Information Bottleneck method for MWP (called ESIB) based on the variational information bottleneck, which extracts essential features of the expression syntax tree while filtering out latent-specific redundancy containing syntax-irrelevant features. The key idea of ESIB is to encourage multiple models to predict the same expression syntax tree for different representations of the same problem via mutual learning, so as to capture consistent information about the expression syntax tree and discard latent-specific redundancy. To improve the generalization ability of the model and generate more diverse expressions, we design a self-distillation loss that encourages the model to rely more on the expression syntax information in the latent space. Experimental results on two large-scale benchmarks show that our model not only achieves state-of-the-art results but also generates more diverse solutions.
KW - math word problems
KW - mutual learning
KW - spurious correlations
KW - variational information bottleneck
UR - http://www.scopus.com/inward/record.url?scp=85135089557&partnerID=8YFLogxK
U2 - 10.1145/3477495.3531824
DO - 10.1145/3477495.3531824
M3 - Conference contribution
AN - SCOPUS:85135089557
T3 - SIGIR 2022 - Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval
SP - 2166
EP - 2171
BT - SIGIR 2022 - Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval
PB - Association for Computing Machinery, Inc.
T2 - 45th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2022
Y2 - 11 July 2022 through 15 July 2022
ER -