跳到主要导航 跳到搜索 跳到主要内容

Enhancing math reasoning ability of large language models via computation logic graphs

  • Deji Zhao
  • , Donghong Han*
  • , Jia Wu
  • , Zhongjiang He
  • , Bo Ning
  • , Ye Yuan
  • , Yongxiang Li
  • , Chao Wang
  • , Shuangyong Song
  • *此作品的通讯作者
  • Northeastern University China
  • Macquarie University
  • China Telecommunications
  • Dalian Maritime University
  • Beijing Institute of Technology

科研成果: 期刊稿件文章同行评审

摘要

The reasoning capabilities of large language models (LLMs) are essential for a wide range of tasks, particularly in the domain of mathematical reasoning. Common chain of thought methods perform well in handling simple reasoning problems, but for complex problems, a single-dimensional chain of thought is inadequate to address multi-layered logical relationships. To tackle this challenge, this paper introduces the concept of a Computation Logic Graph (CLG), designed to enhance the logical reasoning abilities of LLMs when solving complex mathematical problems. The CLG decomposes complex mathematical problems into multiple simple intermediate computational units, and the final answer is obtained through multiple iterations of these units. On the one hand, the CLG improves the model's ability to decompose and solve complex mathematical problems step-by-step from a global perspective. On the other hand, the local inference process within the CLG helps enhance the model's accuracy in single step calculations. To develop models with the ability to construct Computation Logic Graphs automatically, we create a dataset of computational logic graphs for complex mathematical problems, called the Computation-intensive Math Logic Graph (CMLG) dataset. We fine-tune several open-source LLMs using the CMLG dataset. Experimental results demonstrate that the proposed CLG method significantly enhances the performance of LLMs in complex mathematical reasoning tasks, outperforming on both the CMLG dataset and six other publicly available datasets from diverse domains.

源语言英语
文章编号113905
期刊Knowledge-Based Systems
325
DOI
出版状态已出版 - 5 9月 2025
已对外发布

指纹

探究 'Enhancing math reasoning ability of large language models via computation logic graphs' 的科研主题。它们共同构成独一无二的指纹。

引用此