
Two-Round Voting: Improving Self-Consistency by Recycling Low-Vote Reasoning Evidence

  • Lingxiang Wei
  • Peiwen Yuan
  • Shaojie Qu
  • Kan Li*
  • *Corresponding author for this work
  • Beijing Institute of Technology

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

Abstract

Self-Consistency (SC) improves test-time reasoning by sampling multiple reasoning chains and voting on their final answers; however, it typically discards low-vote chains. On mathematical reasoning tasks, low-vote chains can still contain useful partial evidence and intermediate results while failing only near the final step, especially for uncertain problems where solution paths diverge. We propose Two-Round Voting for Self-Consistency, a training-free test-time framework that recycles low-vote chains. After a standard SC pass, we extract high-confidence prefixes from low-vote chains and use them as additional evidence to re-score candidates, followed by a second-round vote. The second-round signal is fused with first-round votes via a tunable weight. To limit overhead, a flip-possible gate triggers re-voting only when the second round could change the predicted answer. Across the AIME (1983-2003) and MATH-500 benchmarks, Two-Round Voting yields consistent gains over Self-Consistency for Qwen3 models at multiple scales, improving the accuracy-compute trade-off at matched FLOPs.
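
The voting logic described in the abstract can be sketched in Python as follows. This is an illustrative sketch only, not the authors' implementation: the abstract does not give the fusion formula or the gate criterion, so the convex combination of vote shares and second-round scores, the worst-case gate test, the default weight of 0.3, and the names `two_round_vote` and `rescore` are all assumptions made here. The `rescore` callback stands in for the step that re-scores candidates using prefixes recycled from low-vote chains.

```python
from collections import Counter
from typing import Callable, Dict, List, Tuple


def two_round_vote(
    answers: List[str],
    rescore: Callable[[List[str]], Dict[str, float]],
    weight: float = 0.3,
) -> Tuple[str, bool]:
    """Return (predicted answer, whether the second round ran).

    answers: final answers from the first-round SC samples.
    rescore: hypothetical callback giving each candidate a score in [0, 1]
             from evidence recycled out of low-vote chains.
    weight:  assumed fusion weight on the second-round signal, in [0, 1].
    """
    if not answers:
        raise ValueError("answers must be non-empty")

    # First round: plain self-consistency, expressed as vote shares.
    counts = Counter(answers)
    total = len(answers)
    first = {a: c / total for a, c in counts.items()}

    # Rank by vote share; ties are broken by the answer string.
    ranked = sorted(first, key=lambda a: (-first[a], a))
    leader = ranked[0]
    if len(ranked) == 1:
        return leader, False  # nothing to flip to

    # Assumed gate: give the runner-up the best possible second-round
    # score (1) and the leader the worst (0). If the leader still wins
    # after fusion, a second round cannot change the answer.
    runner_up = ranked[1]
    best_case_runner = (1 - weight) * first[runner_up] + weight
    worst_case_leader = (1 - weight) * first[leader]
    if best_case_runner <= worst_case_leader:
        return leader, False

    # Second round: fuse first-round shares with the re-scored signal.
    scores = rescore(ranked)
    fused = {
        a: (1 - weight) * first[a] + weight * scores.get(a, 0.0)
        for a in ranked
    }
    # max() keeps the earliest item on ties, i.e. the first-round order.
    return max(ranked, key=lambda a: fused[a]), True
```

With five samples split 3 to 2, the gate opens and a strong second-round signal for the minority answer can flip the prediction; with a 9 to 1 split and the same weight, the gate stays closed and `rescore` is never called, which is where the compute saving would come from under these assumptions.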

Original language: English
Title of host publication: 2026 International Conference on Communication Networks and Machine Learning, CNML 2026
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 1128-1133
Number of pages: 6
ISBN (electronic): 9798331590475
DOI
Publication status: Published - 2026
Event: 4th International Conference on Communication Networks and Machine Learning, CNML 2026 - Chongqing, China
Duration: 30 Jan 2026 → 1 Feb 2026

Publication series

Name: 2026 International Conference on Communication Networks and Machine Learning, CNML 2026

Conference

Conference: 4th International Conference on Communication Networks and Machine Learning, CNML 2026
Country/Territory: China
City: Chongqing
Period: 30/01/26 → 1/02/26
