跳到主要导航 跳到搜索 跳到主要内容

TabAgent: A Multi-Agent Table Extraction Framework for Unstructured Documents

  • Jingfei Wu
  • , Junyi Han
  • , Yujin Gao*
  • *此作品的通讯作者
  • Beijing Institute of Technology

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

With the increasing amount of unstructured documents in various domains, extracting structured data from such sources has become critical for efficient data management and advanced analytics. However, existing methods for structured table extraction face challenges: (1) the complexity and implicitness of semantic context and patterns in unstructured documents hinder accurate information extraction, and (2) limited adaptability to evolving user intents and requirements for strict schema alignment and structural constraints. To address these limitations, we propose TabAgent, a novel multi-agent collaborative framework for structured table extraction from unstructured documents. TabAgent integrates four specialized agents, Schema Agent, Extraction Agent, Semantic Agent, and Validation Agent with a shared memory repository to iteratively refine extraction results. By leveraging collaborative reasoning and iterative self-correcting loops, TabAgent enables accurate, adaptive, and robust table extraction across diverse document domains and user instructions. Extensive experiments on four datasets demonstrate that TabAgent consistently outperforms several baselines, including pure LLM extractors and LLM-based systems, highlighting the effectiveness of this collaborative framework. Our work represents one of the first multi-agent frameworks for structured table extraction, offering an applicable solution for real-world applications.

源语言英语
主期刊名Proceedings of 2025 5th International Symposium on Artificial Intelligence and Big Data, AIBDF 2025
出版商Institute of Electrical and Electronics Engineers Inc.
600-607
页数8
ISBN(电子版)9798331569921
DOI
出版状态已出版 - 2025
已对外发布
活动2025 5th International Symposium on Artificial Intelligence and Big Data, AIBDF 2025 - Guiyang, 中国
期限: 26 12月 202528 12月 2025

出版系列

姓名Proceedings of 2025 5th International Symposium on Artificial Intelligence and Big Data, AIBDF 2025

会议

会议2025 5th International Symposium on Artificial Intelligence and Big Data, AIBDF 2025
国家/地区中国
Guiyang
时期26/12/2528/12/25

指纹

探究 'TabAgent: A Multi-Agent Table Extraction Framework for Unstructured Documents' 的科研主题。它们共同构成独一无二的指纹。

引用此