跳到主要导航 跳到搜索 跳到主要内容

MG-CTG: A Framework for Controllable Text Generation Across Multiple Granularities

  • Beijing Institute of Technology

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Managing text data is crucial given the abundance of unstructured textual data in real-world applications. Text generation not only assists in managing massive amounts of text through tasks such as summarization and report generation but also has the capability to generate the needed content to enrich the textual database. However, the generated text is often open-ended and may not meet specific target requirements that fall into three categories: semantic, structural, and lexical. Fine-tuning pre-trained language models can meet each specific control requirement, but there is no simultaneous integration of controls from all three categories. On the other hand, post-processing methods are limited to semantic control or lexical control only. In this paper, we propose MG-CTG, a Muti-Granularity Controllable Text Generation framework to generated text satisfying controls across multiple granularities. Specifically, we design distinct controllers that employ different strategies based on post-processing methods to achieve control. Further, our proposed framework is able to attain fine-grained control at the structural granularity, as well as enhance the incorporation of keywords into the generated text via a designed keyword-guided weighted decoding method. We conduct experiments by combining control information from different granularities and evaluate the results on standard benchmark dataset for controllable text generation. The experimental results demonstrate that our method outperforms other post-processing methods on two real-world datasets.

源语言英语
主期刊名Database Systems for Advanced Applications - 29th International Conference, DASFAA 2024, Proceedings
编辑Makoto Onizuka, Chuan Xiao, Jae-Gil Lee, Yongxin Tong, Yoshiharu Ishikawa, Kejing Lu, Sihem Amer-Yahia, H.V. Jagadish
出版商Springer Science and Business Media Deutschland GmbH
138-154
页数17
ISBN(印刷版)9789819755684
DOI
出版状态已出版 - 2024
活动29th International Conference on Database Systems for Advanced Applications, DASFAA 2024 - Gifu, 日本
期限: 2 7月 20245 7月 2024

出版系列

姓名Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
14854 LNCS
ISSN(印刷版)0302-9743
ISSN(电子版)1611-3349

会议

会议29th International Conference on Database Systems for Advanced Applications, DASFAA 2024
国家/地区日本
Gifu
时期2/07/245/07/24

指纹

探究 'MG-CTG: A Framework for Controllable Text Generation Across Multiple Granularities' 的科研主题。它们共同构成独一无二的指纹。

引用此