TY - JOUR
T1 - 基于细粒度可解释矩阵的摘要生成模型
TT - A Summarization Model Based on a Fine-Grained Interpretable Matrix
AU - Wang, Haonan
AU - Gao, Yang
AU - Feng, Junlan
AU - Hu, Min
AU - Wang, Huixin
AU - Bai, Yu
N1 - Publisher Copyright:
© 2021 Peking University.
PY - 2021/1/20
Y1 - 2021/1/20
N2 - Summarizing and interpreting the information of a long article poses a great challenge for summarization models. To address this, an extract-then-generate summarization model based on a Fine-Grained Interpretable Matrix (FGIM) is proposed, which improves the interpretability of long-text summarization with respect to salience, novelty, and relevance, and uses these signals to guide automatic summary generation. The model employs a pair-wise extractor to compress the article content and capture sentences with high centrality, then combines the compressed text with a generator to produce the summary. At the same time, an interpretable mask matrix can be used to control the direction of summary generation at the decoding end. Two encoder variants are implemented, based on Transformer and on BERT respectively. The method outperforms the strongest baseline models on the benchmark text summarization datasets (CNN/DailyMail and NYT50). Two further test sets are constructed to verify the novelty and relevance of the generated summaries, and the proposed model achieves corresponding improvements in controllable generation on both.
AB - Summarizing and interpreting the information of a long article poses a great challenge for summarization models. To address this, an extract-then-generate summarization model based on a Fine-Grained Interpretable Matrix (FGIM) is proposed, which improves the interpretability of long-text summarization with respect to salience, novelty, and relevance, and uses these signals to guide automatic summary generation. The model employs a pair-wise extractor to compress the article content and capture sentences with high centrality, then combines the compressed text with a generator to produce the summary. At the same time, an interpretable mask matrix can be used to control the direction of summary generation at the decoding end. Two encoder variants are implemented, based on Transformer and on BERT respectively. The method outperforms the strongest baseline models on the benchmark text summarization datasets (CNN/DailyMail and NYT50). Two further test sets are constructed to verify the novelty and relevance of the generated summaries, and the proposed model achieves corresponding improvements in controllable generation on both.
KW - Abstractive summarization
KW - Centrality
KW - Controllable
KW - Interpretable extraction
KW - Mask matrix
UR - http://www.scopus.com/inward/record.url?scp=85101385191&partnerID=8YFLogxK
U2 - 10.13209/j.0479-8023.2020.082
DO - 10.13209/j.0479-8023.2020.082
M3 - Article
AN - SCOPUS:85101385191
SN - 0479-8023
VL - 57
SP - 23
EP - 30
JO - Beijing Daxue Xuebao (Ziran Kexue Ban)/Acta Scientiarum Naturalium Universitatis Pekinensis
JF - Beijing Daxue Xuebao (Ziran Kexue Ban)/Acta Scientiarum Naturalium Universitatis Pekinensis
IS - 1
ER -