TY - JOUR
T1 - Stop filtering
T2 - Multi-view attribute-enhanced dialogue learning
AU - Li, Yiwei
AU - Sun, Bin
AU - Feng, Shaoxiong
AU - Li, Kan
N1 - Publisher Copyright:
© 2023 Elsevier B.V.
PY - 2023/10/9
Y1 - 2023/10/9
N2 - There is a growing interest in improving the conversational ability of models by filtering dialogue corpora. Previous filtering strategies rely on a scoring method to assess and discard samples from one perspective, enabling the model to enhance the corresponding dialogue attributes (e.g., consistency). However, the discarded samples may achieve high scores in other perspectives and can also provide regularization effects on the model learning. In this work, we propose a multi-view attribute-enhanced dialogue learning framework that can capture the attribute-related features steadily and comprehensively. Instead of filtering the raw dataset, our framework introduces adapters to learn knowledge from the attribute-related sub-sets after pre-training the model on the full dataset. Considering the variety of dialogue attributes, we further design a multi-view enhancement mechanism, including multi-view selection and inter-view fusion. It groups the high-quality samples from multiple perspectives, respectively, and enhances different attributes of responses with the corresponding sub-sets and adapters, keeping knowledge independent and allowing flexible integration. Empirical results and analysis show that our framework outperforms the state-of-the-art data filtering methods significantly in terms of enhancing dialogue attributes and fusing view-specific knowledge.
AB - There is a growing interest in improving the conversational ability of models by filtering dialogue corpora. Previous filtering strategies rely on a scoring method to assess and discard samples from one perspective, enabling the model to enhance the corresponding dialogue attributes (e.g., consistency). However, the discarded samples may achieve high scores in other perspectives and can also provide regularization effects on the model learning. In this work, we propose a multi-view attribute-enhanced dialogue learning framework that can capture the attribute-related features steadily and comprehensively. Instead of filtering the raw dataset, our framework introduces adapters to learn knowledge from the attribute-related sub-sets after pre-training the model on the full dataset. Considering the variety of dialogue attributes, we further design a multi-view enhancement mechanism, including multi-view selection and inter-view fusion. It groups the high-quality samples from multiple perspectives, respectively, and enhances different attributes of responses with the corresponding sub-sets and adapters, keeping knowledge independent and allowing flexible integration. Empirical results and analysis show that our framework outperforms the state-of-the-art data filtering methods significantly in terms of enhancing dialogue attributes and fusing view-specific knowledge.
KW - Data filtering
KW - Dialogue attributes
KW - Open-domain dialogue generation
UR - http://www.scopus.com/inward/record.url?scp=85166650993&partnerID=8YFLogxK
U2 - 10.1016/j.knosys.2023.110833
DO - 10.1016/j.knosys.2023.110833
M3 - Article
AN - SCOPUS:85166650993
SN - 0950-7051
VL - 277
JO - Knowledge-Based Systems
JF - Knowledge-Based Systems
M1 - 110833
ER -