Stop filtering: Multi-view attribute-enhanced dialogue learning

Yiwei Li; Bin Sun; Shaoxiong Feng; Kan Li

doi:10.1016/j.knosys.2023.110833

Stop filtering: Multi-view attribute-enhanced dialogue learning

Yiwei Li, Bin Sun, Shaoxiong Feng, Kan Li^*

^*Corresponding author for this work

School of Computer Science and Technology

Beijing Institute of Technology

Research output: Contribution to journal › Article › peer-review

1 Citation (Scopus)

Abstract

There is a growing interest in improving the conversational ability of models by filtering dialogue corpora. Previous filtering strategies rely on a scoring method to assess and discard samples from one perspective, enabling the model to enhance the corresponding dialogue attributes (e.g., consistency). However, the discarded samples may achieve high scores in other perspectives and can also provide regularization effects on the model learning. In this work, we propose a multi-view attribute-enhanced dialogue learning framework that can capture the attribute-related features steadily and comprehensively. Instead of filtering the raw dataset, our framework introduces adapters to learn knowledge from the attribute-related sub-sets after pre-training the model on the full dataset. Considering the variety of dialogue attributes, we further design a multi-view enhancement mechanism, including multi-view selection and inter-view fusion. It groups the high-quality samples from multiple perspectives, respectively, and enhances different attributes of responses with the corresponding sub-sets and adapters, keeping knowledge independent and allowing flexible integration. Empirical results and analysis show that our framework outperforms the state-of-the-art data filtering methods significantly in terms of enhancing dialogue attributes and fusing view-specific knowledge.

Original language	English
Article number	110833
Journal	Knowledge-Based Systems
Volume	277
DOIs	https://doi.org/10.1016/j.knosys.2023.110833
Publication status	Published - 9 Oct 2023

Keywords

Data filtering
Dialogue attributes
Open-domain dialogue generation

Access to Document

10.1016/j.knosys.2023.110833

Cite this

@article{db973ed0e52049638362f332bb90c818,

title = "Stop filtering: Multi-view attribute-enhanced dialogue learning",

abstract = "There is a growing interest in improving the conversational ability of models by filtering dialogue corpora. Previous filtering strategies rely on a scoring method to assess and discard samples from one perspective, enabling the model to enhance the corresponding dialogue attributes (e.g., consistency). However, the discarded samples may achieve high scores in other perspectives and can also provide regularization effects on the model learning. In this work, we propose a multi-view attribute-enhanced dialogue learning framework that can capture the attribute-related features steadily and comprehensively. Instead of filtering the raw dataset, our framework introduces adapters to learn knowledge from the attribute-related sub-sets after pre-training the model on the full dataset. Considering the variety of dialogue attributes, we further design a multi-view enhancement mechanism, including multi-view selection and inter-view fusion. It groups the high-quality samples from multiple perspectives, respectively, and enhances different attributes of responses with the corresponding sub-sets and adapters, keeping knowledge independent and allowing flexible integration. Empirical results and analysis show that our framework outperforms the state-of-the-art data filtering methods significantly in terms of enhancing dialogue attributes and fusing view-specific knowledge.",

keywords = "Data filtering, Dialogue attributes, Open-domain dialogue generation",

author = "Yiwei Li and Bin Sun and Shaoxiong Feng and Kan Li",

note = "Publisher Copyright: {\textcopyright} 2023 Elsevier B.V.",

year = "2023",

month = oct,

day = "9",

doi = "10.1016/j.knosys.2023.110833",

language = "English",

volume = "277",

journal = "Knowledge-Based Systems",

issn = "0950-7051",

publisher = "Elsevier B.V.",

}

TY - JOUR

T1 - Stop filtering

T2 - Multi-view attribute-enhanced dialogue learning

AU - Li, Yiwei

AU - Sun, Bin

AU - Feng, Shaoxiong

AU - Li, Kan

PY - 2023/10/9

Y1 - 2023/10/9

N2 - There is a growing interest in improving the conversational ability of models by filtering dialogue corpora. Previous filtering strategies rely on a scoring method to assess and discard samples from one perspective, enabling the model to enhance the corresponding dialogue attributes (e.g., consistency). However, the discarded samples may achieve high scores in other perspectives and can also provide regularization effects on the model learning. In this work, we propose a multi-view attribute-enhanced dialogue learning framework that can capture the attribute-related features steadily and comprehensively. Instead of filtering the raw dataset, our framework introduces adapters to learn knowledge from the attribute-related sub-sets after pre-training the model on the full dataset. Considering the variety of dialogue attributes, we further design a multi-view enhancement mechanism, including multi-view selection and inter-view fusion. It groups the high-quality samples from multiple perspectives, respectively, and enhances different attributes of responses with the corresponding sub-sets and adapters, keeping knowledge independent and allowing flexible integration. Empirical results and analysis show that our framework outperforms the state-of-the-art data filtering methods significantly in terms of enhancing dialogue attributes and fusing view-specific knowledge.

AB - There is a growing interest in improving the conversational ability of models by filtering dialogue corpora. Previous filtering strategies rely on a scoring method to assess and discard samples from one perspective, enabling the model to enhance the corresponding dialogue attributes (e.g., consistency). However, the discarded samples may achieve high scores in other perspectives and can also provide regularization effects on the model learning. In this work, we propose a multi-view attribute-enhanced dialogue learning framework that can capture the attribute-related features steadily and comprehensively. Instead of filtering the raw dataset, our framework introduces adapters to learn knowledge from the attribute-related sub-sets after pre-training the model on the full dataset. Considering the variety of dialogue attributes, we further design a multi-view enhancement mechanism, including multi-view selection and inter-view fusion. It groups the high-quality samples from multiple perspectives, respectively, and enhances different attributes of responses with the corresponding sub-sets and adapters, keeping knowledge independent and allowing flexible integration. Empirical results and analysis show that our framework outperforms the state-of-the-art data filtering methods significantly in terms of enhancing dialogue attributes and fusing view-specific knowledge.

KW - Data filtering

KW - Dialogue attributes

KW - Open-domain dialogue generation

UR - http://www.scopus.com/inward/record.url?scp=85166650993&partnerID=8YFLogxK

U2 - 10.1016/j.knosys.2023.110833

DO - 10.1016/j.knosys.2023.110833

M3 - Article

AN - SCOPUS:85166650993

SN - 0950-7051

VL - 277

JO - Knowledge-Based Systems

JF - Knowledge-Based Systems

M1 - 110833

ER -

Stop filtering: Multi-view attribute-enhanced dialogue learning

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this