跳到主要导航 跳到搜索 跳到主要内容

Learning Interpretable Rules for Scalable Data Representation and Classification

  • Zhuo Wang
  • , Wei Zhang*
  • , Ning Liu
  • , Jianyong Wang*
  • *此作品的通讯作者
  • Tsinghua University
  • Ant Group
  • East China Normal University
  • Shandong University

科研成果: 期刊稿件文章同行评审

摘要

Rule-based models, e.g., decision trees, are widely used in scenarios demanding high model interpretability for their transparent inner structures and good model expressivity. However, rule-based models are hard to optimize, especially on large data sets, due to their discrete parameters and structures. Ensemble methods and fuzzy/soft rules are commonly used to improve performance, but they sacrifice the model interpretability. To obtain both good scalability and interpretability, we propose a new classifier, named Rule-based Representation Learner (RRL), that automatically learns interpretable non-fuzzy rules for data representation and classification. To train the non-differentiable RRL effectively, we project it to a continuous space and propose a novel training method, called Gradient Grafting, that can directly optimize the discrete model using gradient descent. A novel design of logical activation functions is also devised to increase the scalability of RRL and enable it to discretize the continuous features end-to-end. Exhaustive experiments on ten small and four large data sets show that RRL outperforms the competitive interpretable approaches and can be easily adjusted to obtain a trade-off between classification accuracy and model complexity for different scenarios.

源语言英语
页(从-至)1121-1133
页数13
期刊IEEE Transactions on Pattern Analysis and Machine Intelligence
46
2
DOI
出版状态已出版 - 1 2月 2024
已对外发布

指纹

探究 'Learning Interpretable Rules for Scalable Data Representation and Classification' 的科研主题。它们共同构成独一无二的指纹。

引用此