ARM-Net: Adaptive Relation Modeling Network for Structured Data

Shaofeng Cai; Kaiping Zheng; Gang Chen; H. V. Jagadish; Beng Chin Ooi; Meihui Zhang

doi:10.1145/3448016.3457321

ARM-Net: Adaptive Relation Modeling Network for Structured Data

Shaofeng Cai, Kaiping Zheng, Gang Chen, H. V. Jagadish, Beng Chin Ooi, Meihui Zhang

National University of Singapore

科研成果: 期刊稿件 › 会议文章 › 同行评审

29 引用（Scopus）

摘要

Relational databases are the de facto standard for storing and querying structured data, and extracting insights from structured data requires advanced analytics. Deep neural networks (DNNs) have achieved super-human prediction performance in particular data types, e.g., images. However, existing DNNs may not produce meaningful results when applied to structured data. The reason is that there are correlations and dependencies across combinations of attribute values in a table, and these do not follow simple additive patterns that can be easily mimicked by a DNN. The number of possible such cross features is combinatorial, making them computationally prohibitive to model. Furthermore, the deployment of learning models in real-world applications has also highlighted the need for interpretability, especially for high-stakes applications, which remains another issue of concern to DNNs. In this paper, we present ARM-Net, an adaptive relation modeling network tailored for structured data, and a lightweight framework ARMOR based on ARM-Net for relational data analytics. The key idea is to model feature interactions with cross features selectively and dynamically, by first transforming the input features into exponential space, and then determining the interaction order and interaction weights adaptively for each cross feature. We propose a novel sparse attention mechanism to dynamically generate the interaction weights given the input tuple, so that we can explicitly model cross features of arbitrary orders with noisy features filtered selectively. Then during model inference, ARM-Net can specify the cross features being used for each prediction for higher accuracy and better interpretability. Our extensive experiments on real-world datasets demonstrate that ARM-Net consistently outperforms existing models and provides more interpretable predictions for data-driven decision making.

源语言	英语
页（从-至）	207-220
页数	14
期刊	Proceedings of the ACM SIGMOD International Conference on Management of Data
DOI	https://doi.org/10.1145/3448016.3457321
出版状态	已出版 - 2021
已对外发布	是
活动	2021 International Conference on Management of Data, SIGMOD 2021 - Virtual, Online, 中国期限: 20 6月 2021 → 25 6月 2021

访问文件

10.1145/3448016.3457321

其它文件与链接

链接到 Scopus 的出版物

引用此

@article{e9326124cf2644bca27c6f310ee7171c,

title = "ARM-Net: Adaptive Relation Modeling Network for Structured Data",

abstract = "Relational databases are the de facto standard for storing and querying structured data, and extracting insights from structured data requires advanced analytics. Deep neural networks (DNNs) have achieved super-human prediction performance in particular data types, e.g., images. However, existing DNNs may not produce meaningful results when applied to structured data. The reason is that there are correlations and dependencies across combinations of attribute values in a table, and these do not follow simple additive patterns that can be easily mimicked by a DNN. The number of possible such cross features is combinatorial, making them computationally prohibitive to model. Furthermore, the deployment of learning models in real-world applications has also highlighted the need for interpretability, especially for high-stakes applications, which remains another issue of concern to DNNs. In this paper, we present ARM-Net, an adaptive relation modeling network tailored for structured data, and a lightweight framework ARMOR based on ARM-Net for relational data analytics. The key idea is to model feature interactions with cross features selectively and dynamically, by first transforming the input features into exponential space, and then determining the interaction order and interaction weights adaptively for each cross feature. We propose a novel sparse attention mechanism to dynamically generate the interaction weights given the input tuple, so that we can explicitly model cross features of arbitrary orders with noisy features filtered selectively. Then during model inference, ARM-Net can specify the cross features being used for each prediction for higher accuracy and better interpretability. Our extensive experiments on real-world datasets demonstrate that ARM-Net consistently outperforms existing models and provides more interpretable predictions for data-driven decision making.",

keywords = "feature importance, feature interaction, interpretability, multi-head gated attention, neural networks, structured data",

author = "Shaofeng Cai and Kaiping Zheng and Gang Chen and Jagadish, {H. V.} and Ooi, {Beng Chin} and Meihui Zhang",

note = "Publisher Copyright: {\textcopyright} 2021 ACM.; 2021 International Conference on Management of Data, SIGMOD 2021 ; Conference date: 20-06-2021 Through 25-06-2021",

year = "2021",

doi = "10.1145/3448016.3457321",

language = "English",

pages = "207--220",

journal = "Proceedings of the ACM SIGMOD International Conference on Management of Data",

issn = "0730-8078",

}

TY - JOUR

T1 - ARM-Net

T2 - 2021 International Conference on Management of Data, SIGMOD 2021

AU - Cai, Shaofeng

AU - Zheng, Kaiping

AU - Chen, Gang

AU - Jagadish, H. V.

AU - Ooi, Beng Chin

AU - Zhang, Meihui

PY - 2021

Y1 - 2021

N2 - Relational databases are the de facto standard for storing and querying structured data, and extracting insights from structured data requires advanced analytics. Deep neural networks (DNNs) have achieved super-human prediction performance in particular data types, e.g., images. However, existing DNNs may not produce meaningful results when applied to structured data. The reason is that there are correlations and dependencies across combinations of attribute values in a table, and these do not follow simple additive patterns that can be easily mimicked by a DNN. The number of possible such cross features is combinatorial, making them computationally prohibitive to model. Furthermore, the deployment of learning models in real-world applications has also highlighted the need for interpretability, especially for high-stakes applications, which remains another issue of concern to DNNs. In this paper, we present ARM-Net, an adaptive relation modeling network tailored for structured data, and a lightweight framework ARMOR based on ARM-Net for relational data analytics. The key idea is to model feature interactions with cross features selectively and dynamically, by first transforming the input features into exponential space, and then determining the interaction order and interaction weights adaptively for each cross feature. We propose a novel sparse attention mechanism to dynamically generate the interaction weights given the input tuple, so that we can explicitly model cross features of arbitrary orders with noisy features filtered selectively. Then during model inference, ARM-Net can specify the cross features being used for each prediction for higher accuracy and better interpretability. Our extensive experiments on real-world datasets demonstrate that ARM-Net consistently outperforms existing models and provides more interpretable predictions for data-driven decision making.

AB - Relational databases are the de facto standard for storing and querying structured data, and extracting insights from structured data requires advanced analytics. Deep neural networks (DNNs) have achieved super-human prediction performance in particular data types, e.g., images. However, existing DNNs may not produce meaningful results when applied to structured data. The reason is that there are correlations and dependencies across combinations of attribute values in a table, and these do not follow simple additive patterns that can be easily mimicked by a DNN. The number of possible such cross features is combinatorial, making them computationally prohibitive to model. Furthermore, the deployment of learning models in real-world applications has also highlighted the need for interpretability, especially for high-stakes applications, which remains another issue of concern to DNNs. In this paper, we present ARM-Net, an adaptive relation modeling network tailored for structured data, and a lightweight framework ARMOR based on ARM-Net for relational data analytics. The key idea is to model feature interactions with cross features selectively and dynamically, by first transforming the input features into exponential space, and then determining the interaction order and interaction weights adaptively for each cross feature. We propose a novel sparse attention mechanism to dynamically generate the interaction weights given the input tuple, so that we can explicitly model cross features of arbitrary orders with noisy features filtered selectively. Then during model inference, ARM-Net can specify the cross features being used for each prediction for higher accuracy and better interpretability. Our extensive experiments on real-world datasets demonstrate that ARM-Net consistently outperforms existing models and provides more interpretable predictions for data-driven decision making.

KW - feature importance

KW - feature interaction

KW - interpretability

KW - multi-head gated attention

KW - neural networks

KW - structured data

UR - http://www.scopus.com/inward/record.url?scp=85108977020&partnerID=8YFLogxK

U2 - 10.1145/3448016.3457321

DO - 10.1145/3448016.3457321

M3 - Conference article

AN - SCOPUS:85108977020

SN - 0730-8078

SP - 207

EP - 220

JO - Proceedings of the ACM SIGMOD International Conference on Management of Data

JF - Proceedings of the ACM SIGMOD International Conference on Management of Data

Y2 - 20 June 2021 through 25 June 2021

ER -

ARM-Net: Adaptive Relation Modeling Network for Structured Data

摘要

访问文件

其它文件与链接

指纹

引用此