Coresets for fast causal discovery with the additive noise model

Boxiang Zhao, Shuliang Wang*, Lianhua Chi, Hanning Yuan, Ye Yuan, Qi Li, Jing Geng, Shao Liang Zhang

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

1 Citation (Scopus)

Abstract

Causal discovery reveals the true causal relationships behind data and discovering causal relationships from observed data is a particularly challenging problem, especially in large-scale datasets. The functional causal model is an effective method for causal discovery, but its time efficiency cannot be guaranteed. How to efficiently apply it to massive data still needs to be solved. In this paper, we propose a coreset construction for the additive noise model to accelerate causal discovery. According to the asymmetry characteristic of causality, samples were assigned different weights to construct the coreset. With the constructed coreset, we propose a Fast causal discovery algorithm based on the Additive Noise Model (FANM) to improve the time efficiency of the functional causal model while ensuring the result performance of causal discovery. Experiments on synthetic data and real-world data show that our proposed algorithm is much more time-efficient than the methods based on the functional causal model, and the runtime of FANM remains consistent as sample size increases while maintaining or exceeding the accuracy of the original nonlinear additive noise model.

Original languageEnglish
Article number110149
JournalPattern Recognition
Volume148
DOIs
Publication statusPublished - Apr 2024

Keywords

  • Additive noise model
  • Big data
  • Causal discovery
  • Coresets
  • Functional causal model

Fingerprint

Dive into the research topics of 'Coresets for fast causal discovery with the additive noise model'. Together they form a unique fingerprint.

Cite this