Scalable and Effective Conductance-Based Graph Clustering

Longlong Lin; Rong Hua Li; Tao Jia

Scalable and Effective Conductance-Based Graph Clustering

Longlong Lin, Rong Hua Li, Tao Jia

计算机学院

Southwest University

科研成果: 书/报告/会议事项章节 › 会议稿件 › 同行评审

7 引用（Scopus）

摘要

Conductance-based graph clustering has been recognized as a fundamental operator in numerous graph analysis applications. Despite the significant success of conductance-based graph clustering, existing algorithms are either hard to obtain satisfactory clustering qualities, or have high time and space complexity to achieve provable clustering qualities. To overcome these limitations, we devise a powerful peeling-based graph clustering framework PCon. We show that many existing solutions can be reduced to our framework. Namely, they first define a score function for each vertex, then iteratively remove the vertex with the smallest score. Finally, they output the result with the smallest conductance during the peeling process. Based on our framework, we propose two novel algorithms PCon core and PCon de with linear time and space complexity, which can efficiently and effectively identify clusters from massive graphs with more than a few billion edges. Surprisingly, we prove that PCon de can identify clusters with near-constant approximation ratio, resulting in an important theoretical improvement over the well-known quadratic Cheeger bound. Empirical results on real-life and synthetic datasets show that our algorithms can achieve 5∼42 times speedup with a high clustering accuracy, while using 1.4∼7.8 times less memory than the baseline algorithms.

源语言	英语
主期刊名	AAAI-23 Technical Tracks 4
编辑	Brian Williams, Yiling Chen, Jennifer Neville
出版商	AAAI press
页	4471-4478
页数	8
ISBN（电子版）	9781577358800
出版状态	已出版 - 27 6月 2023
活动	37th AAAI Conference on Artificial Intelligence, AAAI 2023 - Washington, 美国期限: 7 2月 2023 → 14 2月 2023

出版系列

姓名	Proceedings of the 37th AAAI Conference on Artificial Intelligence, AAAI 2023
卷	37

会议

会议	37th AAAI Conference on Artificial Intelligence, AAAI 2023
国家/地区	美国
市	Washington
时期	7/02/23 → 14/02/23

其它文件与链接

链接到 Scopus 的出版物

引用此

@inproceedings{9564f59e89874a4eb3a1b3f0c36b4c6e,

title = "Scalable and Effective Conductance-Based Graph Clustering",

abstract = "Conductance-based graph clustering has been recognized as a fundamental operator in numerous graph analysis applications. Despite the significant success of conductance-based graph clustering, existing algorithms are either hard to obtain satisfactory clustering qualities, or have high time and space complexity to achieve provable clustering qualities. To overcome these limitations, we devise a powerful peeling-based graph clustering framework PCon. We show that many existing solutions can be reduced to our framework. Namely, they first define a score function for each vertex, then iteratively remove the vertex with the smallest score. Finally, they output the result with the smallest conductance during the peeling process. Based on our framework, we propose two novel algorithms PCon core and PCon de with linear time and space complexity, which can efficiently and effectively identify clusters from massive graphs with more than a few billion edges. Surprisingly, we prove that PCon de can identify clusters with near-constant approximation ratio, resulting in an important theoretical improvement over the well-known quadratic Cheeger bound. Empirical results on real-life and synthetic datasets show that our algorithms can achieve 5∼42 times speedup with a high clustering accuracy, while using 1.4∼7.8 times less memory than the baseline algorithms.",

author = "Longlong Lin and Li, {Rong Hua} and Tao Jia",

note = "Publisher Copyright: Copyright {\textcopyright} 2023, Association for the Advancement of Artificial Intelligence (www.aaai.org). All rights reserved.; 37th AAAI Conference on Artificial Intelligence, AAAI 2023 ; Conference date: 07-02-2023 Through 14-02-2023",

year = "2023",

month = jun,

day = "27",

language = "English",

series = "Proceedings of the 37th AAAI Conference on Artificial Intelligence, AAAI 2023",

publisher = "AAAI press",

pages = "4471--4478",

editor = "Brian Williams and Yiling Chen and Jennifer Neville",

booktitle = "AAAI-23 Technical Tracks 4",

}

Scalable and Effective Conductance-Based Graph Clustering. / Lin, Longlong; Li, Rong Hua; Jia, Tao.
AAAI-23 Technical Tracks 4. 编辑 / Brian Williams; Yiling Chen; Jennifer Neville. AAAI press, 2023. 页码 4471-4478 (Proceedings of the 37th AAAI Conference on Artificial Intelligence, AAAI 2023; 卷 37).

科研成果: 书/报告/会议事项章节 › 会议稿件 › 同行评审

TY - GEN

T1 - Scalable and Effective Conductance-Based Graph Clustering

AU - Lin, Longlong

AU - Li, Rong Hua

AU - Jia, Tao

PY - 2023/6/27

Y1 - 2023/6/27

N2 - Conductance-based graph clustering has been recognized as a fundamental operator in numerous graph analysis applications. Despite the significant success of conductance-based graph clustering, existing algorithms are either hard to obtain satisfactory clustering qualities, or have high time and space complexity to achieve provable clustering qualities. To overcome these limitations, we devise a powerful peeling-based graph clustering framework PCon. We show that many existing solutions can be reduced to our framework. Namely, they first define a score function for each vertex, then iteratively remove the vertex with the smallest score. Finally, they output the result with the smallest conductance during the peeling process. Based on our framework, we propose two novel algorithms PCon core and PCon de with linear time and space complexity, which can efficiently and effectively identify clusters from massive graphs with more than a few billion edges. Surprisingly, we prove that PCon de can identify clusters with near-constant approximation ratio, resulting in an important theoretical improvement over the well-known quadratic Cheeger bound. Empirical results on real-life and synthetic datasets show that our algorithms can achieve 5∼42 times speedup with a high clustering accuracy, while using 1.4∼7.8 times less memory than the baseline algorithms.

AB - Conductance-based graph clustering has been recognized as a fundamental operator in numerous graph analysis applications. Despite the significant success of conductance-based graph clustering, existing algorithms are either hard to obtain satisfactory clustering qualities, or have high time and space complexity to achieve provable clustering qualities. To overcome these limitations, we devise a powerful peeling-based graph clustering framework PCon. We show that many existing solutions can be reduced to our framework. Namely, they first define a score function for each vertex, then iteratively remove the vertex with the smallest score. Finally, they output the result with the smallest conductance during the peeling process. Based on our framework, we propose two novel algorithms PCon core and PCon de with linear time and space complexity, which can efficiently and effectively identify clusters from massive graphs with more than a few billion edges. Surprisingly, we prove that PCon de can identify clusters with near-constant approximation ratio, resulting in an important theoretical improvement over the well-known quadratic Cheeger bound. Empirical results on real-life and synthetic datasets show that our algorithms can achieve 5∼42 times speedup with a high clustering accuracy, while using 1.4∼7.8 times less memory than the baseline algorithms.

UR - http://www.scopus.com/inward/record.url?scp=85167870687&partnerID=8YFLogxK

M3 - Conference contribution

AN - SCOPUS:85167870687

T3 - Proceedings of the 37th AAAI Conference on Artificial Intelligence, AAAI 2023

SP - 4471

EP - 4478

BT - AAAI-23 Technical Tracks 4

A2 - Williams, Brian

A2 - Chen, Yiling

A2 - Neville, Jennifer

PB - AAAI press

T2 - 37th AAAI Conference on Artificial Intelligence, AAAI 2023

Y2 - 7 February 2023 through 14 February 2023

ER -

Scalable and Effective Conductance-Based Graph Clustering

摘要

出版系列

会议

其它文件与链接

指纹

引用此