Scaling Up Maximal k-plex Enumeration

Qiangqiang Dai; Rong Hua Li; Hongchao Qin; Meihao Liao; Guoren Wang

doi:10.1145/3511808.3557444

Scaling Up Maximal k-plex Enumeration

Qiangqiang Dai, Rong Hua Li, Hongchao Qin, Meihao Liao, Guoren Wang

计算机学院

Beijing Institute of Technology

科研成果: 书/报告/会议事项章节 › 会议稿件 › 同行评审

6 引用（Scopus）

摘要

Finding all maximal k-plexes on networks is a fundamental research problem in graph analysis due to many important applications, such as community detection, biological graph analysis, and so on. A k-plex is a subgraph in which every vertex is adjacent to all but at most k vertices within the subgraph. In this paper, we study the problem of enumerating all large maximal k-plexes of a graph and develop several new and efficient techniques to solve the problem. Specifically, we first propose several novel upper-bounding techniques to prune unnecessary computations during the enumeration procedure. We show that the proposed upper bounds can be computed in linear time. Then, we develop a new branch-and-bound algorithm with a carefully-designed pivot re-selection strategy to enumerate all k-plexes, which outputs all k-plexes in O(n2?kn) time theoretically, where n is the number of vertices of the graph and ? k is strictly smaller than 2. In addition, a parallel version of the proposed algorithm is further developed to scale up to process large real-world graphs. Finally, extensive experimental results show that the proposed sequential algorithm can achieve up to 2× to 100× speedup over the state-of-the-art sequential algorithms on most benchmark graphs. The results also demonstrate the high scalability of the proposed parallel algorithm. For example, on a large real-world graph with more than 200 million edges, our parallel algorithm can finish the computation within two minutes, while the state-of-the-art parallel algorithm cannot terminate within 24 hours.

源语言	英语
主期刊名	CIKM 2022 - Proceedings of the 31st ACM International Conference on Information and Knowledge Management
出版商	Association for Computing Machinery
页	345-354
页数	10
ISBN（电子版）	9781450392365
DOI	https://doi.org/10.1145/3511808.3557444
出版状态	已出版 - 17 10月 2022
活动	31st ACM International Conference on Information and Knowledge Management, CIKM 2022 - Atlanta, 美国期限: 17 10月 2022 → 21 10月 2022

出版系列

姓名	International Conference on Information and Knowledge Management, Proceedings

会议

会议	31st ACM International Conference on Information and Knowledge Management, CIKM 2022
国家/地区	美国
市	Atlanta
时期	17/10/22 → 21/10/22

访问文件

10.1145/3511808.3557444

其它文件与链接

链接到 Scopus 的出版物

引用此

Dai, Q., Li, R. H., Qin, H., Liao, M., & Wang, G. (2022). Scaling Up Maximal k-plex Enumeration. 在 CIKM 2022 - Proceedings of the 31st ACM International Conference on Information and Knowledge Management (页码 345-354). (International Conference on Information and Knowledge Management, Proceedings). Association for Computing Machinery. https://doi.org/10.1145/3511808.3557444

@inproceedings{5aeee3c7257b4049943588c355f51622,

title = "Scaling Up Maximal k-plex Enumeration",

abstract = "Finding all maximal k-plexes on networks is a fundamental research problem in graph analysis due to many important applications, such as community detection, biological graph analysis, and so on. A k-plex is a subgraph in which every vertex is adjacent to all but at most k vertices within the subgraph. In this paper, we study the problem of enumerating all large maximal k-plexes of a graph and develop several new and efficient techniques to solve the problem. Specifically, we first propose several novel upper-bounding techniques to prune unnecessary computations during the enumeration procedure. We show that the proposed upper bounds can be computed in linear time. Then, we develop a new branch-and-bound algorithm with a carefully-designed pivot re-selection strategy to enumerate all k-plexes, which outputs all k-plexes in O(n2?kn) time theoretically, where n is the number of vertices of the graph and ? k is strictly smaller than 2. In addition, a parallel version of the proposed algorithm is further developed to scale up to process large real-world graphs. Finally, extensive experimental results show that the proposed sequential algorithm can achieve up to 2× to 100× speedup over the state-of-the-art sequential algorithms on most benchmark graphs. The results also demonstrate the high scalability of the proposed parallel algorithm. For example, on a large real-world graph with more than 200 million edges, our parallel algorithm can finish the computation within two minutes, while the state-of-the-art parallel algorithm cannot terminate within 24 hours.",

keywords = "branch-and-bound enumeration, cohesive subgragh mining, maximal k-plex",

author = "Qiangqiang Dai and Li, {Rong Hua} and Hongchao Qin and Meihao Liao and Guoren Wang",

note = "Publisher Copyright: {\textcopyright} 2022 ACM.; 31st ACM International Conference on Information and Knowledge Management, CIKM 2022 ; Conference date: 17-10-2022 Through 21-10-2022",

year = "2022",

month = oct,

day = "17",

doi = "10.1145/3511808.3557444",

language = "English",

series = "International Conference on Information and Knowledge Management, Proceedings",

publisher = "Association for Computing Machinery",

pages = "345--354",

booktitle = "CIKM 2022 - Proceedings of the 31st ACM International Conference on Information and Knowledge Management",

}

Dai, Q, Li, RH, Qin, H, Liao, M & Wang, G 2022, Scaling Up Maximal k-plex Enumeration. 在 CIKM 2022 - Proceedings of the 31st ACM International Conference on Information and Knowledge Management. International Conference on Information and Knowledge Management, Proceedings, Association for Computing Machinery, 页码 345-354, 31st ACM International Conference on Information and Knowledge Management, CIKM 2022, Atlanta, 美国, 17/10/22. https://doi.org/10.1145/3511808.3557444

Scaling Up Maximal k-plex Enumeration. / Dai, Qiangqiang; Li, Rong Hua; Qin, Hongchao 等.
CIKM 2022 - Proceedings of the 31st ACM International Conference on Information and Knowledge Management. Association for Computing Machinery, 2022. 页码 345-354 (International Conference on Information and Knowledge Management, Proceedings).

科研成果: 书/报告/会议事项章节 › 会议稿件 › 同行评审

TY - GEN

T1 - Scaling Up Maximal k-plex Enumeration

AU - Dai, Qiangqiang

AU - Li, Rong Hua

AU - Qin, Hongchao

AU - Liao, Meihao

AU - Wang, Guoren

PY - 2022/10/17

Y1 - 2022/10/17

N2 - Finding all maximal k-plexes on networks is a fundamental research problem in graph analysis due to many important applications, such as community detection, biological graph analysis, and so on. A k-plex is a subgraph in which every vertex is adjacent to all but at most k vertices within the subgraph. In this paper, we study the problem of enumerating all large maximal k-plexes of a graph and develop several new and efficient techniques to solve the problem. Specifically, we first propose several novel upper-bounding techniques to prune unnecessary computations during the enumeration procedure. We show that the proposed upper bounds can be computed in linear time. Then, we develop a new branch-and-bound algorithm with a carefully-designed pivot re-selection strategy to enumerate all k-plexes, which outputs all k-plexes in O(n2?kn) time theoretically, where n is the number of vertices of the graph and ? k is strictly smaller than 2. In addition, a parallel version of the proposed algorithm is further developed to scale up to process large real-world graphs. Finally, extensive experimental results show that the proposed sequential algorithm can achieve up to 2× to 100× speedup over the state-of-the-art sequential algorithms on most benchmark graphs. The results also demonstrate the high scalability of the proposed parallel algorithm. For example, on a large real-world graph with more than 200 million edges, our parallel algorithm can finish the computation within two minutes, while the state-of-the-art parallel algorithm cannot terminate within 24 hours.

AB - Finding all maximal k-plexes on networks is a fundamental research problem in graph analysis due to many important applications, such as community detection, biological graph analysis, and so on. A k-plex is a subgraph in which every vertex is adjacent to all but at most k vertices within the subgraph. In this paper, we study the problem of enumerating all large maximal k-plexes of a graph and develop several new and efficient techniques to solve the problem. Specifically, we first propose several novel upper-bounding techniques to prune unnecessary computations during the enumeration procedure. We show that the proposed upper bounds can be computed in linear time. Then, we develop a new branch-and-bound algorithm with a carefully-designed pivot re-selection strategy to enumerate all k-plexes, which outputs all k-plexes in O(n2?kn) time theoretically, where n is the number of vertices of the graph and ? k is strictly smaller than 2. In addition, a parallel version of the proposed algorithm is further developed to scale up to process large real-world graphs. Finally, extensive experimental results show that the proposed sequential algorithm can achieve up to 2× to 100× speedup over the state-of-the-art sequential algorithms on most benchmark graphs. The results also demonstrate the high scalability of the proposed parallel algorithm. For example, on a large real-world graph with more than 200 million edges, our parallel algorithm can finish the computation within two minutes, while the state-of-the-art parallel algorithm cannot terminate within 24 hours.

KW - branch-and-bound enumeration

KW - cohesive subgragh mining

KW - maximal k-plex

UR - http://www.scopus.com/inward/record.url?scp=85140844856&partnerID=8YFLogxK

U2 - 10.1145/3511808.3557444

DO - 10.1145/3511808.3557444

M3 - Conference contribution

AN - SCOPUS:85140844856

T3 - International Conference on Information and Knowledge Management, Proceedings

SP - 345

EP - 354

BT - CIKM 2022 - Proceedings of the 31st ACM International Conference on Information and Knowledge Management

PB - Association for Computing Machinery

T2 - 31st ACM International Conference on Information and Knowledge Management, CIKM 2022

Y2 - 17 October 2022 through 21 October 2022

ER -

Scaling Up Maximal k-plex Enumeration

摘要

出版系列

会议

访问文件

其它文件与链接

指纹

引用此