HGE-BVHD: Heterogeneous graph embedding scheme of complex structure functions for binary vulnerability homology discrimination

Jiyuan Xing; Senlin Luo; Limin Pan; Jingwei Hao; Yingdan Guan; Zhouting Wu

doi:10.1016/j.eswa.2023.121835

HGE-BVHD: Heterogeneous graph embedding scheme of complex structure functions for binary vulnerability homology discrimination

Jiyuan Xing^*, Senlin Luo, Limin Pan, Jingwei Hao, Yingdan Guan, Zhouting Wu

^*此作品的通讯作者

信息与电子学院

Beijing Institute of Technology

科研成果: 期刊稿件 › 文献综述 › 同行评审

2 引用（Scopus）

摘要

Homologous vulnerability detection is an important aspect of computer security. It has several key problems, including discriminating structurally complex functions, supporting cross-architecture programs, distinguishing false positives, etc. Non-homologous functions with similar control flow graph structures are easily misjudged, which decreases discrimination accuracy. The vectors generated by instruction-embedding models contain architectural features, which increases the distance between homologous function vectors and leads to misclassification. In this paper, we propose a novel heterogeneous graph embedding (HGE) binary vulnerability homology discrimination (BVHD) method. HGE is used to aggregate basic block features to generate function representations, perform different transformations according to control flow and data flow, and improve the discrimination of non-homologous functions to increase discrimination accuracy. A novel multi-architecture instruction-embedding model is proposed for abstracting common semantic features and weakening the interference of architectural features to avoid misclassification. The experimental results show that the proposed method achieves state-of-the-art results in homologous function discrimination, and the upgrade is significant for complex structure functions.

源语言	英语
文章编号	121835
期刊	Expert Systems with Applications
卷	238
DOI	https://doi.org/10.1016/j.eswa.2023.121835
出版状态	已出版 - 15 3月 2024

访问文件

10.1016/j.eswa.2023.121835

其它文件与链接

链接到 Scopus 的出版物

引用此

Xing, J., Luo, S., Pan, L., Hao, J., Guan, Y., & Wu, Z. (2024). HGE-BVHD: Heterogeneous graph embedding scheme of complex structure functions for binary vulnerability homology discrimination. Expert Systems with Applications, 238, 文章 121835. https://doi.org/10.1016/j.eswa.2023.121835

@article{f2171bd6802143f0bf2af722efced17a,

title = "HGE-BVHD: Heterogeneous graph embedding scheme of complex structure functions for binary vulnerability homology discrimination",

abstract = "Homologous vulnerability detection is an important aspect of computer security. It has several key problems, including discriminating structurally complex functions, supporting cross-architecture programs, distinguishing false positives, etc. Non-homologous functions with similar control flow graph structures are easily misjudged, which decreases discrimination accuracy. The vectors generated by instruction-embedding models contain architectural features, which increases the distance between homologous function vectors and leads to misclassification. In this paper, we propose a novel heterogeneous graph embedding (HGE) binary vulnerability homology discrimination (BVHD) method. HGE is used to aggregate basic block features to generate function representations, perform different transformations according to control flow and data flow, and improve the discrimination of non-homologous functions to increase discrimination accuracy. A novel multi-architecture instruction-embedding model is proposed for abstracting common semantic features and weakening the interference of architectural features to avoid misclassification. The experimental results show that the proposed method achieves state-of-the-art results in homologous function discrimination, and the upgrade is significant for complex structure functions.",

keywords = "Binary code, Heterogeneous graph embedding, Homology vulnerability discrimination, Multi-architecture instruction embedding",

author = "Jiyuan Xing and Senlin Luo and Limin Pan and Jingwei Hao and Yingdan Guan and Zhouting Wu",

note = "Publisher Copyright: {\textcopyright} 2023 Elsevier Ltd",

year = "2024",

month = mar,

day = "15",

doi = "10.1016/j.eswa.2023.121835",

language = "English",

volume = "238",

journal = "Expert Systems with Applications",

issn = "0957-4174",

publisher = "Elsevier Ltd.",

}

TY - JOUR

T1 - HGE-BVHD

T2 - Heterogeneous graph embedding scheme of complex structure functions for binary vulnerability homology discrimination

AU - Xing, Jiyuan

AU - Luo, Senlin

AU - Pan, Limin

AU - Hao, Jingwei

AU - Guan, Yingdan

AU - Wu, Zhouting

PY - 2024/3/15

Y1 - 2024/3/15

N2 - Homologous vulnerability detection is an important aspect of computer security. It has several key problems, including discriminating structurally complex functions, supporting cross-architecture programs, distinguishing false positives, etc. Non-homologous functions with similar control flow graph structures are easily misjudged, which decreases discrimination accuracy. The vectors generated by instruction-embedding models contain architectural features, which increases the distance between homologous function vectors and leads to misclassification. In this paper, we propose a novel heterogeneous graph embedding (HGE) binary vulnerability homology discrimination (BVHD) method. HGE is used to aggregate basic block features to generate function representations, perform different transformations according to control flow and data flow, and improve the discrimination of non-homologous functions to increase discrimination accuracy. A novel multi-architecture instruction-embedding model is proposed for abstracting common semantic features and weakening the interference of architectural features to avoid misclassification. The experimental results show that the proposed method achieves state-of-the-art results in homologous function discrimination, and the upgrade is significant for complex structure functions.

AB - Homologous vulnerability detection is an important aspect of computer security. It has several key problems, including discriminating structurally complex functions, supporting cross-architecture programs, distinguishing false positives, etc. Non-homologous functions with similar control flow graph structures are easily misjudged, which decreases discrimination accuracy. The vectors generated by instruction-embedding models contain architectural features, which increases the distance between homologous function vectors and leads to misclassification. In this paper, we propose a novel heterogeneous graph embedding (HGE) binary vulnerability homology discrimination (BVHD) method. HGE is used to aggregate basic block features to generate function representations, perform different transformations according to control flow and data flow, and improve the discrimination of non-homologous functions to increase discrimination accuracy. A novel multi-architecture instruction-embedding model is proposed for abstracting common semantic features and weakening the interference of architectural features to avoid misclassification. The experimental results show that the proposed method achieves state-of-the-art results in homologous function discrimination, and the upgrade is significant for complex structure functions.

KW - Binary code

KW - Heterogeneous graph embedding

KW - Homology vulnerability discrimination

KW - Multi-architecture instruction embedding

UR - http://www.scopus.com/inward/record.url?scp=85173619109&partnerID=8YFLogxK

U2 - 10.1016/j.eswa.2023.121835

DO - 10.1016/j.eswa.2023.121835

M3 - Review article

AN - SCOPUS:85173619109

SN - 0957-4174

VL - 238

JO - Expert Systems with Applications

JF - Expert Systems with Applications

M1 - 121835

ER -

HGE-BVHD: Heterogeneous graph embedding scheme of complex structure functions for binary vulnerability homology discrimination

摘要

访问文件

其它文件与链接

指纹

引用此