HGE-BVHD: Heterogeneous graph embedding scheme of complex structure functions for binary vulnerability homology discrimination

Jiyuan Xing*, Senlin Luo, Limin Pan, Jingwei Hao, Yingdan Guan, Zhouting Wu

*Corresponding author for this work

Research output: Contribution to journal › Review article › peer-review

1 Citation (Scopus)

Abstract

Homologous vulnerability detection is an important aspect of computer security. Its key challenges include discriminating structurally complex functions, supporting cross-architecture programs, and identifying false positives. Non-homologous functions with similar control flow graph structures are easily misjudged, which decreases discrimination accuracy. The vectors generated by instruction-embedding models retain architecture-specific features, which increase the distance between the vectors of homologous functions and lead to misclassification. In this paper, we propose HGE-BVHD, a binary vulnerability homology discrimination (BVHD) method based on heterogeneous graph embedding (HGE). HGE aggregates basic-block features into function representations, applying different transformations according to control flow and data flow; this improves the separation of non-homologous functions and thereby increases discrimination accuracy. A novel multi-architecture instruction-embedding model is also proposed to abstract common semantic features and weaken the interference of architectural features, avoiding misclassification. The experimental results show that the proposed method achieves state-of-the-art results in homologous function discrimination, and that the improvement is significant for structurally complex functions.
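
The idea of transforming basic-block features differently along control-flow and data-flow edges can be illustrated with a minimal sketch. It is not the authors' model: the weight matrices, the tanh nonlinearity, sum pooling, the toy block features, and the names `embed_function` and `cosine` are all assumptions made for illustration, and the abstract does not specify any of them.

```python
import math

def matvec(W, x):
    """Multiply matrix W (list of rows) by vector x."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in W]

def embed_function(block_feats, cfg_edges, dfg_edges,
                   W_self, W_cfg, W_dfg, rounds=1):
    """Aggregate basic-block features into one function vector.

    Hypothetical scheme: each edge type (control flow, data flow)
    has its own weight matrix, so the two relations contribute
    different transformations of the neighbouring block's features.
    """
    h = [list(f) for f in block_feats]
    for _ in range(rounds):
        # Start from a transformation of each block's own state.
        msgs = [matvec(W_self, v) for v in h]
        # Add relation-specific messages along each edge type.
        for W, edges in ((W_cfg, cfg_edges), (W_dfg, dfg_edges)):
            for src, dst in edges:
                m = matvec(W, h[src])
                msgs[dst] = [a + b for a, b in zip(msgs[dst], m)]
        h = [[math.tanh(a) for a in v] for v in msgs]
    # Sum pooling over blocks gives the function-level representation.
    dim = len(h[0])
    return [sum(v[i] for v in h) for i in range(dim)]

def cosine(u, v):
    """Cosine similarity between two vectors (0.0 if either is zero)."""
    dot = sum(a * b for a, b in zip(u, v))
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(b * b for b in v))
    return dot / (nu * nv) if nu and nv else 0.0

# Toy weights (assumed, not learned).
W_SELF = [[1.0, 0.0], [0.0, 1.0]]
W_CFG = [[0.5, 0.0], [0.0, 0.5]]
W_DFG = [[0.0, -0.5], [0.5, 0.0]]

# Two toy functions: same blocks and same control flow,
# but a different data-flow edge.
feats = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
cfg = [(0, 1), (1, 2)]
emb_a = embed_function(feats, cfg, [(0, 2)], W_SELF, W_CFG, W_DFG)
emb_b = embed_function(feats, cfg, [(1, 2)], W_SELF, W_CFG, W_DFG)
similarity = cosine(emb_a, emb_b)
```

In this toy setup the two functions share an identical control flow graph, so a model that looked only at control flow would assign them the same vector. The separate data-flow transformation is what makes `emb_a` and `emb_b` differ.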

Original language: English
Article number: 121835
Journal: Expert Systems with Applications
Volume: 238
Publication status: Published - 15 Mar 2024

Keywords

  • Binary code
  • Heterogeneous graph embedding
  • Homology vulnerability discrimination
  • Multi-architecture instruction embedding
