跳到主要导航 跳到搜索 跳到主要内容

Early-Stage Detection of Encrypted Malware Traffic via Multi-flow Temporal Graph Learning

  • Beijing Institute of Technology
  • State Key Laboratory of Cryptology
  • CAS - Institute of Information Engineering
  • National Computer Network Emergency Response Technical Team/Coordination Center of China
  • Xi'an Jiaotong University

科研成果: 期刊稿件文章同行评审

摘要

Malware widely adopts network traffic encryption techniques to conceal malicious activities. Recent research has demonstrated the effectiveness of machine learning (ML)-, deep learning (DL)-, and pre-training-based malware traffic detection methods. However, a vast majority of these methods rely on the collected complete traffic during the malware attack. While certain methods can operate on partial traffic, their detection accuracy often significantly decreases when the available data is restricted to the extreme early stage, where information is most sparse. In this paper, we propose DawnGuard, an effective early-stage encrypted malware traffic detection framework through multi-flow temporal graph learning. Specifically, based on the temporal packet density distribution analysis, DawnGuard innovatively proposes a self-adjusting data augmentation strategy for early-stage malware traffic, which can force the model to focus on the early-stage interaction phase with more distinguishable properties. Meanwhile, considering that temporal-topological correlations among multiple flows can provide more distinguishable properties in a malware attack, we further develop a temporal graph learning framework to extract features, which can form Multi-Flow Graph Features (MGF). By utilizing MGF, Dawn-Guard implements a Vision Transformer-based detection mechanism, enabling accurate and precise encrypted malware traffic detection with early-stage traffic by capturing both local and global contextual relationships. Extensive experiments with two real-world datasets demonstrate that DawnGuard outperforms the state-of-the-art (SOTA) methods in three typical scenarios: varying early-stage time windows, imbalanced data, and unseen malware detection. Particularly, DawnGuard achieves an average F1 of 95.11%, 8.7% higher than the SOTA method, by only utilizing the first 20% loading ratio of complete traffic.

源语言英语
期刊IEEE Transactions on Information Forensics and Security
DOI
出版状态已接受/待刊 - 2026
已对外发布

指纹

探究 'Early-Stage Detection of Encrypted Malware Traffic via Multi-flow Temporal Graph Learning' 的科研主题。它们共同构成独一无二的指纹。

引用此