Multimodal Dual-Embedding Networks for Malware Open-Set Recognition

Jingcai Guo, Han Wang, Yuanyuan Xu, Wenchao Xu, Yufeng Zhan, Yuxia Sun, Song Guo

科研成果: 期刊稿件文章同行评审

3 引用 (Scopus)

摘要

Malware open-set recognition (MOSR) is an emerging research domain that aims at jointly classifying malware samples from known families and detecting the ones from novel unknown families, respectively. Existing works mostly rely on a <italic>well-trained</italic> classifier considering the predicted probabilities of each known family with a threshold-based detection to achieve the MOSR. However, our observation reveals that the feature distributions of malware samples are extremely similar to each other even between known and unknown families. Thus, the obtained classifier may produce overly high probabilities of testing unknown samples toward known families and degrade the model performance. In this article, we propose the multi<inline-formula> <tex-math notation="LaTeX">$\backslash$</tex-math> </inline-formula>modal dual-embedding networks, dubbed MDENet, to take advantage of comprehensive malware features from different modalities to enhance the diversity of malware feature space, which is more representative and discriminative for down-stream recognition. Concretely, we first generate a malware image for each observed sample based on their numeric features using our proposed numeric encoder with a re-designed multiscale CNN structure, which can better explore their statistical and spatial correlations. Besides, we propose to organize tokenized malware features into a sentence for each sample considering its behaviors and dynamics, and utilize language models as the textual encoder to transform it into a representable and computable textual vector. Such parallel multimodal encoders can fuse the above two components to enhance the feature diversity. Last, to further guarantee the open-set recognition (OSR), we dually embed the fused multimodal representation into one primary space and an associated sub-space, i.e., discriminative and exclusive spaces, with contrastive sampling and <inline-formula> <tex-math notation="LaTeX">$\rho$</tex-math> </inline-formula>-bounded enclosing sphere regularizations, which resort to classification and detection, respectively. Moreover, we also enrich our previously proposed large-scaled malware dataset MAL-100 with multimodal characteristics and contribute an improved version dubbed MAL-100<inline-formula> <tex-math notation="LaTeX">$^{+}$</tex-math> </inline-formula>. Experimental results on the widely used malware dataset <italic>Mailing</italic> and the proposed MAL-100<inline-formula> <tex-math notation="LaTeX">$^{+}$</tex-math> </inline-formula> demonstrate the effectiveness of our method.

源语言英语
页(从-至)1-15
页数15
期刊IEEE Transactions on Neural Networks and Learning Systems
DOI
出版状态已接受/待刊 - 2024

指纹

探究 'Multimodal Dual-Embedding Networks for Malware Open-Set Recognition' 的科研主题。它们共同构成独一无二的指纹。

引用此