TY - JOUR
T1 - A novel malware detection method based on API embedding and API parameters
AU - Zhou, Bo
AU - Huang, Hai
AU - Xia, Jun
AU - Tian, Donghai
N1 - Publisher Copyright:
© 2023, The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature.
PY - 2024/1
Y1 - 2024/1
N2 - Malware is becoming increasingly prevalent in recent years with the widespread deployment of the information system. Many malicious programs pose a great threat to information systems. In the past decade, various malware detection methods are proposed. Particularly, many studies rely on API features for identifying malware. However, the existing methods do not fully make use of the API features. To address these issues, we propose APInspector, a novel dynamic malware detection solution by carefully inspecting API invocations. This method first leverages a dynamic instrumentation tool to hook the target program for collecting the API sequence and argument features. Then, it exploits a HAN (Hierarchical Attention Network) model to analyze the API sequence features. For analyzing the API argument features, we apply an MLP (Multi-Layer Perceptron) model. To fully leverage the API sequence and argument features, we propose a hybrid model, which combines the HAN and MLP models. The evaluation shows that our approach can detect and classify malware effectively and it outperforms the single models.
AB - Malware is becoming increasingly prevalent in recent years with the widespread deployment of the information system. Many malicious programs pose a great threat to information systems. In the past decade, various malware detection methods are proposed. Particularly, many studies rely on API features for identifying malware. However, the existing methods do not fully make use of the API features. To address these issues, we propose APInspector, a novel dynamic malware detection solution by carefully inspecting API invocations. This method first leverages a dynamic instrumentation tool to hook the target program for collecting the API sequence and argument features. Then, it exploits a HAN (Hierarchical Attention Network) model to analyze the API sequence features. For analyzing the API argument features, we apply an MLP (Multi-Layer Perceptron) model. To fully leverage the API sequence and argument features, we propose a hybrid model, which combines the HAN and MLP models. The evaluation shows that our approach can detect and classify malware effectively and it outperforms the single models.
KW - API
KW - Hierarchical attention network
KW - Malware detection
KW - Multi-layer perceptron
UR - http://www.scopus.com/inward/record.url?scp=85168368994&partnerID=8YFLogxK
U2 - 10.1007/s11227-023-05556-x
DO - 10.1007/s11227-023-05556-x
M3 - Article
AN - SCOPUS:85168368994
SN - 0920-8542
VL - 80
SP - 2748
EP - 2766
JO - Journal of Supercomputing
JF - Journal of Supercomputing
IS - 2
ER -