TY - JOUR
T1 - Deep-Reinforcement-Learning-Based Self-Evolving Moving Target Defense Approach Against Unknown Attacks
AU - Cao, Yuan
AU - Liu, Kun
AU - Lin, Yeming
AU - Wang, Luyao
AU - Xia, Yuanqing
N1 - Publisher Copyright:
© 2014 IEEE.
PY - 2024
Y1 - 2024
N2 - Deep reinforcement learning (DRL)-based moving target defense (MTD) emerges as an outstanding method to enhance the security of highly hostile Internet of Things (IoT) environments. However, due to the gap between certain stationary learning environment and real-world, even a well-trained DRL model may not adapt to unknown attacks in the real-world network environments. Therefore, we present a DRL-based self-evolving MTD approach against unknown attacks. First, we formulate the defense in a dynamic network environment as a Markov decision process (MDP), and utilize a DRL model based on actor-critic framework to obtain the optimal sequential defense strategies. Second, we deploy honeypots within the network environments to capture the traffic features of unknown attacks. These features are then specifically labeled to enable the DRL model to learn the characteristics of unknown attacks and identify them. Third, we design an actor network based on ResNet architecture to perceive the network states and make defense decisions in response to these unknown attacks. Finally, through extensive experiments on a platform based on software defined networks, we show that our proposed approach can enhance the defense performance compared with state-of-the-art solutions when dealing with unknown attacks.
AB - Deep reinforcement learning (DRL)-based moving target defense (MTD) emerges as an outstanding method to enhance the security of highly hostile Internet of Things (IoT) environments. However, due to the gap between certain stationary learning environment and real-world, even a well-trained DRL model may not adapt to unknown attacks in the real-world network environments. Therefore, we present a DRL-based self-evolving MTD approach against unknown attacks. First, we formulate the defense in a dynamic network environment as a Markov decision process (MDP), and utilize a DRL model based on actor-critic framework to obtain the optimal sequential defense strategies. Second, we deploy honeypots within the network environments to capture the traffic features of unknown attacks. These features are then specifically labeled to enable the DRL model to learn the characteristics of unknown attacks and identify them. Third, we design an actor network based on ResNet architecture to perceive the network states and make defense decisions in response to these unknown attacks. Finally, through extensive experiments on a platform based on software defined networks, we show that our proposed approach can enhance the defense performance compared with state-of-the-art solutions when dealing with unknown attacks.
KW - Deep reinforcement learning (DRL)
KW - Internet of Things (IoT)
KW - moving target defense (MTD)
KW - network security
KW - software-defined networks
UR - http://www.scopus.com/inward/record.url?scp=85197509451&partnerID=8YFLogxK
U2 - 10.1109/JIOT.2024.3423022
DO - 10.1109/JIOT.2024.3423022
M3 - Article
AN - SCOPUS:85197509451
SN - 2327-4662
VL - 11
SP - 33027
EP - 33039
JO - IEEE Internet of Things Journal
JF - IEEE Internet of Things Journal
IS - 20
ER -