TY - JOUR
T1 - 基于流量异常分析多维优化的入侵检测方法
AU - Liu, Xinqian
AU - Shan, Chun
AU - Ren, Jiadong
AU - Wang, Qian
AU - Guo, Jiawei
N1 - Publisher Copyright:
© 2019 Chinese Academy of Sciences. All rights reserved.
PY - 2019
Y1 - 2019
N2 - In the process of detecting and preventing various network anomaly behaviors, intrusion detection system is facing the problem of low accuracy and high false alarm rate due to the massive and high-dimensional traffic data. An intrusion detection method based on multi-dimensional optimization of traffic anomaly analysis is proposed, in which both horizontal and vertical dimensions of intrusion detection dataset are optimized. In horizontal dimensions optimization, those categories with a large number are sampled and the optimal sampling proportion parameters of each category are obtained by genetic algorithm. Data equalization is accomplished. In vertical dimensions optimization, combining with the correlation analysis of features with label, recursive features addition algorithm is adopted to select features, and the average recall is proposed to evaluate the effect of features selection. The low-dimensional and high-efficient training data set is achieved. Based on optimized intrusion detection dataset, the random forest classifier is obtained by training dataset, and the real data set UNSW_NB15 is used to evaluate and validate the proposed method. Compared with other algorithms, the proposed algorithm has high accuracy and low false alarm rate, and effective recall rate on attack category is obtained.
AB - In the process of detecting and preventing various network anomaly behaviors, intrusion detection system is facing the problem of low accuracy and high false alarm rate due to the massive and high-dimensional traffic data. An intrusion detection method based on multi-dimensional optimization of traffic anomaly analysis is proposed, in which both horizontal and vertical dimensions of intrusion detection dataset are optimized. In horizontal dimensions optimization, those categories with a large number are sampled and the optimal sampling proportion parameters of each category are obtained by genetic algorithm. Data equalization is accomplished. In vertical dimensions optimization, combining with the correlation analysis of features with label, recursive features addition algorithm is adopted to select features, and the average recall is proposed to evaluate the effect of features selection. The low-dimensional and high-efficient training data set is achieved. Based on optimized intrusion detection dataset, the random forest classifier is obtained by training dataset, and the real data set UNSW_NB15 is used to evaluate and validate the proposed method. Compared with other algorithms, the proposed algorithm has high accuracy and low false alarm rate, and effective recall rate on attack category is obtained.
KW - data sampling
KW - genetic algorithm parameter optimization
KW - intrusion detection framework
KW - multi-dimensional optimization
KW - random forest
KW - recursive features addition
UR - http://www.scopus.com/inward/record.url?scp=85149884669&partnerID=8YFLogxK
U2 - 10.19363/J.cnki.cn10-1380/tn.2019.01.02
DO - 10.19363/J.cnki.cn10-1380/tn.2019.01.02
M3 - 文章
AN - SCOPUS:85149884669
SN - 2096-1146
VL - 4
SP - 14
EP - 26
JO - Journal of Cyber Security
JF - Journal of Cyber Security
IS - 1
ER -