TY - GEN
T1 - Software Misconfiguration Troubleshooting Based on State Analysis
AU - Li, Ke
AU - Xue, Yuan
AU - Shao, Yujie
AU - Su, Bing
AU - Tan, Yu An
AU - Hu, Jingjing
N1 - Publisher Copyright:
© 2021 IEEE.
PY - 2021
Y1 - 2021
N2 - Misconfiguration has become one of the dominant causes of service failure and software abnormality. It will not only affect the user experience and cause economic losses, but also require a lot of manpower for troubleshooting. Therefore, an automated tool to quickly diagnose misconfiguration is essential. This paper introduces a tool called ConfDetect, which treats system as data to derive the troubleshooting clues using state analysis. ConfDetect ranks logs based on heuristics to give problem-related messages, uses NLP for Part-Of-Speech tagging to filter out suspicious words which are compared with configuration key-value pairs and environment information, or searches the knowledge base to find misconfiguration items. The knowledge base realizes self-learning by analyzing the configuration changes of the sick system after it has been healed, and predicts the configuration item resulting in the error log. Misconfiguration tests on MySQL, Ngnix and an autonomous vehicle simulation program on ROS show that ConfDetect is able to deliver error message with 91% accuracy and find specific misconfiguration item with 74% accuracy in general. Besides, the time cost of ConfDetect is in proportion to the volume of logs and it takes less than a few seconds to process thousands of lines log. Experiment results prove that ConfDetect is effective in diagnosing misconfiguration and runs fast.
AB - Misconfiguration has become one of the dominant causes of service failure and software abnormality. It will not only affect the user experience and cause economic losses, but also require a lot of manpower for troubleshooting. Therefore, an automated tool to quickly diagnose misconfiguration is essential. This paper introduces a tool called ConfDetect, which treats system as data to derive the troubleshooting clues using state analysis. ConfDetect ranks logs based on heuristics to give problem-related messages, uses NLP for Part-Of-Speech tagging to filter out suspicious words which are compared with configuration key-value pairs and environment information, or searches the knowledge base to find misconfiguration items. The knowledge base realizes self-learning by analyzing the configuration changes of the sick system after it has been healed, and predicts the configuration item resulting in the error log. Misconfiguration tests on MySQL, Ngnix and an autonomous vehicle simulation program on ROS show that ConfDetect is able to deliver error message with 91% accuracy and find specific misconfiguration item with 74% accuracy in general. Besides, the time cost of ConfDetect is in proportion to the volume of logs and it takes less than a few seconds to process thousands of lines log. Experiment results prove that ConfDetect is effective in diagnosing misconfiguration and runs fast.
KW - Black-box Testing
KW - Constraint and Simplification
KW - Log Analysis
KW - Misconfiguration
UR - http://www.scopus.com/inward/record.url?scp=85128783916&partnerID=8YFLogxK
U2 - 10.1109/DSC53577.2021.00057
DO - 10.1109/DSC53577.2021.00057
M3 - Conference contribution
AN - SCOPUS:85128783916
T3 - Proceedings - 2021 IEEE 6th International Conference on Data Science in Cyberspace, DSC 2021
SP - 361
EP - 366
BT - Proceedings - 2021 IEEE 6th International Conference on Data Science in Cyberspace, DSC 2021
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 6th IEEE International Conference on Data Science in Cyberspace, DSC 2021
Y2 - 9 October 2021 through 11 October 2021
ER -