Learning from Metadata: A Fuzzy Token Matching Based Configuration File Discovery Approach

Han Wang, Fan Jing Meng, Xuejun Zhuo, Lin Yang, Chang Sheng Li, Jing Min Xu

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Discovery of configuration files is one of the prerequisite activities for a successful workload migration to the cloud. The complicated and super-sized file systems, the considerable variance of configuration files, and the multiple-presence of configuration items make configuration file discovery very difficult. Traditional approaches usually highly rely on experts to compose software specific scripts or rules to discover configuration files, which is very expensive and labor-intensive. In this paper, we propose a novel learning based approach named MetaConf to convert configuration file discovery to a supervised file classification task using the file metadata as learning features such that it can be conducted automatically, efficiently, and independently of domain expertise. We report our evaluation with extensive and real-world case studies, and the experimental results validate that our approach is effective and it outperforms our baseline method.

Original languageEnglish
Title of host publicationProceedings - 2015 IEEE 8th International Conference on Cloud Computing, CLOUD 2015
EditorsCalton Pu, Ajay Mohindra
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages405-412
Number of pages8
ISBN (Electronic)9781467372879
DOIs
Publication statusPublished - 19 Aug 2015
Externally publishedYes
Event8th IEEE International Conference on Cloud Computing, CLOUD 2015 - New York, United States
Duration: 27 Jun 20152 Jul 2015

Publication series

NameProceedings - 2015 IEEE 8th International Conference on Cloud Computing, CLOUD 2015

Conference

Conference8th IEEE International Conference on Cloud Computing, CLOUD 2015
Country/TerritoryUnited States
CityNew York
Period27/06/152/07/15

Keywords

  • Cloud Migration
  • Configuration file discovery
  • Data imbalance
  • File metadata
  • Fuzzy string matching

Fingerprint

Dive into the research topics of 'Learning from Metadata: A Fuzzy Token Matching Based Configuration File Discovery Approach'. Together they form a unique fingerprint.

Cite this