Ensemble feature selection with discriminative and representative properties for malware detection

  • Xiao Yu Zhang
  • , Shupeng Wang*
  • , Lei Zhang
  • , Chunjie Zhang
  • , Changsheng Li
  • *Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Malware data are typically depicted with extremely high-dimensional features, which lays an excessive computational burden on detection methods. For the sake of effectiveness and efficiency, feature selection is an indispensable part for malware detection. In this paper, we propose an ensemble feature selection method with integration of discriminative and representative properties for malware detection. Based on the labeled and unlabeled data, the most discriminative and representative features are selected, respectively. The former extracts the features that are most distinctive with respect to the classes, and the latter focuses on the features that best represent the data. A comprehensive metric is subsequently obtained, which retains the most informative features.

Original languageEnglish
Title of host publication2016 IEEE Conference on Computer Communications Workshops, INFOCOM WKSHPS 2016
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages674-675
Number of pages2
ISBN (Electronic)9781467399555
DOIs
Publication statusPublished - 6 Sept 2016
Externally publishedYes
Event35th IEEE Conference on Computer Communications Workshops, INFOCOM WKSHPS 2016 - San Francisco, United States
Duration: 10 Apr 201614 Apr 2016

Publication series

NameProceedings - IEEE INFOCOM
Volume2016-September
ISSN (Print)0743-166X

Conference

Conference35th IEEE Conference on Computer Communications Workshops, INFOCOM WKSHPS 2016
Country/TerritoryUnited States
CitySan Francisco
Period10/04/1614/04/16

Keywords

  • discriminative feature
  • feature selection
  • malware detection
  • matrix optimization
  • representative feature

Fingerprint

Dive into the research topics of 'Ensemble feature selection with discriminative and representative properties for malware detection'. Together they form a unique fingerprint.

Cite this