A robust fuzzy C-means clustering algorithm for incomplete data

Jinhua Li, Shiji Song*, Yuli Zhang, Kang Li

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

7 Citations (Scopus)

Abstract

Date sets with missing feature values are prevalent in clustering analysis. Most existing clustering methods for incomplete data rely on imputations of missing feature values. However, accurate imputations are usually hard to obtain especially for small-size or highly corrupted data sets. To address this issue, this paper proposes a robust fuzzy c-means (RFCM) clustering algorithm, which does not require imputations. The proposed RFCM represents the missing feature values by intervals, which can be easily constructed using the K-nearest neighbors method, and adopts a min-max optimization model to reduce the impact of noises on clustering performance. We give an equivalent tractable reformulation of the min-max optimization problem and propose an efficient solution method based on smoothing and gradient projection techniques. Experiments on UCI data sets validate the effectiveness of the proposed RFCM algorithm by comparison with existing clustering methods for incomplete data.

Original languageEnglish
Title of host publicationIntelligent Computing, Networked Control, and Their Engineering Applications - International Conference on Life System Modeling and Simulation, LSMS 2017 and International Conference on Intelligent Computing for Sustainable Energy and Environment, ICSEE 2017, Proceedings
EditorsDong Yue, Tengfei Zhang, Chen Peng, Dajun Du, Min Zheng, Qinglong Han
PublisherSpringer Verlag
Pages3-12
Number of pages10
ISBN (Print)9789811063725
DOIs
Publication statusPublished - 2017
Externally publishedYes
EventInternational Conference on Life System Modeling and Simulation, LSMS 2017 and International Conference on Intelligent Computing for Sustainable Energy and Environment, ICSEE 2017 - Nanjing, China
Duration: 22 Sept 201724 Sept 2017

Publication series

NameCommunications in Computer and Information Science
Volume762
ISSN (Print)1865-0929

Conference

ConferenceInternational Conference on Life System Modeling and Simulation, LSMS 2017 and International Conference on Intelligent Computing for Sustainable Energy and Environment, ICSEE 2017
Country/TerritoryChina
CityNanjing
Period22/09/1724/09/17

Keywords

  • Interval data
  • Robust FCM
  • Robust clustering algorithm

Fingerprint

Dive into the research topics of 'A robust fuzzy C-means clustering algorithm for incomplete data'. Together they form a unique fingerprint.

Cite this