A Global Clustering Approach Using Hybrid Optimization for Incomplete Data Based on Interval Reconstruction of Missing Value
Citations Over TimeTop 18% of 2015 papers
Abstract
Incomplete data clustering is often encountered in practice. Here the treatment of missing attribute value and the optimization procedure of clustering are the important factors impacting the clustering performance. In this study, a missing attribute value becomes an information granule and is represented as a certain interval. To avoid intervals determined by different cluster information, we propose a congeneric nearest-neighbor rule-based architecture of the preclassification result, which can improve the effectiveness of estimation of missing attribute interval. Furthermore, a global fuzzy clustering approach using particle swarm optimization assisted by the Fuzzy C-Means is proposed. A novel encoding scheme where particles are composed of the cluster prototypes and the missing attribute values is considered in the optimization procedure. The proposed approach improves the accuracy of clustering results, moreover, the missing attribute imputation can be implemented at the same time. The experimental results of several UCI data sets show the efficiency of the proposed approach.
Related Papers
- → A reinforcement learning-based approach for imputing missing data(2022)29 cited
- → Guided Multiple Imputation of Missing Data(2007)52 cited
- [Imputation of missing data].(2013)
- → Missing Value Imputation Using a Semi-supervised Rank Aggregation Approach(2008)12 cited
- Application of SOLAS to the Multiple Imputation for Missing Data(2003)