A Novel Multilayer Data Clustering Framework based on Feature Selection and Modified K-Means Algorithm

Abstract

English

With the rapid development of computer science and technology, data analysis has become one of the most active research areas in the pattern recognition community. Cluster analysis is an important step in data mining, and various multi-objective clustering techniques have evolved that can partition data automatically. In this paper, we propose a novel multilayer data clustering framework based on feature selection and a modified K-Means algorithm. To facilitate clustering, the proposed algorithm selects a representative feature subset to reduce the dimensionality of the raw data set. In addition, the selected feature subset has fewer missing values than the raw data set, which may improve clustering accuracy. Another distinctive property of the proposed algorithm is its use of a partial distance strategy. Experimental analysis and simulation indicate the feasibility and robustness of our method. In future work, we plan to conduct further mathematical analysis and modify the algorithm to achieve better results.
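The abstract names a partial distance strategy but does not spell it out. As a rough illustration only, the sketch below uses the commonly cited form of partial distance for data with missing values: the squared distance is computed over the observed features and rescaled by the ratio of total to observed features. The function names (`partial_distance_sq`, `kmeans_partial`), the NaN encoding of missing values, and the centroid update rule are assumptions made for this example and are not taken from the paper; the authors' modified K-Means may differ.

```python
import numpy as np

def partial_distance_sq(x, c):
    """Squared partial distance between point x (NaN = missing) and centroid c.

    Assumed form: (total features / observed features) * sum of squared
    differences over the observed features only.
    """
    x = np.asarray(x, dtype=float)
    c = np.asarray(c, dtype=float)
    observed = ~np.isnan(x)
    n_obs = observed.sum()
    if n_obs == 0:
        raise ValueError("point has no observed features")
    diff = x[observed] - c[observed]
    return (x.size / n_obs) * np.sum(diff ** 2)

def assign_clusters(X, centroids):
    """Assign each row of X to the centroid with the smallest partial distance."""
    return np.array([
        np.argmin([partial_distance_sq(x, c) for c in centroids])
        for x in X
    ])

def update_centroids(X, labels, centroids):
    """Recompute each centroid as the per-feature mean of observed values."""
    new = centroids.copy()
    for k in range(len(centroids)):
        members = X[labels == k]
        if len(members) == 0:
            continue  # keep the old centroid for an empty cluster
        obs = ~np.isnan(members)
        counts = obs.sum(axis=0)
        sums = np.where(obs, members, 0.0).sum(axis=0)
        has_data = counts > 0
        # Features with no observed value in this cluster keep their old value.
        new[k, has_data] = sums[has_data] / counts[has_data]
    return new

def kmeans_partial(X, init_centroids, n_iter=10):
    """Plain K-Means loop using partial distances (hypothetical helper)."""
    X = np.asarray(X, dtype=float)
    centroids = np.asarray(init_centroids, dtype=float).copy()
    labels = assign_clusters(X, centroids)
    for _ in range(n_iter):
        centroids = update_centroids(X, labels, centroids)
        labels = assign_clusters(X, centroids)
    return labels, centroids

# Toy data with missing entries encoded as NaN.
X = [[0.0, 0.0], [0.0, np.nan], [10.0, 10.0], [np.nan, 10.0]]
labels, centroids = kmeans_partial(X, init_centroids=[[1.0, 1.0], [9.0, 9.0]])
```

The rescaling keeps distances comparable between points with different numbers of observed features, which is why incomplete records can be clustered without imputing or discarding them.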

Table of Contents

Abstract
 1. Introduction
 2. Overview of Clustering Algorithms
  2.1. Fuzzy C-Means Algorithm
  2.2. The DENCLUE Algorithm
  2.3. The Expectation-Maximization (EM) Algorithm
 3. Our Proposed Framework
  3.1. Feature Selection Through Hierarchical Clustering
  3.2. Feature Selection Through Hierarchical Clustering
 4. Experimental Analysis and Simulation
  4.1. Set-up of the Experiment
  4.2. Accuracy Experiment
  4.3. Experimental Analysis on Execution Time
 5. Conclusion and Summary
 Acknowledgements
 References

Author Information

  • Ganglong Duan Xi'an University of Technology, Shaanxi 710054, China
  • Wenxiu Hu Xi'an University of Technology, Shaanxi 710054, China
  • Zhiguang Zhang Xi'an University of Technology, Shaanxi 710054, China

