earticle

논문검색

An Efficient K-Means Algorithm and its Benchmarking against other Algorithms

초록

영어

K-Means is a widely used partition based clustering algorithm famous for its simplicity and speed. It organizes input dataset into predefined number of clusters. K-Means has a major limitation -- the number of clusters, K, need to be pre-specified as an input. Pre-specifying K in the K-Means algorithm sometimes becomes difficult in absence of thorough domain knowledge, or for a new and unknown dataset. This limitation of advance specification of cluster number can lead to “forced” clustering of data and proper classification does not emerge. In this paper, a new algorithm based on the K-Means is developed. This algorithm has advance features of intelligent data analysis and automatic generation of appropriate number of clusters. The clusters generated by the new algorithm are compared against results obtained with the original K-Means and various other famous clustering algorithms. This comparative analysis is done using sets of real data.

목차

Abstract
 1. Introduction
 2. Related Work
 3. An Extended K-Means Algorithm
 4. Illustrative Examples
 5. Results and Discussion
 6. Conclusion
 References

저자정보

  • Anupama Chadha Faculty of Computer Applications, MRIU, Faridabad, India
  • Suresh Kumar Faculty of Engineering and Technology, MRIU, Faridabad, India

참고문헌

자료제공 : 네이버학술정보

    함께 이용한 논문

      ※ 원문제공기관과의 협약기간이 종료되어 열람이 제한될 수 있습니다.

      0개의 논문이 장바구니에 담겼습니다.