An Efficient K-Means Algorithm and its Benchmarking against other Algorithms

Anupama Chadha; Suresh Kumar

An Efficient K-Means Algorithm and its Benchmarking against other Algorithms

원문정보

Anupama Chadha, Suresh Kumar

보안공학연구지원센터(IJGDC) International Journal of Grid and Distributed Computing Vol.9 No.11 2016.11 pp.119-132 SCOPUS

피인용수 : 0건 (자료제공 : 네이버학술정보)

초록

영어

K-Means is a widely used partition based clustering algorithm famous for its simplicity and speed. It organizes input dataset into predefined number of clusters. K-Means has a major limitation -- the number of clusters, K, need to be pre-specified as an input. Pre-specifying K in the K-Means algorithm sometimes becomes difficult in absence of thorough domain knowledge, or for a new and unknown dataset. This limitation of advance specification of cluster number can lead to “forced” clustering of data and proper classification does not emerge. In this paper, a new algorithm based on the K-Means is developed. This algorithm has advance features of intelligent data analysis and automatic generation of appropriate number of clusters. The clusters generated by the new algorithm are compared against results obtained with the original K-Means and various other famous clustering algorithms. This comparative analysis is done using sets of real data.

키워드

저자정보

Anupama Chadha Faculty of Computer Applications, MRIU, Faridabad, India
Suresh Kumar Faculty of Engineering and Technology, MRIU, Faridabad, India

참고문헌

자료제공 : 네이버학술정보

함께 이용한 논문

※ 원문제공기관과의 협약기간이 종료되어 열람이 제한될 수 있습니다.

0개의 논문이 장바구니에 담겼습니다.

earticle