Density-Based Heterogeneous Data Stream Clustering Algorithm with Mixed Distance Measure Methods

Chen Jin-yin; He Hui-hao

Density-Based Heterogeneous Data Stream Clustering Algorithm with Mixed Distance Measure Methods

원문정보

Chen Jin-yin, He Hui-hao

보안공학연구지원센터(IJDTA) International Journal of Database Theory and Application Vol.8 No.3 2015.06 pp.167-178 SCOPUS

초록

영어

Heterogeneous data stream clustering is an important issue in data stream mining, for the accuracy of the existing heterogeneous clustering algorithm is not high, and don’t have a common distance measure method, a heterogeneous data stream clustering algorithm based on the density with mixed distance measure method is proposed. HDSDen algorithm adopts an online/offline two-stage processing framework. According to the situation of dominant property, the online stage use corresponding distance measure method to define the core points among the arriving points, the purpose of the different distance calculation method is to reduce the influence of the non-dominant property on the whole clustering accuracy. All the density-reachable points form a cluster in the offline stage, and put all the not-clustered points into the reservoir, and the number of the reservoir exceeds the threshold value, we will re-cluster the points to improve the accuracy of clustering. Experiments on real data sets show that the algorithm can achieve better clustering results, and give the clustering results at any time, which can deal with the heterogeneous data stream efficiently.

Abstract
1. Introduction
2. The Traditional Density-based Clustering and Related Definitions
4. HDSDen Algorithm Framework and Related Concepts
  4.1. Algorithm Framework
  4.2. Distance Measure Method
5. Detailed Description of HDSDen Algorithm
  5.1. Pretreatment Process
  5.2. Online Maintenance
  5.3. Offline Stage
  5.4 The Process of HDSDen Algorithm
6. Experimental Evaluation
  6.1. Categorical Attributes Dominant Data
  6.2. Continuous Attributes Dominant Data
7. Conclusion
References

키워드

저자정보

Chen Jin-yin Zhejiang University of Technology, Hangzhou310000
He Hui-hao Zhejiang University of Technology, Hangzhou310000

참고문헌

※ 원문제공기관과의 협약기간이 종료되어 열람이 제한될 수 있습니다.

0개의 논문이 장바구니에 담겼습니다.

earticle