Imbalanced Data SVM Classification Method Based on Cluster Boundary Sampling and DT-KNN Pruning

Li Peng; Yu Xiao-yang; Bi Ting-ting; Huang Jiu-ling

Imbalanced Data SVM Classification Method Based on Cluster Boundary Sampling and DT-KNN Pruning

원문정보

Li Peng, Yu Xiao-yang, Bi Ting-ting, Huang Jiu-ling

보안공학연구지원센터(IJSIP) International Journal of Signal Processing, Image Processing and Pattern Recognition Vol.7 No.2 2014.04 pp.61-68

피인용수 : 0건 (자료제공 : 네이버학술정보)

초록

영어

This paper presents a SVM classification method based on cluster boundary sampling and sample pruning. We actively explore an effective solution to solve the difficult problem of imbalanced data set classification from data re-sampling and algorithm improving. Firstly, we creatively propose the method of cluster boundary sampling, using the clustering density threshold and the boundary density threshold to determine the cluster boundaries, in order to guide the process of re-sampling more scientifically and accurately. Secondly, we put forward a new sample pruning algorithm based on dynamic threshold KNN to deal with the complexity and overlapping problem of imbalanced data set. The phenomenon of data complexity and overlapping will reduce the classification performance and generalization ability of SVM classifier. Experiments show that our method acquires obviously promotion effect in various different imbalanced data sets and it can prove the validity and st

Abstract
1. Introduction
2. Cluster Boundary Sampling Method based on Density Clustering
  2.1. Density Clustering Algorithm
  2.2. Cluster Boundary Under-sampling Method
3. Pruning Algorithm based on Dynamic Threshold KNN
  3.1. Complexity and Overlapping Analysis of Imbalanced Data Set
  3.2. DT-KNN Pruning Algorithm
4. The Results and Analysis of Experiment
5. Conclusion
Acknowledgements
References

키워드

저자정보

Li Peng Higher Educational Key Laboratory for Measuring and Control Technology, Instrumentations of Heilongjiang Province, Harbin University of Science and Technology, 150080 Harbin, China, School of Computer Science and Technology, Harbin University of Science and Technology, 150080 Harbin, China
Yu Xiao-yang Higher Educational Key Laboratory for Measuring and Control Technology, Instrumentations of Heilongjiang Province, Harbin University of Science and Technology, 150080 Harbin, China
Bi Ting-ting School of Computer Science and Technology, Harbin University of Science and Technology, 150080 Harbin, China
Huang Jiu-ling School of Computer Science and Technology, Harbin University of Science and Technology, 150080 Harbin, China

참고문헌

자료제공 : 네이버학술정보

함께 이용한 논문

※ 원문제공기관과의 협약기간이 종료되어 열람이 제한될 수 있습니다.

0개의 논문이 장바구니에 담겼습니다.

earticle