Learning to Detect Spam: Naive-Euclidean Approach

Tony Y.T. Chan; Jie Ji; Qiangfu Zhao

Source Information

Security Engineering Research Support Center (IJSIP), International Journal of Signal Processing, Image Processing and Pattern Recognition, vol. 1, no. 1, December 2008, pp. 31-38

Abstract

A method is proposed for learning to classify spam and nonspam emails. It combines the Best Stepwise Feature Selection strategy with a Euclidean nearest-neighbor classifier. Each text email is first transformed into a vector in D-dimensional Euclidean space. Emails were divided into training and test sets by 10-fold cross-validation. Three experiments were performed, and their elapsed CPU times and accuracies are reported. The proposed spam detection learner was found to be extremely fast at recognition and to achieve good error rates. It could serve as a baseline learning agent, in terms of CPU time and accuracy, against which other learning agents can be measured.
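
The pipeline summarized in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not spell out the Best Stepwise Feature Selection procedure or how emails are vectorized, so the sketch assumes a plain greedy forward selection scored by cross-validated accuracy, and it assumes the emails have already been turned into numeric vectors. All function names (`sq_dist`, `nn_predict`, `kfold_indices`, `cv_accuracy`, `forward_stepwise`) are hypothetical.

```python
import random

def sq_dist(a, b, features):
    """Squared Euclidean distance restricted to the chosen feature indices."""
    return sum((a[j] - b[j]) ** 2 for j in features)

def nn_predict(train_X, train_y, x, features):
    """Label of the training vector closest to x (1-nearest-neighbor)."""
    best = min(range(len(train_X)),
               key=lambda i: sq_dist(train_X[i], x, features))
    return train_y[best]

def kfold_indices(n, k, seed=0):
    """Shuffle 0..n-1 and deal the indices into k folds of near-equal size."""
    idx = list(range(n))
    random.Random(seed).shuffle(idx)
    return [idx[i::k] for i in range(k)]

def cv_accuracy(X, y, features, k=10):
    """Fraction of vectors classified correctly when each is held out once."""
    correct = 0
    for fold in kfold_indices(len(X), k):
        held_out = set(fold)
        train = [i for i in range(len(X)) if i not in held_out]
        train_X = [X[i] for i in train]
        train_y = [y[i] for i in train]
        correct += sum(
            nn_predict(train_X, train_y, X[i], features) == y[i] for i in fold
        )
    return correct / len(X)

def forward_stepwise(X, y, k=10):
    """Assumed stand-in for stepwise selection: greedily add the feature
    that raises cross-validated accuracy most; stop when nothing improves."""
    selected, best_acc = [], 0.0
    remaining = list(range(len(X[0])))
    while remaining:
        scores = {j: cv_accuracy(X, y, selected + [j], k) for j in remaining}
        j = max(scores, key=scores.get)  # lowest index wins ties
        if scores[j] <= best_acc:
            break
        selected.append(j)
        remaining.remove(j)
        best_acc = scores[j]
    return selected, best_acc
```

Calling `forward_stepwise(X, y)` on a list of feature vectors `X` and labels `y` returns the chosen feature indices together with their cross-validated accuracy. Because a nearest-neighbor classifier has no training phase beyond storing vectors, the cost is concentrated in the distance computations, which is why a reduced feature set directly shortens recognition time.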

Author Information

Tony Y.T. Chan, The University of Akureyri
Jie Ji, The University of Aizu, Aizuwakamatsu
Qiangfu Zhao, The University of Aizu, Aizuwakamatsu
