earticle

논문검색

Hybrid Learning Architectures for Advanced Data Mining:An Application to Binary Classification for Fraud Management

원문정보

개선된 데이터마이닝을 위한 혼합 학습구조의 제시

Steven H. Kim, Sung Woo Shin

피인용수 : 0(자료제공 : 네이버학술정보)

초록

영어

The task of classification permeates all walks of life, from business and economics to science and public policy. In this context, nonlinear techniques from artificial intelligence have often proven to be more effective than the methods of classical statistics. The objective of knowledge discovery and data mining is to support decision making through the effective use of information. The automated approach to knowledge discovery is especially useful when dealing with large data sets or complex relationships. For many applications, automated software may find subtle patterns which escape the notice of manual analysis, or whose complexity exceeds the cognitive capabilities of humans. This paper explores the utility of a collaborative learning approach involving integrated models in the preprocessing and postprocessing stages. For instance, a genetic algorithm effects feature-weight optimization in a preprocessing module. Moreover, an inductive tree, artificial neural network (ANN), and k-nearest neighbor (kNN) techniques serve as postprocessing modules. More specifically, the postprocessors act as second0order classifiers which determine the best first-order classifier on a case-by-case basis. In addition to the second-order models, a voting scheme is investigated as a simple, but efficient, postprocessing model. The first-order models consist of statistical and machine learning models such as logistic regression (logit), multivariate discriminant analysis (MDA), ANN, and kNN. The genetic algorithm, inductive decision tree, and voting scheme act as kernel modules for collaborative learning. These ideas are explored against the background of a practical application relating to financial fraud management which exemplifies a binary classification problem.

목차

Abstract
 1. INTRODUCTION
  1.1 Purpose
  1.2 Background
 2. COLLABORATIVE LEARNING IN THE CLASSIFICATION TASK
  2.1 GA as the Preprocessor
  2.2 Postprocessing approaches
 3. EXPERIMENTAL STUDY
  3.1 Data sets
  3.2 Preprocessing, Implementation, and Experiments
  3.3 Results
 4. CONCLUSIONS
 Reference
 요약

저자정보

  • Steven H. Kim 김형관. Graduate School of Management, Korea Advanced Institute of Science and Technology, Seoul, Korea
  • Sung Woo Shin 신성우. Samsung SDS, Seoul, Korea

참고문헌

자료제공 : 네이버학술정보

    ※ 기관로그인 시 무료 이용이 가능합니다.

    • 8,400원

    0개의 논문이 장바구니에 담겼습니다.