A New Boosting Algorithm for Classification on Distributed Databases

Nguyen Thi Van Uyen; Seung Gwan Lee; TaeChoong Chung

Publication information

Security Engineering Research Support Center (IJSEIA), International Journal of Software Engineering and Its Applications, Vol. 2, No. 2, April 2008, pp. 13-20

Abstract

In this paper, we propose a new boosting algorithm for distributed databases. The main idea of the proposed method is to exploit the parallelism of distributed databases to build an ensemble of classifiers. In each round of the algorithm, each site processes its own data locally and calculates all the information needed. A central site collects this information from all sites and builds the global classifier, which becomes one member of the ensemble. Each distributed site also uses this global classifier to compute the information required for the next round. By repeating this process, an ensemble of classifiers that is almost identical to the one built on the whole data is produced from the distributed databases. The experiments were performed on 5 different datasets from the UCI repository [9]. The experimental results show that the accuracy of the proposed algorithm is almost equal to or higher than the accuracy obtained by applying a boosting algorithm to the whole database.
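
The round structure described in the abstract can be illustrated with a minimal Python sketch. The abstract does not specify the base learner or what information the sites exchange, so everything concrete here is an assumption and not the authors' algorithm: the sketch uses AdaBoost-style weights, decision stumps as weak classifiers, a candidate stump list fixed in advance by the central site, and per-stump weighted errors as the only statistics sent by each site. All function names are hypothetical.

```python
import math

def stump_predict(stump, x):
    # A stump is (feature index, threshold, polarity); it outputs +1 or -1.
    feature, threshold, polarity = stump
    return polarity if x[feature] > threshold else -polarity

def local_errors(X, y, w, stumps):
    # Site-side step (assumed): weighted error of every candidate stump,
    # computed on local data only.
    return [sum(wi for xi, yi, wi in zip(X, y, w) if stump_predict(s, xi) != yi)
            for s in stumps]

def local_reweight(X, y, w, stump, alpha):
    # Site-side step (assumed): AdaBoost weight update driven by the
    # global classifier received from the central site.
    return [wi * math.exp(-alpha * yi * stump_predict(stump, xi))
            for xi, yi, wi in zip(X, y, w)]

def distributed_adaboost(sites, stumps, rounds):
    # sites: list of (X, y) pairs with labels in {-1, +1}.
    # Only aggregate statistics are combined; raw rows stay at their site.
    n_total = sum(len(y) for _, y in sites)
    weights = [[1.0 / n_total] * len(y) for _, y in sites]
    ensemble = []
    for _ in range(rounds):
        # Each site reports its per-stump errors and its total weight.
        reports = [local_errors(X, y, w, stumps)
                   for (X, y), w in zip(sites, weights)]
        total_w = sum(sum(w) for w in weights)
        # Central site: weighted errors are additive across sites.
        errors = [sum(col) / total_w for col in zip(*reports)]
        best = min(range(len(stumps)), key=errors.__getitem__)
        err = min(max(errors[best], 1e-10), 1 - 1e-10)  # avoid log(0)
        alpha = 0.5 * math.log((1 - err) / err)
        ensemble.append((alpha, stumps[best]))
        # The chosen global classifier goes back to every site.
        weights = [local_reweight(X, y, w, stumps[best], alpha)
                   for (X, y), w in zip(sites, weights)]
    return ensemble

def predict(ensemble, x):
    # Weighted vote of all global classifiers built so far.
    score = sum(alpha * stump_predict(s, x) for alpha, s in ensemble)
    return 1 if score >= 0 else -1

# Toy usage: two sites, one feature, three candidate stumps.
site_a = ([[1.0], [2.0]], [-1, -1])
site_b = ([[3.0], [4.0]], [1, 1])
candidates = [(0, 2.5, 1), (0, 2.5, -1), (0, 1.5, 1)]
model = distributed_adaboost([site_a, site_b], candidates, rounds=3)
```

Because weighted errors add up across sites, this particular sketch selects the same stump in each round as it would on the pooled data, given the same candidate list. That mirrors the abstract's claim of an ensemble close to the one built on the whole database, but only under the assumptions stated above.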

Author information

Nguyen Thi Van Uyen Artificial Intelligent Lab., KyungHee University
Seung Gwan Lee Artificial Intelligent Lab., KyungHee University
TaeChoong Chung Artificial Intelligent Lab., KyungHee University
