Research on Parallel Algorithm Based On Hadoop Distributed Computing Platform

Guo Weiwei; Liu Feng

Research on Parallel Algorithm Based On Hadoop Distributed Computing Platform

원문정보

Guo Weiwei, Liu Feng

보안공학연구지원센터(IJGDC) International Journal of Grid and Distributed Computing Vol.8 No.4 2015.08 pp.163-170

피인용수 : 0건 (자료제공 : 네이버학술정보)

초록

영어

With the rapid development of the 3G network, traditional calculation methods are unable to adapt to the data scene that telecom users' Network access behavior's data scale increase rapidly dozens of TB. The cloud techniques such as Hadoop platform are introduced to solve the data storage problem. The appropriate data mining algorithms are designed from the perspective of practical application. This paper improves the traditional decision tree SPRINT algorithms, proposes a parallel computing program and successfully applies to the Hadoop platform.

Abstract
1. Introduction
2. Design and Implementation of Hadoop Platform
  2.1. Basic Design Ideas.
  2.2. System Structural Model.
3. Parallelized Layout of Sprint Algorithm
  3.1. Main Features of Sprint Method.
  3.2. Paralleling Strategy of the Algorithm.
4. Experiment Design and Discussion
  4.1. Analysis of rationality.
  4.2. Analysis of Effectiveness.
  4.3. Analysis of accuracy.
5. Conclusion
References

키워드

저자정보

Guo Weiwei Heilongjiang University of Technology, Jixi 158100, china,
Liu Feng Heilongjiang University of Technology, Jixi 158100, china,

참고문헌

자료제공 : 네이버학술정보

함께 이용한 논문

※ 원문제공기관과의 협약기간이 종료되어 열람이 제한될 수 있습니다.

0개의 논문이 장바구니에 담겼습니다.

earticle