Session C - Special Session: KrAIS, Chairs: 김희웅 (Yonsei University), 강현정 (Hongik University)

Airline Data Set Analysis using Big Data in Cloud Computing

Abstract

In this paper, 10 years of airline data from the USA are collected and analyzed using a Microsoft Azure cloud computing service that runs a Hadoop cluster in the cloud. Hadoop Hive QL statements are used to analyze the data, and the data are visualized by extracting the output of the Hive code. Using big data technologies such as Hadoop and Hive in the cloud, it is easy to analyze the massive data set in an SQL-like language, export the output results, and visualize them in Excel spreadsheets. Interesting trends and patterns exist in this large data set; the paper presents insights into the relationships between flight diversions and flight distance, and between flight cancellations and flight distance, over the time period.

Table of Contents

Abstract
 Introduction
 Related Work
 Methods
  Hadoop Hive
  Cloud Computing: HDInsight
  Analysis of Airline Data Set
 Experimental Results
 Conclusion
 References

Author Information

  • Nillohit Bhattacharya InfoSys Limited, Los Angeles, 90032 U.S.A.
  • Jongwook Woo California State University, Los Angeles, 90032 U.S.A.
