
A Novel Crawler Based on Loginning Simulation for Weibo Social Network

Abstract

With the rapid development of Weibo, the most popular microblogging service in China, related studies have attracted growing attention. Gathering precise data from Weibo is the groundwork for this research, so a novel, highly efficient Weibo crawler (WCrawler) based on login simulation is designed. A priority evaluation is described to ensure the correlation between entries. MD5 is introduced to check crawled URLs for duplicates. Experiments demonstrate that the novel crawler achieves higher efficiency and integrity of information collection than an API crawler. In addition, we present a summary of the data collected from the Weibo social network by WCrawler.
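
The abstract states only that MD5 is used to check crawled URLs for duplicates; it does not say how WCrawler stores or compares the digests. As a minimal sketch of that general technique, assuming an in-memory set of hex digests, the check might look like the following. The class name `SeenUrls`, its methods, and the example URLs are hypothetical and are not taken from the paper.

```python
import hashlib


class SeenUrls:
    """Tracks already-crawled URLs by their MD5 digests (illustrative only)."""

    def __init__(self):
        # Assumption: digests are kept in memory; the paper may use another store.
        self._digests = set()

    @staticmethod
    def fingerprint(url: str) -> str:
        # Hash the UTF-8 bytes of the URL; the 32-character hex digest is the key.
        return hashlib.md5(url.encode("utf-8")).hexdigest()

    def add_if_new(self, url: str) -> bool:
        """Record the URL and return True if it has not been seen before."""
        digest = self.fingerprint(url)
        if digest in self._digests:
            return False
        self._digests.add(digest)
        return True


if __name__ == "__main__":
    seen = SeenUrls()
    # Hypothetical URLs, used only to exercise the duplicate check.
    for url in ["https://weibo.com/u/1", "https://weibo.com/u/2", "https://weibo.com/u/1"]:
        print(url, "new" if seen.add_if_new(url) else "duplicate")
```

Storing fixed-length digests instead of full URLs keeps each entry the same size regardless of URL length, which is one common reason to hash before comparing.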

Table of Contents

Abstract
 1. Introduction
 2. Data Specification and Priority Evaluation
  2.1. Data Specification
  2.2. Priority Evaluation
 3. Design and Implementation of WCrawler
  3.1. Control Module and Storage Module
  3.2. Login Simulation Module
  3.3. Crawling Module
 4. Experimental Results and Analysis
  4.1. Performance of WCrawler
  4.2. Analyzing the Crawled Data
 5. Conclusions
 References

Author Information

  • Ling Xing: School of Information Engineering, Southwest University of Science and Technology, Mianyang, 621010, China; Robot Technology Used for Special Environment Key Laboratory of Sichuan Province, Mianyang, 621010, China
  • Ling Jiang: Department of Mathematics and Computer Science, Wuyi University, Wuyishan, 354300, China
  • Bao Peng: School of Electronic and Communication, Shenzhen Institute of Information Technology, Shenzhen, 518172, China
