Source Information
Abstract (English)
With the explosive growth of web information, acquiring target information quickly, precisely, and effectively from such a large volume of network information is constrained by many factors. In response, this paper analyzes the key technical component of network information search engines, web crawler technology, and proposes a network information target search model based on a web vertical crawler system, with a discussion of how the corresponding search strategy is implemented. First, it builds the structure of the web vertical crawler system and analyzes its different functional modules. Next, it discusses several crucial problems, including the options for deleting duplicated URLs, the strategy for choosing a duplicated-URL deletion method, and the control model for the search error probability estimate, so as to acquire the network information closest to the target information. Finally, it verifies and discusses the operability and effectiveness of the model and its implementation strategy through a case study.
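To make the duplicated-URL deletion step concrete, the sketch below shows one common way a crawler frontier can skip URLs it has already enqueued. The URL normalization rules, the MD5 digests, and the `URLFrontier` class are illustrative assumptions for this sketch only; the paper's own deletion options and selection strategy are developed in Sections 3.1 and 3.2.

```python
# Minimal sketch of duplicate-URL deletion in a crawler frontier (assumed design,
# not the paper's method): normalize each URL, hash it, and skip repeats.
import hashlib
from urllib.parse import urlsplit, urlunsplit


def normalize(url: str) -> str:
    """Reduce trivially different URLs (case, fragments) to one canonical form."""
    scheme, netloc, path, query, _fragment = urlsplit(url.strip())
    return urlunsplit((scheme.lower(), netloc.lower(), path or "/", query, ""))


class URLFrontier:
    """Queue of URLs to crawl, with hash-based duplicate deletion."""

    def __init__(self):
        self._seen = set()    # digests of URLs already enqueued
        self._queue = []

    def push(self, url: str) -> bool:
        digest = hashlib.md5(normalize(url).encode()).hexdigest()
        if digest in self._seen:   # duplicated URL: delete (skip) it
            return False
        self._seen.add(digest)
        self._queue.append(url)
        return True

    def pop(self):
        return self._queue.pop(0) if self._queue else None


if __name__ == "__main__":
    frontier = URLFrontier()
    for u in ["http://Example.com/a", "http://example.com/a#frag", "http://example.com/b"]:
        print(u, "->", "queued" if frontier.push(u) else "duplicate")
```

An in-memory set is the simplest option; at web scale a crawler would typically trade it for a disk-backed index or a Bloom filter, which is where an error probability estimate of the kind the paper controls in Section 4 becomes relevant.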
Table of Contents
1. Introduction
2. Build the Structure of the Web Vertical Crawler System
3. The Options and Strategies to Delete the Duplicated URL in the Web Vertical Crawler System
3.1. The Options to Delete the Duplicated URL
3.2. Implementation Strategy of Deleting the Duplicated URL
4. The Control Model of the Search Estimate Error Probability of Web Vertical Crawler
5. Case Validation and Analysis
6. Conclusion
Acknowledgment
References