earticle

논문검색

Detecting Arabic Cloaking Web Pages Using Hybrid Techniques

초록

영어

Many challenges are emerging in the every day expanding Internet environment, whether for the Internet users or the Web sites owners. The Internet users need to retrieve the high quality relevant information which are relevant to their queries within a short period of time, in order to be a regular users who satisfied by search engine performance. While the Web site owners aim in most cases to increase the rank of their Web pages within SERP to attract more customers to their Web sites, and consequently gaining more visits, which in turn means more revenues. The top rank of the Web pages within SERPs, is very important to the e-commerce and commercial Web pages. The owners of Web sites can attract more visitors to their Web pages, and gain more revenue, through Pay Per Click when their pages appear in the top results of SERPs. This paper proposed new approach of Arabic Web spam detection, dedicated with the cloaking Web pages, using hybrid techniques of content and link analysis. The proposed detection system built the first Arabic cloaking dataset contains around 5,000 Arabic cloaked Web pages. The proposed system extracts all possible rules from HTML element to monitor the cloaking behaviors, and then used three classification algorithms (K-NN, Decision Tree, and Logistic Recognition) in the experimental tests. This novel system yielded a high accuracy results with an accuracy of 94.1606% in detecting cloaking behaviors in Arabic Web pages.

목차

Abstract
 1. Introduction
 2. Related Work
  2.1 Content-based Web spam Detection
  2.2 Link-based Web spam detection
  2.3 Non Arabic cloaking and hybrid Web spam detection
 3. Proposed Frame work
 4. Cloaking Features Extraction
 5. Experimental Results
  5.1 Cloaking Detection Classifiers
  5.2 Cloaking Results Analysis
  5.3 Cloaking Detection System
 6. Evaluate the Proposed Arabic Cloaking Detection System
 7. Conclusions and Future Work
 References

저자정보

  • Heider A. Wahsheh Computer Science Department, College of Computer Science King Khalid University, Abha, Saudi Arabia
  • Mohammed N. Al-Kabi Faculty of Sciences and IT, Zarqa University, Zarqa, Jordan
  • Izzat M. Alsmadi Information Systems Department, College of Computer & Information Sciences Prince Sultan University, Riyadh, Saudi Arabia

참고문헌

자료제공 : 네이버학술정보

    함께 이용한 논문

      ※ 원문제공기관과의 협약기간이 종료되어 열람이 제한될 수 있습니다.

      0개의 논문이 장바구니에 담겼습니다.