earticle

논문검색

Learning Extraction of Chinese Comparative Sentences for Evaluative Text

초록

영어

With the prevalence of Web 2.0, people increasingly prefer to express opinions and exchange information through CGM (consumer-generated media), such as blog, Internet forum and etc. Many studies pay attention to extract and analysis user opinions in consumer reviews. This paper studies how to automatically extract Chinese comparative sentences from consumer reviews. At first, the paper describes a method for solving the class imbalance problem of comparatives and non-comparatives in review data. Then we built a support vector machine learning model to classify comparatives and non-comparatives into different group on a balanced dataset. Experiments were conducted on consumer-generated product reviews, including 9600 sentences, of which 1,624 (16.92% of the total) were comparisons. Experiments show an overall F-score of 87.26%, which presents the effectiveness of the proposed approach.

목차

Abstract
 1. Introduction
 2. Related Work
 3. Feature Representations
  3.1. Feature Sets 1: Term Features
  3.2. Feature Sets 2: Comparative Keywords
  3.3. Feature Sets 3: Frequent Sequences
  3.4. Feature Sets 4: Infrequent Sequences
 4. Classification Learning
 5. Experimental Evaluation
  5.1. Data Sets
 6. Conclusion and Future Work
 Acknowledgement
 References

저자정보

  • Wei Wang Department of Computer Science and Technology, Harbin Institute of Technology, Harbin, China
  • TieJun Zhao Department of Computer Science and Technology, Harbin Institute of Technology, Harbin, China
  • GuoDong Xin Department of Computer Science and Technology, Harbin Institute of Technology, Harbin, China

참고문헌

자료제공 : 네이버학술정보

    함께 이용한 논문

      ※ 원문제공기관과의 협약기간이 종료되어 열람이 제한될 수 있습니다.

      0개의 논문이 장바구니에 담겼습니다.