Learning Extraction of Chinese Comparative Sentences for Evaluative Text

Wei Wang; TieJun Zhao; GuoDong Xin

Learning Extraction of Chinese Comparative Sentences for Evaluative Text

원문정보

Wei Wang, TieJun Zhao, GuoDong Xin

보안공학연구지원센터(IJGDC) International Journal of Grid and Distributed Computing Vol.9 No.3 2016.03 pp.53-62 SCOPUS

피인용수 : 0건 (자료제공 : 네이버학술정보)

초록

영어

With the prevalence of Web 2.0, people increasingly prefer to express opinions and exchange information through CGM (consumer-generated media), such as blog, Internet forum and etc. Many studies pay attention to extract and analysis user opinions in consumer reviews. This paper studies how to automatically extract Chinese comparative sentences from consumer reviews. At first, the paper describes a method for solving the class imbalance problem of comparatives and non-comparatives in review data. Then we built a support vector machine learning model to classify comparatives and non-comparatives into different group on a balanced dataset. Experiments were conducted on consumer-generated product reviews, including 9600 sentences, of which 1,624 (16.92% of the total) were comparisons. Experiments show an overall F-score of 87.26%, which presents the effectiveness of the proposed approach.

Abstract
1. Introduction
2. Related Work
3. Feature Representations
  3.1. Feature Sets 1: Term Features
  3.2. Feature Sets 2: Comparative Keywords
  3.3. Feature Sets 3: Frequent Sequences
  3.4. Feature Sets 4: Infrequent Sequences
4. Classification Learning
5. Experimental Evaluation
  5.1. Data Sets
6. Conclusion and Future Work
Acknowledgement
References

키워드

저자정보

Wei Wang Department of Computer Science and Technology, Harbin Institute of Technology, Harbin, China
TieJun Zhao Department of Computer Science and Technology, Harbin Institute of Technology, Harbin, China
GuoDong Xin Department of Computer Science and Technology, Harbin Institute of Technology, Harbin, China

참고문헌

자료제공 : 네이버학술정보

함께 이용한 논문

※ 원문제공기관과의 협약기간이 종료되어 열람이 제한될 수 있습니다.

0개의 논문이 장바구니에 담겼습니다.

earticle