earticle

논문검색

Exploiting Machine Learning for Comparative Sentences Extraction

초록

영어

This paper studies the problem of extracting Chinese comparative sentences from user reviews, which is a problem of text classification in the level of sentence. This paper first deals with the class skewed problem of review data, and then builds a SVM (support vector machine) model to classify comparative and non-comparative sentences into different groups on a balanced dataset. Various linguistic and statistical features are introduced to characterize a sentence. Experiments were conducted on user-generated product reviews. As a result, our experiments show significant performance, an overall F-score of 85.87%.

목차

Abstract
 1. Introduction
 2. Related Work
 3. Balancing of Corpus
  3.1. Sentence Types
  3.2. Data Balance Strategy
 4. Extracting Comparative Sentences in Balanced Corpus
  4.1. Mining Sequence Pattern Features
  4.2. Manual Rule Features
 5. Experimental Results
  5.1. Data Sets
  5.2. Experimental Results
 6. Conclusion
 Acknowledgement
 References

저자정보

  • Wei Wang School of Computer Science and Technology, Harbin Institute of Technology, Harbin, China
  • TieJun Zhao School of Computer Science and Technology, Harbin Institute of Technology, Harbin, China
  • GuoDong Xin School of Computer Science and Technology, Harbin Institute of Technology, Harbin, China
  • YongDong Xu School of Computer Science and Technology, Harbin Institute of Technology, Harbin, China

참고문헌

자료제공 : 네이버학술정보

    함께 이용한 논문

      ※ 원문제공기관과의 협약기간이 종료되어 열람이 제한될 수 있습니다.

      0개의 논문이 장바구니에 담겼습니다.