
Detecting Polarizing Language in Twitter using Topic Models and ML Algorithms

Abstract


The growing use of social media in public discourse has opened new areas of research for social scientists. Public debates tend to polarize along political, social, or ideological lines, and the polarizing language used in such debates typically consists of blaming the opposing group. In this paper, we investigate the detection of polarizing language in tweets posted in the event of a disaster. Our approach combines topic modeling with machine learning (ML) algorithms to generate topics that we consider polarized, and then uses them to classify a given tweet as polar or not. Our latent Dirichlet allocation (LDA)-based model incorporates an external resource, a lexicon of blame-oriented words, to induce the generation of polar topics. Collapsed Gibbs sampling is used to estimate the model parameters and to perform inference on new documents. For evaluation, we computed log likelihood (LL) ratios for our model and for two other state-of-the-art LDA-based models. Furthermore, we compared classification accuracy for polarization detection using features extracted from polarized topics against bag-of-words (BOW) and part-of-speech (POS)-based features. In preliminary experiments, topic-based features achieved a higher overall accuracy, 87.67%, than BOW and POS-based features.
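
The abstract does not spell out how the blame lexicon enters the model, so the following is only an illustrative sketch, not the authors' model. It assumes one common way of seeding a topic: lexicon words receive a larger topic-word prior in a designated topic (topic 0 here), and topic assignments are then resampled with standard collapsed Gibbs sampling for LDA. The function name `seeded_lda_gibbs`, the `boost` parameter, the toy vocabulary, and all hyperparameter values are hypothetical.

```python
import random


def seeded_lda_gibbs(docs, vocab_size, lexicon_ids, n_topics=2, alpha=0.1,
                     beta=0.01, boost=1.0, n_iters=200, seed=0):
    """Collapsed Gibbs sampling for LDA with a lexicon-boosted prior on topic 0.

    docs: list of documents, each a list of integer word ids.
    lexicon_ids: word ids taken from a (hypothetical) blame lexicon.
    Returns (theta, n_kw): per-document topic proportions and topic-word counts.
    """
    rng = random.Random(seed)
    # Asymmetric topic-word prior: topic 0 favours lexicon words (an assumption).
    beta_kw = [[beta] * vocab_size for _ in range(n_topics)]
    for w in lexicon_ids:
        beta_kw[0][w] += boost
    beta_sum = [sum(row) for row in beta_kw]

    n_dk = [[0] * n_topics for _ in docs]               # document-topic counts
    n_kw = [[0] * vocab_size for _ in range(n_topics)]  # topic-word counts
    n_k = [0] * n_topics                                # tokens per topic
    z = []                                              # topic of every token

    # Random initial topic assignment for every token.
    for d, doc in enumerate(docs):
        z_d = []
        for w in doc:
            k = rng.randrange(n_topics)
            z_d.append(k)
            n_dk[d][k] += 1
            n_kw[k][w] += 1
            n_k[k] += 1
        z.append(z_d)

    for _ in range(n_iters):
        for d, doc in enumerate(docs):
            for i, w in enumerate(doc):
                k = z[d][i]
                # Remove the token's current assignment from the counts.
                n_dk[d][k] -= 1
                n_kw[k][w] -= 1
                n_k[k] -= 1
                # Unnormalised full conditional p(z = t | all other assignments).
                weights = [
                    (n_dk[d][t] + alpha)
                    * (n_kw[t][w] + beta_kw[t][w]) / (n_k[t] + beta_sum[t])
                    for t in range(n_topics)
                ]
                k = rng.choices(range(n_topics), weights=weights)[0]
                z[d][i] = k
                n_dk[d][k] += 1
                n_kw[k][w] += 1
                n_k[k] += 1

    # Posterior mean estimate of each document's topic proportions.
    theta = [
        [(n_dk[d][t] + alpha) / (len(doc) + n_topics * alpha)
         for t in range(n_topics)]
        for d, doc in enumerate(docs)
    ]
    return theta, n_kw


# Toy usage: word ids index into this made-up vocabulary.
vocab = ["fault", "blame", "failed", "rescue", "water", "shelter"]
docs = [[0, 1, 2, 1], [3, 4, 5, 4], [0, 2, 1, 3]]
theta, n_kw = seeded_lda_gibbs(docs, len(vocab), lexicon_ids=[0, 1, 2])
```

Under these assumptions, `theta[d][0]` is the estimated share of the seeded topic in document `d`, which could serve as one topic-based feature for a downstream classifier.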

Table of Contents

Abstract
 1. Introduction
 2. Related Work
 3. Blame Detection Approach
  3.1. LDA-Based Topic Modeling
 4. Data Collection
  4.1. Preprocessing
 5. Experimental Results
  5.1. Evaluation using Log Likelihood
  5.2. Feature Engineering with BLDA
  5.3. Classification using LibLinear SVM
 6. Conclusion
 References

Author Information

  • Njagi Dennis Gitari, School of Information Science and Engineering, Central South University, Changsha, 410083, China; Department of Information Technology, Jomo Kenyatta University of Science and Technology (JKUAT), 62000, Kenya
  • Zhang Zuping, School of Information Science and Engineering, Central South University, Changsha, 410083, China
  • Wandabwa Herman, School of Information Science and Engineering, Xiamen University, Xiamen, 361005, China
