earticle

논문검색

Culture Convergence (CC)

A Practical Deep Learning-Based Approach for the Classification of Hate Speech and Offensive Language

원문정보

초록

영어

The proliferation of social media has facilitated communication but has also accelerated the spread of hate speech and offensive language. The automatic detection of such harmful content is essential for ensuring the safety and integrity of online platforms. This study proposes a practical classification system for hate speech detection using Google’s BERT (Bidirectional Encoder Representations from Transformers) model. A dataset consisting of 24,783 tweets was utilized, categorized into three labels: Hate Speech, Offensive Language, and Neither. To address the issue of data imbalance, techniques such as SMOTE, class weighting, and threshold optimization were applied. The preprocessing pipeline included the removal of URLs, user tags, and special characters to enhance data consistency. Compared with traditional machine learning models—Logistic Regression, Random Forest, and SVM—the BERT-based model demonstrated superior performance in both contextual understanding and classification accuracy. Model performance was evaluated using confusion matrices, precision, recall, F1-score, as well as ROC and Precision-Recall curves, showing particularly strong results for the Offensive Language category. Future research will extend this work by applying advanced transformer-based models such as RoBERTa and GPT-3, and by constructing multilingual datasets to develop a scalable, real-time detection system applicable to global social media platforms.

목차

Abstract
1. INTRODUCTION
2. Main Body
2.1 Related Work
2.2 Dataset and Data Imbalance Problem
3. Proposed Method
3.1 Model Overview
3.2 Training Configuration
3.3 Training and Validation Strategy
4. Results
4.1 Comparison with Traditional Machine Learning Models
4.2 Performance Evaluation and Visualization
5. Conclusion and Future Work
REFERENCES

저자정보

  • Kwon. Yong-Kwang Prof., Dept. of Shinansan Univ., Korea

참고문헌

자료제공 : 네이버학술정보

    함께 이용한 논문

      ※ 원문제공기관과의 협약기간이 종료되어 열람이 제한될 수 있습니다.

      0개의 논문이 장바구니에 담겼습니다.