파이썬 기반 규칙 중심 텍스트 분석을 활용한 한·중 한국어교육 학술논문의 구조·주제·정서 경향 탐색

최지유

파이썬 기반 규칙 중심 텍스트 분석을 활용한 한·중 한국어교육 학술논문의 구조·주제·정서 경향 탐색

원문정보

Exploring Structural, Topical, and Sentiment Tendencies in Korean Language Education Research Articles from China and Korea Using Python-Based Rule-Oriented Text Analysis

최지유

국제차세대융합기술학회 차세대융합기술학회논문지 제10권 3호 2026.03 pp.825-833 KCI 등재

피인용수 : 0건 (자료제공 : 네이버학술정보)

초록

영어

This study explores structural elements, topic distribution, and sentiment indicators in Korean language education research articles published in China and Korea using Python-based rule-oriented text analysis. A total of 300 articles indexed in CNKI and KCI were analyzed after converting PDF documents into text data and applying a unified preprocessing procedure. Structural analysis was conducted through rule-based quantification of keyword frequency and positional distribution rather than automated classification models. Topic distribution was examined by focusing on documents with high cluster coherence to identify keyword concentration and thematic tendencies. Sentiment analysis employed lexicon- and rule-based methods to calculate sentiment intensity indicators for each article. The results show that articles from both countries share similar overall structural proportions and positional patterns, while differences were observed in lexical density and topic concentration within certain structural categories. Sentiment indicators in both datasets were largely concentrated in the neutral range. This study demonstrates the applicability of rule-based automated text analysis for large-scale academic corpora and offers methodological implications for future research in Korean language education.

한국어

본 연구는 파이썬 기반 규칙 중심 텍스트 분석을 활용하여 한·중 한국어교육 학술논문의 구조 요소, 주제 분포 및 정서 지표의 분포 경향을 탐색적으로 분석하였다. 분석 대상은 CNKI와 KCI에 등재된 한국어교육 관련 학술논문 각 150편으로, PDF 문헌을 텍스트 데이터로 변환한 후 동일한 전처리 및 분석 절차를 적용하였다. 구조 분석에서는 자동 판별 모델을 사용하지 않고, 핵심 어휘의 출현 비율과 위치 분포를 규칙 기반으로 계량하였다. 주제 분석은 군집 안정도가 높은 문헌을 중심으로 키워드 분포와 주제 집중 경향을 살펴보았다. 정서 분석은 사전 및 규칙 기반 방식을 활용하여 논문별 정서 강도 지표를 산출하고 범주별 분포를 제시하였다. 분석 결과, 한·중 논문은 전반적인 구조 구성 비율과 위치 분포에서 유사한 경향을 보였으나, 일부 구조 범주에서 어휘 밀도와 주제 집중 양상에 차이가 관찰되었다. 또한 정서 지표는 양국 논문 모두 중립 범주에 집중되는 경향을 나타냈다. 본 연구는 대규모 학술 텍스트를 대상으로 한 규칙 기반 자동 분석의 적용 가능성을 제시하고, 한국어교육 연구에서 활용 가능한 방법론적 시사점을 제공한다.

요약
Abstract
Ⅰ. 서론
Ⅱ. 연구 방법
2.1 연구 자료
2.2 데이터 전처리 및 분석 절차
2.3 규칙 기반 구조 요소 탐색 방법
2.4 주제 분포 탐색을 위한 키워드 및 군집 분석
2.5 규칙 및 사전 기반 정서 지표 산출 방법
Ⅲ. 분석 결과
3.1 구조 요소 분포 결과
3.2 주제 분포 경향
3.3 정서 지표 분포 결과
Ⅳ. 결론 및 논의
REFERENCES

키워드

저자정보

최지유 Choi Jiyu. 중앙대학교 대학원 국어국문학과 학생

참고문헌

자료제공 : 네이버학술정보

함께 이용한 논문

※ 기관로그인 시 무료 이용이 가능합니다.

4,000원

0개의 논문이 장바구니에 담겼습니다.

earticle