원문정보
초록
영어
This study proposes a systematic methodology for evaluating Korean-Chinese translation quality using AI-HUB's ‘Broadcasting Content Korean-Chinese Translation Parallel Corpus Data’. While machine translation technology has advanced significantly, research on Korean-Chinese translation characteristics remains insufficient, particularly for domains featuring extensive colloquial expressions and cultural contexts such as broadcasting content. This research selected 10,000 sentences through stratified sampling from 1.2 million sentences and evaluated translation quality using BLEU, METEOR, and TER metrics. The analysis revealed that translation quality varied significantly by genre and sentence length. Educational programs achieved the highest BLEU score of 0.467, while reality variety shows recorded the lowest at 0.371. Translation quality declined sharply as sentence length increased, from 0.518 for short sentences to 0.287 for long sentences. Error analysis identified colloquial expression mistranslations (32%), demonstrative errors (21%), and literal translations of idioms (18%) as major challenges. The methodology established reproducibility through publicly available resources (AI-HUB data, Naver Papago API, Python/NLTK), while findings suggest that educational content shows higher translation reliability compared to entertainment programs requiring careful post-editing. This study is significant as the first systematic evaluation of Korean-Chinese translation quality in the broadcasting content domain, providing specific directions for improving translation of colloquial expressions and cultural context that are characteristic of Chinese language translation.
