원문정보
Effect of Text Length on the Validity of Automatic Evaluation of Human Translation.
초록
영어
This study explores the influence of the length of human-translated text on the validity of automatic evaluations. To this end, the Brevity Penalty (BP) was replaced with the Length Penalty (LP) to better align the evaluation methodology with the characteristics of human translation. The study examined if this modification improved evaluation validity. The impact of the BP on the total score was ascertained via the coefficient of determination (R²) of the regression analysis. Validity was further investigated by comparing the correlation coefficient of the similarity between BLEU and LP-applied BLEU (lBLEU) scores and human evaluation scores. The findings indicated an overall improvement in validity, and an increase in the coefficient of determination. The qualitative analysis revealed that students were more likely to lose points for longer translations when compared to the three reference translations, particularly in the case of extensive, informative original texts on unfamiliar topics. Therefore, the length of the human-translated text significantly influences the validity of the automatic evaluation, and substituting BP with LP positively improved this validity. The implications of this study can potentially extend to educational scenarios and the utilization of automated assessments.
목차
1. 서론
2. 텍스트 길이와 평가 타당성
2.1. 짧은 길이 감점(Brevity Penalty)
2.2. 길이 감점(Length Penalty)
3. 실험
3.1. 자료
3.2. 실험 방법
4. 결과
4.1. 타당도 비교
4.2. 세부 분석
5. 결론 및 논의
참고문헌
