초록 열기/닫기 버튼
The purpose of this study was to investigate inter- and intra- rater reliability in aninterview and a computerized oral test. It was also examined whether ratercharacteristics influenced on their reliability and biases, and finally the scores of bothtests were compared with those of the Versant test using an automated computer ratingsystem. For the study, the data from 21 Korean university students and 18 Korean ornative speakers of English raters with various characteristics were collected. Some ofthe main findings from the study were as follows. First, rater severity was significantlydifferent in each test, but each rater consistently graded on both tests suggesting lowerinter-rater reliability and higher intra-rater reliability. Secondly, rater severity wasimpacted by the rater characteristics such as mother tongue, gender, age, and major. Lastly, there existed a positive correlation among the scores of the three tests,indicating that the scores of human beings and computers are strongly related.