Click to download the Korean Essay Score Range Prediction Dataset

This dataset consists of 304 Korean essays written by foreign students (300) and instructors (4), together with their corresponding grades (A, B, C, D) expressed as numeric labels (3, 2, 1, 0).

This dataset is used in the following two papers (both in Korean):

- Heeryon Cho, Hyeonyeol Im, Yumi Yi, and Junwoo Cha, "Comparison of Korean Classification Models’ Korean Essay Score Range Prediction Performance," KIPS Transactions on Software and Data Engineering (to appear).
- Heeryon Cho, Yumi Yi, Hyeonyeol Im, Junwoo Cha, and Chankyu Lee, "Automatic Score Range Classification of Korean Essays Using Deep Learning-based Korean Language Models -The Case of KoBERT & KoGPT2-," Journal of the International Network for Korean Language and Culture, Vol.18, No.1, pp. 217-241, April 30, 2021.

You can clone the Python code used in the experiments from the following GitHub repositories:

- https://github.com/heeryoncho/three_korean_bert_LM_comparison
- https://github.com/heeryoncho/korean_essay_grade_prediction
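
The grade-to-label encoding described above (A, B, C, D mapped to 3, 2, 1, 0) can be sketched in Python as follows. This is only an illustrative sketch, not code from the repositories: the names `GRADE_TO_LABEL`, `encode_grades`, and `decode_labels` are hypothetical, and the sample grades are made up. It makes no assumption about the dataset's file format.

```python
# Grade-to-label mapping as stated in the dataset description.
GRADE_TO_LABEL = {"A": 3, "B": 2, "C": 1, "D": 0}

# Reverse mapping, useful for turning model predictions back into grades.
LABEL_TO_GRADE = {label: grade for grade, label in GRADE_TO_LABEL.items()}


def encode_grades(grades):
    """Convert a list of letter grades into numeric labels."""
    return [GRADE_TO_LABEL[g] for g in grades]


def decode_labels(labels):
    """Convert a list of numeric labels back into letter grades."""
    return [LABEL_TO_GRADE[n] for n in labels]


# Hypothetical sample, not taken from the dataset.
sample_grades = ["A", "C", "D", "B"]
print(encode_grades(sample_grades))
print(decode_labels([3, 2, 1, 0]))
```

Because the labels are ordered (a higher number means a better grade), the same integers can serve as class indices for a classifier or as ordinal targets.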