• Chung-Ang University

    Research Institute for Humanities Contents
    HK+ Artificial Intelligence Humanities

ArchiveDatasets

Datasets

ArchiveDatasets

Datasets

TitleKorean Essay Grade Prediction Dataset2021-12-14 15:20
Writer Level 10

Click to download the Korean Essay Score Range Prediction Dataset

This dataset consists of 304 Korean essays written by foreign students (300) and instructors (4) and corresponding grades (A, B, C, D) expressed in numbered labels (3, 2, 1, 0).

This dataset is used in the following two papers (both in Korean):

  1. Heeryon Cho, Hyeonyeol Im, Yumi Yi, and Junwoo Cha, "Comparison of Korean Classification Models’ Korean Essay Score Range Prediction Performance," KIPS Transaction on Software and Data Engineering (to appear).
  2. Heeryon Cho, Yumi Yi, Hyeonyeol Im, Junwoo Cha, and Chankyu Lee, "Automatic Score Range Classification of Korean Essays Using Deep Learning-based Korean Language Models -The Case of KoBERT & KoGPT2-," Journal of the International Network for Korean Language and Culture, Vol.18, No.1, pp. 217-241, April 30, 2021.


You can clone the Python code used in the experiments at the following GitHub repositories:

  1. https://github.com/heeryoncho/three_korean_bert_LM_comparison
  2. https://github.com/heeryoncho/korean_essay_grade_prediction
 
Chung-Ang University, Humanities Research Institute
#828, 310 Hall, 84 Heukseok-ro, Dongjak-gu, Seoul, 06974, Korea  TEL +82-2-881-7354  FAX +82-2-813-7353  E-mail : aihumanities@cau.ac.krCOPYRIGHT(C) 2017-2023 CAU HUMANITIES RESEARCH INSTITUTE ALL RIGHTS RESERVED