• Chung-Ang University

    Humanities Research Institute
    HK+ Artificial Intelligence Humanities

JournalsPast Issues

Past Issues

eISSN: 2951-388X
Print ISSN: 2635-4691 / Online ISSN: 2951-388X
Title[Journal of Artificial Intelligence Humanities Vol.4] The Role of Domain Knowledge in Deep Learning-based Natural Language Processing_Jinho Park2021-02-03 22:07
Writer Level 10
Attachment07 Jinho Park.pdf (697.2KB)

The Role of Domain Knowledge in Deep Learning-based Natural Language Processing


Jinho Park


In Symbolic AI, the domain knowledge was considered indispensable. In rule-based NLP, likewise, the linguistic knowledge played an important role. As probabilistic NLP and machine learning techniques develop, the role of domain knowledge shrank. As deep learning appears, even the role of feature engineering and domain knowledge has become almost zero.

In order to prove the importance of domain knowledge even in this deep learning age, I built a parts-of-speech tagger of Korean. This task in Korean is challenging, due to morphophonological alternations, deletions and contractions. I reformulated this task of segmentation as that of classification. For this purpose, I examined a large corpus, and found empirically 200 types of mapping between an input syllable and an output string. Based on these categories, I built and trained an LSTM-based neural network. With this model of segmentation, the parts-of-speech tagging model is easily trained by the familiar sequence tagging algorithm. By combining these two models and a few dictionaries, I got 98.0% of the F1 score.



Key words: domain knowledge, natural language processing, tagging, segmentation, deep learning

Chung-Ang University, Humanities Research Institute
#828, 310 Hall, 84 Heukseok-ro, Dongjak-gu, Seoul, 06974, Korea  TEL +82-2-881-7354  FAX +82-2-813-7353  E-mail : aihumanities@cau.ac.krCOPYRIGHT(C) 2017-2023 CAU HUMANITIES RESEARCH INSTITUTE ALL RIGHTS RESERVED