Humanities Data for the Age of Artificial Intelligence
Ba-Ro Kim, HK Research Professor, Chungang University
This paper reviews how humanities data has developed so far and considers the direction it may take in the era of the Fourth Industrial Revolution. It first traces the history of character encoding, the means by which computers recognize characters, from ASCII to Unicode, and reflects on the essence of character recognition. It then examines the concepts of human-readable and machine-readable data, along with the structure of the corpus as a basic form of machine-readable data. Next, it examines N-gram and Word2Vec, methods by which machines assign meaning to data, and discusses their advantages and disadvantages. It also explores the Semantic Web, which takes the opposite approach: meaning is assigned to the data by humans rather than derived by machines. Finally, based on these developments, the paper considers what future humanities data may look like.
Key words: Artificial Intelligence Humanities, humanities data, semantic web, Human Readable Data, Machine Readable Data