التفاصيل البيبلوغرافية
العنوان: |
Exploiting Hanja-Based Resources in Processing Korean Historic Documents Written by Common Literati |
المؤلفون: |
Hyeonseok Moon, Myunghoon Kang, Jaehyung Seo, Sugyeong Eo, Chanjun Park, Yeongwook Yang, Heuiseok Lim |
المصدر: |
IEEE Access, Vol 12, Pp 59909-59919 (2024) |
بيانات النشر: |
IEEE, 2024. |
سنة النشر: |
2024 |
المجموعة: |
LCC:Electrical engineering. Electronics. Nuclear engineering |
مصطلحات موضوعية: |
Natural language processing, deep learning, named entity recognition, sentence segmentation, ancient language processing, Electrical engineering. Electronics. Nuclear engineering, TK1-9971 |
الوصف: |
This research aims to explore the comprehension of historical Korean archives authored by common literati. Numerous endeavors have been made to study Korean historical documents; however, the majority of these endeavors focus solely on royal documents. By comparing the distinct linguistic characteristics between royal and commoner languages, this study challenges the applicability of the royal language-centric approach to commoner documents. In particular, we investigate the feasibility and limitations of existing resources that share the same writing system (Hanja) as historical Korean documents for processing Korean common literati documents. Through our investigation, we propose a simple yet effective methodology that enables the utilization of Hanja-based language resources in processing Korean common literati documents: the removal of special characters. We demonstrate that aligning characteristics of Hanja-based resources allows considerable performance improvements. To the best of our knowledge, our study represents the first research endeavor to concentrate on the comprehension of common literati documents. |
نوع الوثيقة: |
article |
وصف الملف: |
electronic resource |
اللغة: |
English |
تدمد: |
2169-3536 |
Relation: |
https://ieeexplore.ieee.org/document/10504272/; https://doaj.org/toc/2169-3536 |
DOI: |
10.1109/ACCESS.2024.3390181 |
URL الوصول: |
https://doaj.org/article/68b1c30130ae4c73abba799b6170b7c6 |
رقم الانضمام: |
edsdoj.68b1c30130ae4c73abba799b6170b7c6 |
قاعدة البيانات: |
Directory of Open Access Journals |