Academic Journal

Lexical Normalization of User-Generated Medical Forum Data

Bibliographic Details
Title: Lexical Normalization of User-Generated Medical Forum Data
Authors: Dirkson, A.R., Verberne, S., Kraaij, W.
Contributors: Weissenbacher D., Gonzalez-Hernandez G.
Source: Proceedings of the Fourth Social Media Mining for Health Applications (SMM4H) Workshop & Shared Task
Publication Year: 2019
Collection: Leiden Repository (Leiden University)
Description: In the medical domain, user-generated social media text is increasingly used as a valuable complementary knowledge source to scientific medical literature. The extraction of this knowledge is complicated by colloquial language use and misspellings. Yet, lexical normalization of such data has not been addressed properly. This paper presents an unsupervised, data-driven spelling correction module for medical social media. Our method outperforms state-of-the-art spelling correction and can detect mistakes with an F0.5 of 0.888. Additionally, we present a novel corpus for spelling mistake detection and correction on a medical patient forum. ; Algorithms and the Foundations of Software technology
Document Type: article in journal/newspaper
File Description: application/pdf
Language: English
Relation: https://www.aclweb.org/anthology/W19-3202; lucris-id: 110558890; https://hdl.handle.net/1887/80813
DOI: 10.18653/v1/W19-3202
Availability: https://hdl.handle.net/1887/80813
https://www.aclweb.org/anthology/W19-3202
https://doi.org/10.18653/v1/W19-3202
Accession Number: edsbas.E7FA95BC
Database: BASE