UBC-NLP at SemEval-2019 Task 6: Ensemble Learning of Offensive Content With Enhanced Training Data

Bibliographic Details
Title: UBC-NLP at SemEval-2019 Task 6: Ensemble Learning of Offensive Content With Enhanced Training Data
Authors: Muhammad Abdul-Mageed, Chiyu Zhang, Arun Rajendran
Source: SemEval@NAACL-HLT
Publication Year: 2019
Subject Terms: FOS: Computer and information sciences, Training set, Computer Science - Computation and Language, business.industry, Computer science, Offensive, 02 engineering and technology, computer.software_genre, Ensemble learning, SemEval, Task (project management), 020204 information systems, Content (measure theory), 0202 electrical engineering, electronic engineering, information engineering, 020201 artificial intelligence & image processing, Artificial intelligence, business, Computation and Language (cs.CL), computer, Host (network), Natural language processing
Description: We examine learning offensive content on Twitter with limited, imbalanced data. For this purpose, we investigate the utility of various data enhancement methods combined with a host of classical ensemble classifiers. Among the 75 participating teams in SemEval-2019 sub-task B, our system ranks 6th (with a macro F1-score of 0.706). For sub-task C, among the 65 participating teams, our system ranks 9th (with a macro F1-score of 0.587).
7 pages, 2 figures, Proceedings of the 13th International Workshop on Semantic Evaluation (SemEval)
Language: English
Access URL: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::442e4cd3bc3510b4aea690853029c167
http://arxiv.org/abs/1906.03692
Rights: OPEN
Accession Number: edsair.doi.dedup.....442e4cd3bc3510b4aea690853029c167
Database: OpenAIRE