Academic Journal

FloraTraiter: Automated parsing of traits from descriptive biodiversity literature

التفاصيل البيبلوغرافية
العنوان: FloraTraiter: Automated parsing of traits from descriptive biodiversity literature
المؤلفون: Ryan A. Folk, Robert P. Guralnick, Raphael T. LaFrance
المصدر: Applications in Plant Sciences, Vol 12, Iss 1, Pp n/a-n/a (2024)
بيانات النشر: Wiley, 2024.
سنة النشر: 2024
المجموعة: LCC:Biology (General)
LCC:Botany
مصطلحات موضوعية: biodiversity literature, flora, functional trait, language model, natural language parsing, Biology (General), QH301-705.5, Botany, QK1-989
الوصف: Abstract Premise Plant trait data are essential for quantifying biodiversity and function across Earth, but these data are challenging to acquire for large studies. Diverse strategies are needed, including the liberation of heritage data locked within specialist literature such as floras and taxonomic monographs. Here we report FloraTraiter, a novel approach using rule‐based natural language processing (NLP) to parse computable trait data from biodiversity literature. Methods FloraTraiter was implemented through collaborative work between programmers and botanical experts and customized for both online floras and scanned literature. We report a strategy spanning optical character recognition, recognition of taxa, iterative building of traits, and establishing linkages among all of these, as well as curational tools and code for turning these results into standard morphological matrices. Results Over 95% of treatment content was successfully parsed for traits with
نوع الوثيقة: article
وصف الملف: electronic resource
اللغة: English
تدمد: 2168-0450
Relation: https://doaj.org/toc/2168-0450
DOI: 10.1002/aps3.11563
URL الوصول: https://doaj.org/article/ddb3de9b6f6841bd9022c5b50ca2ee54
رقم الانضمام: edsdoj.b3de9b6f6841bd9022c5b50ca2ee54
قاعدة البيانات: Directory of Open Access Journals
الوصف
تدمد:21680450
DOI:10.1002/aps3.11563