VTAM: A robust pipeline for validating metabarcoding data using internal controls

Bibliographic details
Title: VTAM: A robust pipeline for validating metabarcoding data using internal controls
Authors: Emmanuel Corse, Aitor González, Thomas Dechatre, Vincent Dubut, Emese Meglécz, Reda Mekdad
Contributors: Aix Marseille Université (AMU), Theories and Approaches of Genomic Complexity (TAGC), Aix Marseille Université (AMU)-Institut National de la Santé et de la Recherche Médicale (INSERM), Institut méditerranéen de biodiversité et d'écologie marine et continentale (IMBE), Avignon Université (AU)-Aix Marseille Université (AMU)-Institut de recherche pour le développement [IRD] : UMR237-Centre National de la Recherche Scientifique (CNRS), MARine Biodiversity Exploitation and Conservation (UMR MARBEC), Institut de Recherche pour le Développement (IRD)-Institut Français de Recherche pour l'Exploitation de la Mer (IFREMER)-Université de Montpellier (UM)-Centre National de la Recherche Scientifique (CNRS), Centre National de la Recherche Scientifique (CNRS)-Institut de recherche pour le développement [IRD] : UMR237-Aix Marseille Université (AMU)-Avignon Université (AU), Centre National de la Recherche Scientifique (CNRS)-Université de Montpellier (UM)-Institut Français de Recherche pour l'Exploitation de la Mer (IFREMER)-Institut de Recherche pour le Développement (IRD), Meglécz, Emese
Publication information: HAL CCSD, 2021.
Publication year: 2021
Subject terms: [SDE] Environmental Sciences, 0106 biological sciences, FASTQ format, 0303 health sciences, negative control, Data curation, Computer science, false negatives, mock sample, Negative control, false positives, Amplicon, computer.software_genre, 010603 evolutionary biology, 01 natural sciences, Pipeline (software), 03 medical and health sciences, replicates, metabarcoding, taxonomic assignation, [SDE]Environmental Sciences, Table (database), Data mining, computer, 030304 developmental biology
Description: Metabarcoding studies should be carefully designed to minimize false positive and false negative occurrences. The use of internal controls, replicates, and several overlapping markers is expected to improve the bioinformatics data analysis. VTAM is a tool that performs all steps of data curation, from raw FASTQ data to a taxonomically assigned ASV (Amplicon Sequence Variant, or simply variant) table. It addresses all known technical error types and includes other features rarely present in existing pipelines for validating metabarcoding data: filtering parameters are obtained from internal control samples; cross-sample contamination and tag-jump are controlled; technical replicates are used to ensure repeatability; and data obtained from several overlapping markers are handled. Two datasets were analysed by VTAM and the results were compared to those obtained with a pipeline based on DADA2. The false positive occurrences in samples were considerably higher when curated by DADA2, which is likely due to the lack of control for tag-jump and cross-sample contamination. VTAM is a robust tool to validate metabarcoding data and improve traceability, reproducibility, and comparability between runs and datasets.
File description: application/pdf
Language: English
Access URL: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::ff7b61c34b2b8c2999edfbdaac8c8d46
https://amu.hal.science/hal-03144831
Rights: OPEN
Accession number: edsair.doi.dedup.....ff7b61c34b2b8c2999edfbdaac8c8d46
Database: OpenAIRE