Academic Journal

Reproducibly sampling SARS-CoV-2 genomes across time, geography, and viral diversity

Bibliographic Details
Title: Reproducibly sampling SARS-CoV-2 genomes across time, geography, and viral diversity
Authors: Bolyen, Evan; Dillon, Matthew R.; Bokulich, Nicholas (ORCID: 0000-0002-1784-8935); Ladner, Jason T.; Larsen, Brendan B.; Hepp, Crystal M.; Lemmer, Darrin; Sahl, Jason W.; Sanchez, Andrew; Holdgraf, Chris; Sewell, Chris; Choudhury, Aakash G.; Stachurski, John; McKay, Matthew; Engelthaler, David M.; Worobey, Michael; Keim, Paul; Caporaso, J. Gregory
Source: F1000Research, 9
Publisher Information: F1000Research
Publication Year: 2020
Collection: ETH Zürich Research Collection
Subject Terms: SARS-CoV-2, genome-sampler, QIIME 2, Bioinformatics, Genomics
Description: The COVID-19 pandemic has led to a rapid accumulation of SARS-CoV-2 genomes, enabling genomic epidemiology on local and global scales. Collections of genomes from resources such as GISAID must be subsampled to enable computationally feasible phylogenetic and other analyses. We present genome-sampler, a software package that supports sampling collections of viral genomes across multiple axes including time of genome isolation, location of genome isolation, and viral diversity. The software is modular in design so that these or future sampling approaches can be applied independently and combined (or replaced with a random sampling approach) to facilitate custom workflows and benchmarking. genome-sampler is written as a QIIME 2 plugin, ensuring that its application is fully reproducible through QIIME 2’s unique retrospective data provenance tracking system. genome-sampler can be installed in a conda environment on macOS or Linux systems. A complete default pipeline is available through a Snakemake workflow, so subsampling can be achieved using a single command. genome-sampler is open source, free for all to use, and available at https://caporasolab.us/genome-sampler. We hope that this will facilitate SARS-CoV-2 research and support evaluation of viral genome sampling approaches for genomic epidemiology. ; ISSN:2046-1402
Document Type: article in journal/newspaper
File Description: application/pdf
Language: English
Relation: http://hdl.handle.net/20.500.11850/431213
DOI: 10.3929/ethz-b-000431213
Availability: https://hdl.handle.net/20.500.11850/431213
https://doi.org/10.3929/ethz-b-000431213
Rights: info:eu-repo/semantics/openAccess ; http://creativecommons.org/licenses/by/4.0/ ; Creative Commons Attribution 4.0 International
Accession Number: edsbas.7C303C46
Database: BASE