Academic Journal

Klumpy: A tool to evaluate the integrity of long‐read genome assemblies and illusive sequence motifs.

التفاصيل البيبلوغرافية
العنوان: Klumpy: A tool to evaluate the integrity of long‐read genome assemblies and illusive sequence motifs.
المؤلفون: Madrigal, Giovanni, Minhas, Bushra Fazal, Catchen, Julian
المصدر: Molecular Ecology Resources; Jan2025, Vol. 25 Issue 1, p1-15, 15p
مصطلحات موضوعية: SNAKEHEADS (Fish), GENOMICS, RESEARCH personnel, GENOMES, LOCUS (Genetics), NUCLEOTIDE sequencing
مستخلص: The improvement and decreasing costs of third‐generation sequencing technologies has widened the scope of biological questions researchers can address with de novo genome assemblies. With the increasing number of reference genomes, validating their integrity with minimal overhead is vital for establishing confident results in their applications. Here, we present Klumpy, a tool for detecting and visualizing both misassembled regions in a genome assembly and genetic elements (e.g. genes) of interest in a set of sequences. By leveraging the initial raw reads in combination with their respective genome assembly, we illustrate Klumpy's utility by investigating antifreeze glycoprotein (afgp) loci across two icefishes, by searching for a reported absent gene in the northern snakehead fish, and by scanning the reference genomes of a mudskipper and bumblebee for misassembled regions. In the two former cases, we were able to provide support for the noncanonical placement of an afgp locus in the icefishes and locate the missing snakehead gene. Furthermore, our genome scans were able identify an unmappable locus in the mudskipper reference genome and identify a putative repetitive element shared among several species of bees. see also the Perspective by Isheng Jason Tsai. [ABSTRACT FROM AUTHOR]
Copyright of Molecular Ecology Resources is the property of Wiley-Blackwell and its content may not be copied or emailed to multiple sites or posted to a listserv without the copyright holder's express written permission. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
قاعدة البيانات: Complementary Index
الوصف
تدمد:1755098X
DOI:10.1111/1755-0998.13982