High-quality assembly of the reference genome for scarlet sage, Salvia splendens, an economically important ornamental plant

التفاصيل البيبلوغرافية
العنوان: High-quality assembly of the reference genome for scarlet sage, Salvia splendens, an economically important ornamental plant
المؤلفون: Ren-Gang Zhang, Hui Liu, Shuai Nie, Ilga Porth, Li Zijing, Yan-Qiang Sun, Hai-Bo Xin, Zhao Zhengnan, Rong-Feng Cui, Cong Richen, Quan-Zheng Yun, Ai-Xiang Dong, Jian-Feng Mao, Xin-Ning Wang, Fatemeh Maghuly
المصدر: GigaScience
بيانات النشر: Oxford University Press (OUP), 2018.
سنة النشر: 2018
مصطلحات موضوعية: 0301 basic medicine, Heterozygote, DNA, Plant, Sequence assembly, Health Informatics, Genomics, Biology, Data Note, Genome, DNA sequencing, 03 medical and health sciences, evolution, Scarlet sage, scarlet sage, single-molecule real-time sequencing, Salvia, reference genome, Phylogeny, Repetitive Sequences, Nucleic Acid, Comparative genomics, Whole genome sequencing, Base Sequence, Whole Genome Sequencing, Molecular Sequence Annotation, biology.organism_classification, Computer Science Applications, Salvia splendens, Phenotype, 030104 developmental biology, annotation, Evolutionary biology, Genome, Plant, Reference genome
الوصف: Background Salvia splendens Ker-Gawler, scarlet or tropical sage, is a tender herbaceous perennial widely introduced and seen in public gardens all over the world. With few molecular resources, breeding is still restricted to traditional phenotypic selection, and the genetic mechanisms underlying phenotypic variation remain unknown. Hence, a high-quality reference genome will be very valuable for marker-assisted breeding, genome editing, and molecular genetics. Findings We generated 66 Gb and 37 Gb of raw DNA sequences, respectively, from whole-genome sequencing of a largely homozygous scarlet sage inbred line using Pacific Biosciences (PacBio) single-molecule real-time and Illumina HiSeq sequencing platforms. The PacBio de novo assembly yielded a final genome with a scaffold N50 size of 3.12 Mb and a total length of 808 Mb. The repetitive sequences identified accounted for 57.52% of the genome sequence, and 54,008 protein-coding genes were predicted collectively with ab initio and homology-based gene prediction from the masked genome. The divergence time between S. splendens and Salvia miltiorrhiza was estimated at 28.21 million years ago (Mya). Moreover, 3,797 species-specific genes and 1,187 expanded gene families were identified for the scarlet sage genome. Conclusions We provide the first genome sequence and gene annotation for the scarlet sage. The availability of these resources will be of great importance for further breeding strategies, genome editing, and comparative genomics among related species.
تدمد: 2047-217X
DOI: 10.1093/gigascience/giy068
URL الوصول: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::b6ab4c67766863033fe5943df42c83c1
https://doi.org/10.1093/gigascience/giy068
Rights: OPEN
رقم الانضمام: edsair.doi.dedup.....b6ab4c67766863033fe5943df42c83c1
قاعدة البيانات: OpenAIRE
الوصف
تدمد:2047217X
DOI:10.1093/gigascience/giy068