Academic Journal

Full-length SMRT transcriptome sequencing and microsatellite characterization in Paulownia catalpifolia

التفاصيل البيبلوغرافية
العنوان: Full-length SMRT transcriptome sequencing and microsatellite characterization in Paulownia catalpifolia
المؤلفون: Yanzhi Feng, Yang Zhao, Jiajia Zhang, Baoping Wang, Chaowei Yang, Haijiang Zhou, Jie Qiao
المصدر: Scientific Reports, Vol 11, Iss 1, Pp 1-12 (2021)
بيانات النشر: Nature Portfolio, 2021.
سنة النشر: 2021
المجموعة: LCC:Medicine
LCC:Science
مصطلحات موضوعية: Medicine, Science
الوصف: Abstract Paulownia catalpifolia is an important, fast-growing timber species known for its high density, color and texture. However, few transcriptomic and genetic studies have been conducted in P. catalpifolia. In this study, single-molecule real-time sequencing technology was applied to obtain the full-length transcriptome of P. catalpifolia leaves treated with varying degrees of drought stress. The sequencing data were then used to search for microsatellites, or simple sequence repeats (SSRs). A total of 28.83 Gb data were generated, 25,969 high-quality (HQ) transcripts with an average length of 1624 bp were acquired after removing the redundant reads, and 25,602 HQ transcripts (98.59%) were annotated using public databases. Among the HQ transcripts, 16,722 intact coding sequences, 149 long non-coding RNAs and 179 alternative splicing events were predicted, respectively. A total of 7367 SSR loci were distributed throughout 6293 HQ transcripts, of which 763 complex SSRs and 6604 complete SSRs. The SSR appearance frequency was 28.37%, and the average distribution distance was 5.59 kb. Among the 6604 complete SSR loci, 1–3 nucleotide repeats were dominant, occupying 97.85% of the total SSR loci, of which mono-, di- and tri-nucleotide repeats were 44.68%, 33.86% and 19.31%, respectively. We detected 112 repeat motifs, of which A/T (42.64%), AG/CT (12.22%), GA/TC (9.63%), GAA/TTC (1.57%) and CCA/TGG (1.54%) were most common in mono-, di- and tri-nucleotide repeats, respectively. The length of the repeat SSR motifs was 10–88 bp, and 4997 (75.67%) were ≤ 20 bp. This study provides a novel full-length transcriptome reference for P. catalpifolia and will facilitate the identification of germplasm resources and breeding of new drought-resistant P. catalpifolia varieties.
نوع الوثيقة: article
وصف الملف: electronic resource
اللغة: English
تدمد: 2045-2322
Relation: https://doaj.org/toc/2045-2322
DOI: 10.1038/s41598-021-87538-8
URL الوصول: https://doaj.org/article/f6a28d88a4df426294f6890f5447bf13
رقم الانضمام: edsdoj.f6a28d88a4df426294f6890f5447bf13
قاعدة البيانات: Directory of Open Access Journals
الوصف
تدمد:20452322
DOI:10.1038/s41598-021-87538-8