Galaxy CLIP-Explorer: a web server for CLIP-Seq data analysis

Bibliographic details
Title: Galaxy CLIP-Explorer: a web server for CLIP-Seq data analysis
Authors: Michael Uhl, Florian Heyl, Rolf Backofen, Daniel Maticzka
Source: GigaScience
Publication information: Oxford University Press, 2020.
Publication year: 2020
Subject terms: Data Analysis, Web server, Clip seq, CLIP-Seq, Computer science, AcademicSubjects/SCI02254, Health Informatics, Review, computer.software_genre, Nucleotide level, 03 medical and health sciences, 0302 clinical medicine, 030304 developmental biology, 0303 health sciences, Sequence, Sequence Analysis, RNA, High-Throughput Nucleotide Sequencing, RNA-Binding Proteins, Pipeline (software), Computer Science Applications, Peak detection, Galaxy, AcademicSubjects/SCI00960, RNA, Chromatin Immunoprecipitation Sequencing, Data mining, protein, ICLIP, computer, Peak calling, 030217 neurology & neurosurgery
Description: Background: Post-transcriptional regulation via RNA-binding proteins plays a fundamental role in every organism, but the underlying regulatory mechanisms remain poorly understood. They can be elucidated by cross-linking immunoprecipitation in combination with high-throughput sequencing (CLIP-Seq). CLIP-Seq answers questions about the functional role of an RNA-binding protein and its targets by determining binding sites at the nucleotide level, together with the associated sequence and structural binding patterns. In recent years the amount of CLIP-Seq data has skyrocketed, creating a need for automated data analysis that can deal with different experimental set-ups. However, noncanonical data, new protocols, and a huge variety of tools, especially for peak calling, have made it difficult to define a standard. Findings: CLIP-Explorer is a flexible and reproducible data analysis pipeline for iCLIP data that, for the first time, also supports eCLIP, FLASH, and uvCLAP data. Individual steps such as peak calling can be changed to adapt to different experimental settings. We validate CLIP-Explorer on eCLIP data, finding similar or nearly identical motifs for various proteins in comparison with other databases. In addition, we detect new sequence motifs for PTBP1 and U2AF2. Finally, we optimize the peak calling with 3 different peak callers on RBFOX2 data, discuss the difficulty of the peak-calling step, and give advice for different experimental set-ups. Conclusion: CLIP-Explorer meets the demand for a flexible CLIP-Seq data analysis pipeline that is applicable to current CLIP protocols. The article further shows the limitations of current peak-calling algorithms and the importance of robust peak detection.
Language: English
ISSN: 2047-217X
Access URL: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::97280c4662e8489126e6ae8459c11af2
http://europepmc.org/articles/PMC7657819
Rights: OPEN
Accession number: edsair.doi.dedup.....97280c4662e8489126e6ae8459c11af2
Database: OpenAIRE