Natural Language Processing Methods for the Study of Protein-Ligand Interactions

التفاصيل البيبلوغرافية
العنوان: Natural Language Processing Methods for the Study of Protein-Ligand Interactions
المؤلفون: Michels, James, Bandarupalli, Ramya, Akbari, Amin Ahangar, Le, Thai, Xiao, Hong, Li, Jing, Hom, Erik F. Y.
سنة النشر: 2024
المجموعة: Computer Science
Quantitative Biology
مصطلحات موضوعية: Quantitative Biology - Quantitative Methods, Computer Science - Computation and Language
الوصف: Recent advances in Natural Language Processing (NLP) have ignited interest in developing effective methods for predicting protein-ligand interactions (PLIs) given their relevance to drug discovery and protein engineering efforts and the ever-growing volume of biochemical sequence and structural data available. The parallels between human languages and the "languages" used to represent proteins and ligands have enabled the use of NLP machine learning approaches to advance PLI studies. In this review, we explain where and how such approaches have been applied in the recent literature and discuss useful mechanisms such as long short-term memory, transformers, and attention. We conclude with a discussion of the current limitations of NLP methods for the study of PLIs as well as key challenges that need to be addressed in future work.
Comment: 52 Pages and 3 Figures
نوع الوثيقة: Working Paper
URL الوصول: http://arxiv.org/abs/2409.13057
رقم الانضمام: edsarx.2409.13057
قاعدة البيانات: arXiv