Predicting drug solubility in organic solvents mixtures: A machine-learning approach supported by high-throughput experimentation

التفاصيل البيبلوغرافية
العنوان:	Predicting drug solubility in organic solvents mixtures: A machine-learning approach supported by high-throughput experimentation
المؤلفون:	Cenci F., Diab S., Ferrini P., Harabajiu C., Barolo M., Bezzo F., Facco P.
المساهمون:	Cenci, F., Diab, S., Ferrini, P., Harabajiu, C., Barolo, M., Bezzo, F., Facco, P.
بيانات النشر:	ELSEVIER
سنة النشر:	2024
المجموعة:	Padua Research Archive (IRIS - Università degli Studi di Padova)
مصطلحات موضوعية:	Crystallisation, Drug solubility prediction, High-throughput experimentation, Machine-learning, Mixtures of organic solvent, Novel drug solubility data
الوصف:	A novel approach based on supervised machine -learning is proposed to predict the solubility of drugs and druglike molecules in mixtures of organic solvents. Similar to quantitative structure -property relationship (QSPR) models, different solvent types are identified by molecular descriptors, which, in this study, are considered as UNIFAC subgroups. To overcome the potential lack of UNIFAC subgroups for the complex Active Pharmaceutical Ingredients (APIs) currently developed in the pharmaceutical industry, the API molecule is considered as a unique entity in the proposed modelling approach. Therefore, API solubility is predicted as a function of temperature, functional subgroups of the solvents and composition of the solvent mixture; in turn, regressors ' correlation is handled through Partial Least -Squares (PLS) regression. The method is developed and tested with experimental data of a real API and 14 organic solvents that are industrially employed for crystallisation. Solubility predictions are accurate and precise for single solvents, binary mixtures and ternary mixtures of organic solvents at different compositions and temperatures, with a determination coefficient R 2 >= 0.90. To further test the applicability of the model, the proposed approach is applied to 9 literature organic solubility datasets of drugs and drug -like compounds and compared to benchmark solubility models in the literature. Results show that the proposed approach provides satisfactory predictions: the majority of validation and calibration data have R 2 = 0.95 -0.99; the ratio between RMSE (root mean squared error) of the proposed method and the range of measured solubility values is from 1 to 3 orders of magnitude smaller than the RMSE ratio obtained by the benchmark models.
نوع الوثيقة:	article in journal/newspaper
اللغة:	English
Relation:	info:eu-repo/semantics/altIdentifier/pmid/38763309; info:eu-repo/semantics/altIdentifier/wos/WOS:001259758200001; volume:660; journal:INTERNATIONAL JOURNAL OF PHARMACEUTICS; https://hdl.handle.net/11577/3525948
DOI:	10.1016/j.ijpharm.2024.124233
الاتاحة:	https://hdl.handle.net/11577/3525948 https://doi.org/10.1016/j.ijpharm.2024.124233
Rights:	info:eu-repo/semantics/openAccess
رقم الانضمام:	edsbas.7BA6FDE7
قاعدة البيانات:	BASE

View record in BASE

الوصف
DOI:	10.1016/j.ijpharm.2024.124233