Comparison of multivariate linear regression and a machine learning algorithm developed for prediction of precision warfarin dosing in a Korean population

التفاصيل البيبلوغرافية
العنوان: Comparison of multivariate linear regression and a machine learning algorithm developed for prediction of precision warfarin dosing in a Korean population
المؤلفون: Il-Yong Han, Hoang Dat Nguyen, Ho-Sook Kim, Yong-Soon Cho, Sangzin Ahn, Dae-Kyeong Kim, Jae-Gook Shin, Van Lam Nguyen
المصدر: Journal of Thrombosis and Haemostasis. 19:1676-1686
بيانات النشر: Elsevier BV, 2021.
سنة النشر: 2021
مصطلحات موضوعية: Genotype, Mean squared error, 030204 cardiovascular system & hematology, Machine learning, computer.software_genre, Cohort Studies, Machine Learning, 03 medical and health sciences, 0302 clinical medicine, Vitamin K Epoxide Reductases, Bayesian multivariate linear regression, Republic of Korea, Linear regression, Covariate, Humans, Cytochrome P-450 CYP2C9, Mathematics, business.industry, Linear model, Anticoagulants, Hematology, Test set, Linear Models, Warfarin, Gradient boosting, VKORC1, Artificial intelligence, business, computer, Algorithm, Algorithms
الوصف: Background Personalized warfarin dosing is influenced by various factors including genetic and non-genetic factors. Multiple linear regression (LR) is known as a conventional method to develop predictive models. Recently, machine learning approaches have been extensively implemented for warfarin dosing due to the hypothesis of non-linear association between covariates and stable warfarin dose. Objective To extend the multiple linear regression algorithm for personalized warfarin dosing in a Korean population and compare with a machine learning--based algorithm. Method From this cohort study, we collected information on 650 patients taking warfarin who achieved steady state including demographic information, indications, comorbidities, comedications, habits, and genetic factors. The dataset was randomly split into training set (90%) and test set (10%). The LR and machine learning (gradient boosting machine [GBM]) models were developed on the training set and were evaluated on the test set. Result LR and GBM models were comparable in terms of accuracy of ideal dose (75.38% and 73.85%), correlation (0.77 and 0.73), mean absolute error (0.58 mg/day and 0.64 mg/day), and root mean square error (0.82 mg/day and 0.9 mg/day), respectively. VKORC1 genotype, CYP2C9 genotype, age, and weight were the highest contributors and could obtain 80% of maximum performance in both models. Conclusion This study shows that our LR and GMB models are satisfactory to predict warfarin dose in our dataset. Both models showed similar performance and feature contribution characteristics. LR may be the appropriate model due to its simplicity and interpretability.
تدمد: 1538-7836
DOI: 10.1111/jth.15318
URL الوصول: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::f8418bb49a124017d4c38f810d6ca753
https://doi.org/10.1111/jth.15318
Rights: CLOSED
رقم الانضمام: edsair.doi.dedup.....f8418bb49a124017d4c38f810d6ca753
قاعدة البيانات: OpenAIRE
الوصف
تدمد:15387836
DOI:10.1111/jth.15318