Academic Journal
New Developments in Sparse PLS Regression
العنوان: | New Developments in Sparse PLS Regression |
---|---|
المؤلفون: | Jérémy Magnanensi, Myriam Maumy-Bertrand, Nicolas Meyer, Frédéric Bertrand |
المصدر: | Frontiers in Applied Mathematics and Statistics, Vol 7 (2021) |
بيانات النشر: | Frontiers Media S.A., 2021. |
سنة النشر: | 2021 |
المجموعة: | LCC:Applied mathematics. Quantitative methods LCC:Probabilities. Mathematical statistics |
مصطلحات موضوعية: | variable selection, partial least squares, sparse partial least squares, generalized partial least squares, bootstrap, stability, Applied mathematics. Quantitative methods, T57-57.97, Probabilities. Mathematical statistics, QA273-280 |
الوصف: | Methods based on partial least squares (PLS) regression, which has recently gained much attention in the analysis of high-dimensional genomic datasets, have been developed since the early 2000s for performing variable selection. Most of these techniques rely on tuning parameters that are often determined by cross-validation (CV) based methods, which raises essential stability issues. To overcome this, we have developed a new dynamic bootstrap-based method for significant predictor selection, suitable for both PLS regression and its incorporation into generalized linear models (GPLS). It relies on establishing bootstrap confidence intervals, which allows testing of the significance of predictors at preset type I risk α, and avoids CV. We have also developed adapted versions of sparse PLS (SPLS) and sparse GPLS regression (SGPLS), using a recently introduced non-parametric bootstrap-based technique to determine the numbers of components. We compare their variable selection reliability and stability concerning tuning parameters determination and their predictive ability, using simulated data for PLS and real microarray gene expression data for PLS-logistic classification. We observe that our new dynamic bootstrap-based method has the property of best separating random noise in y from the relevant information with respect to other methods, leading to better accuracy and predictive abilities, especially for non-negligible noise levels. |
نوع الوثيقة: | article |
وصف الملف: | electronic resource |
اللغة: | English |
تدمد: | 2297-4687 |
Relation: | https://www.frontiersin.org/articles/10.3389/fams.2021.693126/full; https://doaj.org/toc/2297-4687 |
DOI: | 10.3389/fams.2021.693126 |
URL الوصول: | https://doaj.org/article/0a8ab28962a346b5bc74e0bb4189d93c |
رقم الانضمام: | edsdoj.0a8ab28962a346b5bc74e0bb4189d93c |
قاعدة البيانات: | Directory of Open Access Journals |
تدمد: | 22974687 |
---|---|
DOI: | 10.3389/fams.2021.693126 |