|
|
|
|
LEADER |
04076nam a2200565Ia 4500 |
001 |
10.1186-s12859-021-04011-z |
008 |
220427s2021 CNT 000 0 und d |
020 |
|
|
|a 14712105 (ISSN)
|
245 |
1 |
0 |
|a Empowering individual trait prediction using interactions for precision medicine
|
260 |
|
0 |
|b BioMed Central Ltd
|c 2021
|
856 |
|
|
|z View Fulltext in Publisher
|u https://doi.org/10.1186/s12859-021-04011-z
|
520 |
3 |
|
|a Background: One component of precision medicine is to construct prediction models with their predicitve ability as high as possible, e.g. to enable individual risk prediction. In genetic epidemiology, complex diseases like coronary artery disease, rheumatoid arthritis, and type 2 diabetes, have a polygenic basis and a common assumption is that biological and genetic features affect the outcome under consideration via interactions. In the case of omics data, the use of standard approaches such as generalized linear models may be suboptimal and machine learning methods are appealing to make individual predictions. However, most of these algorithms focus mostly on main or marginal effects of the single features in a dataset. On the other hand, the detection of interacting features is an active area of research in the realm of genetic epidemiology. One big class of algorithms to detect interacting features is based on the multifactor dimensionality reduction (MDR). Here, we further develop the model-based MDR (MB-MDR), a powerful extension of the original MDR algorithm, to enable interaction empowered individual prediction. Results: Using a comprehensive simulation study we show that our new algorithm (median AUC: 0.66) can use information hidden in interactions and outperforms two other state-of-the-art algorithms, namely the Random Forest (median AUC: 0.54) and Elastic Net (median AUC: 0.50), if interactions are present in a scenario of two pairs of two features having small effects. The performance of these algorithms is comparable if no interactions are present. Further, we show that our new algorithm is applicable to real data by comparing the performance of the three algorithms on a dataset of rheumatoid arthritis cases and healthy controls. As our new algorithm is not only applicable to biological/genetic data but to all datasets with discrete features, it may have practical implications in other research fields where interactions between features have to be considered as well, and we made our method available as an R package (https://github.com/imbs-hl/MBMDRClassifieR). Conclusions: The explicit use of interactions between features can improve the prediction performance and thus should be included in further attempts to move precision medicine forward. © 2021, The Author(s).
|
650 |
0 |
4 |
|a algorithm
|
650 |
0 |
4 |
|a Algorithms
|
650 |
0 |
4 |
|a Bioinformatics
|
650 |
0 |
4 |
|a Classification
|
650 |
0 |
4 |
|a Coronary artery disease
|
650 |
0 |
4 |
|a Decision trees
|
650 |
0 |
4 |
|a Diabetes Mellitus, Type 2
|
650 |
0 |
4 |
|a Dimensionality reduction
|
650 |
0 |
4 |
|a Diseases
|
650 |
0 |
4 |
|a Feature extraction
|
650 |
0 |
4 |
|a Forecasting
|
650 |
0 |
4 |
|a Generalized linear model
|
650 |
0 |
4 |
|a Genetic epidemiologies
|
650 |
0 |
4 |
|a genetics
|
650 |
0 |
4 |
|a human
|
650 |
0 |
4 |
|a Humans
|
650 |
0 |
4 |
|a Individual prediction
|
650 |
0 |
4 |
|a Interactions
|
650 |
0 |
4 |
|a Learning systems
|
650 |
0 |
4 |
|a machine learning
|
650 |
0 |
4 |
|a Machine learning
|
650 |
0 |
4 |
|a Machine Learning
|
650 |
0 |
4 |
|a Machine learning methods
|
650 |
0 |
4 |
|a multifactor dimensionality reduction
|
650 |
0 |
4 |
|a Multifactor Dimensionality Reduction
|
650 |
0 |
4 |
|a Multifactor dimensionality reductions
|
650 |
0 |
4 |
|a non insulin dependent diabetes mellitus
|
650 |
0 |
4 |
|a personalized medicine
|
650 |
0 |
4 |
|a Personalized medicine
|
650 |
0 |
4 |
|a Power, Psychological
|
650 |
0 |
4 |
|a Precision Medicine
|
650 |
0 |
4 |
|a Prediction
|
650 |
0 |
4 |
|a Prediction performance
|
650 |
0 |
4 |
|a Predictive analytics
|
650 |
0 |
4 |
|a State-of-the-art algorithms
|
700 |
1 |
|
|a Gola, D.
|e author
|
700 |
1 |
|
|a König, I.R.
|e author
|
773 |
|
|
|t BMC Bioinformatics
|