References of "Rauschenberger, Armin 50034419"
     in
Bookmark and Share    
Full Text
Peer Reviewed
See detailPredictive and interpretable models via the stacked elastic net
Rauschenberger, Armin UL; Glaab, Enrico UL; van de Wiel, Mark

in Bioinformatics (in press)

Motivation: Machine learning in the biomedical sciences should ideally provide predictive and interpretable models. When predicting outcomes from clinical or molecular features, applied researchers often ... [more ▼]

Motivation: Machine learning in the biomedical sciences should ideally provide predictive and interpretable models. When predicting outcomes from clinical or molecular features, applied researchers often want to know which features have effects, whether these effects are positive or negative, and how strong these effects are. Regression analysis includes this information in the coefficients but typically renders less predictive models than more advanced machine learning techniques. Results: Here we propose an interpretable meta-learning approach for high-dimensional regression. The elastic net provides a compromise between estimating weak effects for many features and strong effects for some features. It has a mixing parameter to weight between ridge and lasso regularisation. Instead of selecting one weighting by tuning, we combine multiple weightings by stacking. We do this in a way that increases predictivity without sacrificing interpretability. Availability and Implementation: The R package starnet is available on GitHub: https://github.com/rauschenberger/starnet. [less ▲]

Detailed reference viewed: 146 (7 UL)
Full Text
Peer Reviewed
See detailSparse classification with paired covariates
Rauschenberger, Armin UL; Ciocănea-Teodorescu, Iuliana; Jonker, Marianne A. et al

in Advances in Data Analysis and Classification (in press)

This paper introduces the paired lasso: a generalisation of the lasso for paired covariate settings. Our aim is to predict a single response from two high-dimensional covariate sets. We assume a one-to ... [more ▼]

This paper introduces the paired lasso: a generalisation of the lasso for paired covariate settings. Our aim is to predict a single response from two high-dimensional covariate sets. We assume a one-to-one correspondence between the covariate sets, with each covariate in one set forming a pair with a covariate in the other set. Paired covariates arise, for example, when two transformations of the same data are available. It is often unknown which of the two covariate sets leads to better predictions, or whether the two covariate sets complement each other. The paired lasso addresses this problem by weighting the covariates to improve the selection from the covariate sets and the covariate pairs. It thereby combines information from both covariate sets and accounts for the paired structure. We tested the paired lasso on more than 2000 classification problems with experimental genomics data, and found that for estimating sparse but predictive models, the paired lasso outperforms the standard and the adaptive lasso. The R package palasso is available from CRAN. [less ▲]

Detailed reference viewed: 134 (14 UL)
Full Text
Peer Reviewed
See detailTesting for association between RNA-Seq and high-dimensional data
Rauschenberger, Armin UL; Jonker, Marianne A.; van de Wiel, Mark A. et al

in BMC Bioinformatics (2016), 17

Background: Testing for association between RNA-Seq and other genomic data is challenging due to high variability of the former and high dimensionality of the latter. Results: Using the negative binomial ... [more ▼]

Background: Testing for association between RNA-Seq and other genomic data is challenging due to high variability of the former and high dimensionality of the latter. Results: Using the negative binomial distribution and a random-effects model, we develop an omnibus test that overcomes both difficulties. It may be conceptualised as a test of overall significance in regression analysis, where the response variable is overdispersed and the number of explanatory variables exceeds the sample size. Conclusions: The proposed test can detect genetic and epigenetic alterations that affect gene expression. It can examine complex regulatory mechanisms of gene expression. The R package globalSeq is available from Bioconductor. [less ▲]

Detailed reference viewed: 41 (1 UL)