Naveau, Marion Kon Kam King, Guillaume Rincent, Renaud Sansonnet, Laure Delattre, Maud
High-dimensional variable selection, with many more covariates than observations, is widely documented in standard regression models, but there are still few tools to address it in non-linear mixed-effects models where data are collected repeatedly on several individuals. In this work, variable selection is approached from a Bayesian perspective an...
Nguyen, Trungtin Nguyen, Dung Ngoc Chamroukhi, Faicel
We are motivated by the problem of identifying potentially nonlinear regression relationships between high-dimensional outputs and high-dimensional inputs of heterogeneous data. This requires regression, clustering, and model selection, simultaneously. In this framework, we apply the mixture of experts models which are among the most popular ensemb...
Nguyen, Dung Ngoc Chamroukhi, Faïcel
We are motivated by the problem of identifying potentiallynonlinear regression relationships between high-dimensional outputs andhigh-dimensional inputs of heterogeneous data. This requires simultaneousregression, clustering, and model selection. In this framework, weconsider a case of mixture of experts models characterized by multipleGaussian exp...
Gomtsyan, M.
In this thesis, we propose novel variable selection methods for sparse GLARMA (Generalised Linear Autoregressive Moving Average) models, which can be used for modelling discrete-valued time series. This models allow us to introduce some dependence in a Generalised Linear Model (GLM). Specifically, in Chapter 2, we present an estimation procedure fo...
Youssfi, Younès
Sudden cardiac death (SCD) is defined as a sudden natural death presumed to be of cardiac cause, heralded by abrupt loss of consciousness in the presence of witness, or in the absence of witness occurring within an hour after the onset of symptoms. Despite progress in clinical profiling and interventions, it remains a major public health problem, a...
Yuan, Shuai De Roover, Kim Van Deun, Katrijn
Published in
Behavior research methods
The growing availability of high-dimensional data sets offers behavioral scientists an unprecedented opportunity to integrate the information hidden in the novel types of data (e.g., genetic data, social media data, and GPS tracks, etc.,) and thereby obtain a more detailed and comprehensive view towards their research questions. In the context of c...
Capanu, Marinela Giurcanu, Mihai Begg, Colin B Gönen, Mithat
Published in
Computational statistics & data analysis
A novel variable selection method for low-dimensional generalized linear models is introduced. The new approach called AIC OPTimization via STABility Selection (OPT-STABS) repeatedly subsamples the data, minimizes Akaike's Information Criterion (AIC) over a sequence of nested models for each subsample, and includes in the final model those predicto...
Chounta, Stefania
The average recovery rate for childhood cancers is now 70 to 80%. Radiotherapy is one of the most recommended treatments, but it can cause critical iatrogenic effects in the long term, notably radiation-induced valvulopathy. Models identifying the patients most at risk could make it possible to personalize follow-up protocols and identify these adv...
McKearnan, Shannon B Vock, David M Marai, G Elisabeta Canahuate, Guadalupe Fuller, Clifton D Wolfson, Julian
Published in
Biostatistics (Oxford, England)
Support vector regression (SVR) is particularly beneficial when the outcome and predictors are nonlinearly related. However, when many covariates are available, the method's flexibility can lead to overfitting and an overall loss in predictive accuracy. To overcome this drawback, we develop a feature selection method for SVR based on a genetic algo...
Pi, Lira Halabi, Susan
Published in
Diagnostic and Prognostic Research
BackgroundBuilding prognostic models of clinical outcomes is an increasingly important research task and will remain a vital area in genomic medicine. Prognostic models of clinical outcomes are usually built and validated utilizing variable selection methods and machine learning tools. The challenges, however, in ultra-high dimensional space are no...