Abstract

Motivation

In some prediction analyses, predictors have a natural grouping structure, and selecting predictors while accounting for this additional information could yield more accurate predictions of the outcome. Moreover, in a high dimension low sample size framework, obtaining a good predictive model becomes very challenging. The objective of this work was to investigate the benefits of dimension reduction in penalized regression methods, in terms of prediction performance and variable selection consistency, for high dimension low sample size data. Using two real datasets, we compared the performances of the lasso, elastic net, group lasso, sparse group lasso, sparse partial least squares (PLS), group PLS and sparse group PLS.

Results

Considering dimension reduction in penalized regression methods improved the prediction accuracy. The sparse group PLS reached the lowest prediction error while consistently selecting a few predictors from a single group.

Availability and implementation

R codes for the prediction methods are freely available at https://github.com/SoufianeAjana/Blisar.

Supplementary information

Supplementary data are available at Bioinformatics online.

1 Introduction

High-dimensional data have become of increasing importance in the biological domain. Data generated by high-throughput technologies make it possible to measure up to millions of features at once (Clarke et al., 2008). A new type of information is thus generated, commonly known as ‘omics’ data. Selecting a few predictors associated with a biological or clinical outcome among such high-dimensional data is a challenging task (Filzmoser et al., 2012). Traditional approaches usually fail because of intrinsic multicollinearity among the very large number of potential predictors (James et al., 2017). The concepts of sparsity and penalization have shifted from being used exclusively by statisticians to being commonly used by biologists and clinicians. Note that sparsity here does not refer to techniques dealing with sparse data but instead refers to models having a few non-zero parameters (Hastie et al., 2015). In high-dimensional data, the presence of predictors with very small contributions to predictive power is likely. Keeping these predictors in the model may generate noise, leading to overfitting and lowering the prediction performance when the true vector of parameters is sparse (Géron, 2017).

When the aim is to reach a compromise between model interpretation (i.e. parsimonious model) and prediction performance, many approaches have been proposed in the literature.

Genuer et al. proposed VSURF, a variable selection approach based on random forests (Genuer et al., 2010). Other nonlinear methods that also perform variable selection, such as support vector machines (Zhang et al., 2016) or boosting (Xu et al., 2014), have also been widely discussed in the literature. However, since we position ourselves in a high dimension low sample size (HDLSS) framework, such complex models would tend to overfit our data, while linear models have proved to be more generalizable (Boucher et al., 2015). For instance, penalized linear regression methods allow for variable selection by penalizing the size of the estimated parameters. In particular, the lasso method (Tibshirani, 1994) shrinks the regression coefficients towards zero and estimates some of them to exactly zero. However, in some situations, for example when the predictors are highly correlated, the lasso fails to select the most relevant ones. A generalized version of the lasso, known as elastic net (Zou and Hastie, 2005), tackles this issue by giving highly correlated predictors similar regression coefficients, up to a change of signs if negatively correlated. Alternatively, one can also handle high correlations among predictors by incorporating dimension reduction into penalized regression methods. In particular, sparse partial least squares (sPLS) (Chun and Keleş, 2010; Lê Cao et al., 2008) seeks sparse latent components (i.e. linear combinations of the original predictors) that are highly correlated with the outcome and have a high variance (Hastie et al., 2009). These kinds of approaches have been successfully applied in many domains (Bastien et al., 2015; Lê Cao et al., 2009).

In some applications, predictors have a natural grouping structure, and selecting predictors clustered into groups could be more effective for accurately predicting the outcome than considering single predictors. For instance, in the BLISAR study (presented in Section 3), our objective was to predict retinal omega 3 (n-3) polyunsaturated fatty acids (PUFA) levels from circulating biomarkers measured in blood samples using gas chromatography (GC) and liquid chromatography coupled to electrospray ionization tandem mass spectrometry (LCMS) techniques (Acar, 2012; Berdeaux et al., 2010). We measured these circulating biomarkers from several blood compartments using different methods, which structured them into 5 groups (Supplementary Fig. S2). A prediction model for retinal n-3 PUFA including predictors from a few groups would make the interpretation of the model easier and its use cheaper, by lowering the number of biological analyses to perform. Indeed, since each analysis results in a spectrum allowing for the concomitant measurement of a large number of biomarkers, the number of biomarkers measured in one compartment has little impact on the cost, while adding a compartment increases the cost substantially.

Over the years, some authors proposed extensions to the previously presented statistical methods to take into account the grouping structure of high-dimensional predictors, as shown in Supplementary Figure S1. Yuan and Lin proposed the group lasso (gLasso) (Yuan and Lin, 2006), which selects or discards an entire group of predictors (in an ‘all-in-all-out’ fashion). To achieve bi-level sparsity, a more recent method known as sparse group lasso (sgLasso) (Simon et al., 2013) performs variable selection at the group level but also within each relevant group (Friedman et al., 2010). Along the same lines, sPLS was also extended to group PLS (gPLS) and sparse group PLS (sgPLS) (Liquet et al., 2016). We will refer to sPLS-based approaches as dimension reduction methods in the rest of this article.

Surprisingly, the benefits of dimension reduction in penalized regression approaches have never been investigated when the predictors are structured into groups.

The objective of this article is to compare the prediction performances and the variable selection consistency of seven methods in high-dimensional settings while accounting or not for the group and high correlation structures. The present study aspires to lend insights into best practices when such methods are required.

The rest of this article is organized as follows. In Section 2, we give an overview of penalized regression methods (lasso, gLasso, sgLasso, elastic net) and dimension reduction approaches (sPLS, gPLS, sgPLS). In Section 3, we present the BLISAR study and we compare these methods on this real dataset in terms of variable selection frequency and prediction accuracy, using a repeated double cross-validation scheme. Main results are confirmed on a second dataset described in the Supplementary Material. Finally, we summarize and discuss some perspectives in Section 4.

2 Materials and methods

In regression settings, we commonly use the linear regression model to predict a real-valued response Y from a set of predictors X. When predictors have a natural grouping structure, we can write the linear regression model as:
$$Y = \sum_{l=1}^{L} X_l \beta_l + \varepsilon \qquad (1)$$
where $Y$ is the $n \times 1$ response vector of $n$ observations, $X = [X_1, \ldots, X_L]$ is the $n \times p$ matrix of predictors, and $X_l = [X_{l1}, \ldots, X_{lp_l}]$ is the $n \times p_l$ matrix of the $p_l$ predictors in group $l$, with $l = 1, \ldots, L$ and $p = \sum_{l=1}^{L} p_l$; $\beta = (\beta_1^T, \ldots, \beta_L^T)^T$ is the $p \times 1$ vector of parameters to estimate and $\beta_l = (\beta_{l1}, \ldots, \beta_{lp_l})^T$ is the $p_l \times 1$ vector of parameters associated with the $l$th group. The random error vector $\varepsilon$ ($n \times 1$) is normally distributed with mean zero and constant variance $\sigma^2$. We also assume that the outcome is centered, and therefore no intercept is included in the model.
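To make the grouping structure in model (1) concrete, the sketch below builds a toy design matrix and the corresponding group-membership index in R (the language of the packages used in Section 3.2); the group sizes are hypothetical and chosen only so that they sum to p = 332.

```r
## Minimal sketch (hypothetical group sizes): encode the grouping structure of
## model (1) as a membership vector with one integer per column of X.
set.seed(1)
n  <- 46                      # observations
pl <- c(60, 70, 65, 62, 75)   # hypothetical sizes of the L = 5 groups (sum = 332)
p  <- sum(pl)                 # total number of predictors
X  <- matrix(rnorm(n * p), n, p)
group <- rep(seq_along(pl), times = pl)            # group index: 1,...,1, 2,...,2, ..., 5,...,5
y  <- as.numeric(scale(rnorm(n), scale = FALSE))   # centered outcome, no intercept
```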
A well-known estimator for such a model is the ordinary least squares (OLS) estimator, obtained by minimizing the residual sum of squares (RSS):
$$\hat{\beta} = (X^T X)^{-1} X^T Y$$

However, in high-dimensional settings ($n \ll p$), two issues are to be considered: collinearity of the predictors and the signal-to-noise ratio (i.e. sparsity of the true vector of parameters). Collinearity refers to the presence of redundancy/correlation among some predictors. In this case, $\mathrm{rank}(X) < p$ and $X^T X$ becomes singular (Naes and Mevik, 2001; Tropp and Wright, 2010). In such settings, direct application of traditional variable selection methods (such as stepwise subset selection) may result in lack of stability, high computational effort or both. Since there is then no unique $\hat{\beta}$ minimizing the RSS (Strang, 2016), the estimation process needs to be regularized. Even in low-dimensional settings ($p \ll n$), predictors can be highly correlated and some regularization may still be needed. The signal-to-noise ratio relates to the concept of parsimonious models (only a few predictors associated with the response among a large set of available predictors). Penalized regression and dimension reduction methods are two approaches based on these concepts. The former makes the prior assumption that a few predictors are individually related to the outcome. The latter assumes that a few latent variables (also called underlying components) contribute to the observed covariance between the predictors and the outcome.
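The collinearity issue can be seen directly in a small simulation: with more predictors than observations, $X^T X$ is rank deficient, so the OLS estimator above is not defined. A minimal sketch on simulated data (not the BLISAR data):

```r
## Sketch: with n << p, X'X is rank deficient and cannot be inverted.
set.seed(2)
n0 <- 20; p0 <- 100
X0 <- matrix(rnorm(n0 * p0), n0, p0)
qr(crossprod(X0))$rank      # at most n0 = 20, far below p0 = 100
## solve(crossprod(X0))     # would fail: the matrix is singular
```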

2.1 Penalized regression methods

Penalized regression methods investigated in this work (namely lasso, gLasso, sgLasso and elastic net) perform the estimation of parameters and variable selection simultaneously. Indeed, a penalty term, controlling the size of β, is added to the RSS in the optimization problem in order to reduce the variance and thus stabilize the OLS estimates. When the predictors are structured into groups, the optimization problem to solve becomes:
$$\underset{\beta}{\arg\min} \left\{ \left\| Y - \sum_{l=1}^{L} X_l \beta_l \right\|_2^2 + \lambda \left( \alpha \|\beta\|_1 + (1-\alpha) \sum_{l=1}^{L} \sqrt{p_l}\, \|\beta_l\|_2 \right) \right\} \qquad (2)$$
where $\alpha \in [0, 1]$ is a tuning parameter which controls the combination of the $L_1$ and $L_2$ penalties and $\lambda \geq 0$ is a tuning parameter determining the sparsity of the solution by controlling the bias-variance tradeoff. Larger values of $\lambda$ lead to a sparser vector of estimated parameters $\hat{\beta}$. The $\sqrt{p_l}$ term accounts for the group size. Moreover, note that when $L = p$, all groups are composed of one predictor.
Otherwise, when there is no a priori group assumption, the optimization problem simplifies to:
$$\underset{\beta}{\arg\min} \left\{ \| Y - X\beta \|_2^2 + \lambda \left( \alpha \|\beta\|_1 + (1-\alpha) \|\beta\|_2^2 \right) \right\} \qquad (3)$$

2.1.1 Lasso

The lasso (Tibshirani, 1994) is a shrinkage method imposing an L1 penalty (α=1 in (2) or (3)). The non-differentiability of the L1 penalty at 0 allows an automatic variable selection (by shrinking some of the coefficients to exactly 0). Indeed, when λ is sufficiently large, the lasso produces a sparse solution. However, the lasso suffers from some major limitations: (i) when n<p, the number of selected predictors is bounded by the sample size and (ii) in case of highly correlated predictors, the lasso fails to perform grouped selection and selects instead only one variable from the entire group of correlated predictors.

2.1.2 Elastic net

The elastic net (Zou and Hastie, 2005) is a combination of the L2 and L1 penalties (0<α<1 in (3)) and can be considered as a generalization of the lasso. The L2 penalty (squared) allows the elastic net to account for high collinearity and to select highly correlated predictors (e.g. genes located close on the same chromosome) by giving them similar weights, up to a change of signs if negatively correlated. Moreover, the L1 penalty gives the elastic net the sparse property of the lasso. Finally, the number of selected predictors is not bounded by the sample size as in lasso.
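As an illustration, both the lasso and the elastic net can be fitted with the glmnet package (the package used for the elastic net in Section 3.2); X and y are the hypothetical objects simulated above and the value of alpha is illustrative, not the one tuned in the study.

```r
## Sketch: lasso (alpha = 1) and elastic net (0 < alpha < 1) with glmnet.
library(glmnet)

cv_lasso <- cv.glmnet(X, y, alpha = 1)       # L1 penalty only
cv_enet  <- cv.glmnet(X, y, alpha = 0.5)     # mixture of L1 and (squared) L2 penalties
b_enet   <- coef(cv_enet, s = "lambda.min")  # sparse vector of coefficients
sum(b_enet != 0) - 1                         # number of selected predictors (intercept excluded)
```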

2.1.3 Group lasso

To consider the inherent interconnections inside a natural group of predictors, the gLasso (Yuan and Lin, 2006) mimics the lasso selection procedure but at the group level (α=0 in (2)). Indeed, the L2 penalty (not squared) is non-differentiable at the origin, setting groups of coefficients to exactly 0. In contrast, the elastic net performs grouped selection of highly correlated predictors when the group information is unknown a priori (Zeng et al., 2017). It is worth mentioning that the gLasso is equivalent to the lasso if the size of each group is 1. However, the gLasso is not able to discriminate signal from noise inside a group since it either selects or discards the whole group of predictors. Moreover, the gLasso performs better than lasso when data are truly structured into groups (Huang et al., 2009).

2.1.4 Sparse group lasso

A further refinement of the gLasso is the sgLasso (Simon et al., 2013), which is a convex combination of the gLasso and the lasso penalties (0<α<1 in (2)). Indeed, the sgLasso performs a bi-level selection by combining two nested penalties. The L2 penalty allows for group selection by taking into account the prior group information and the L1 penalty performs within-group selection and produces more parsimonious and more interpretable models. Thus, the sgLasso identifies important groups and discards irrelevant predictors inside each relevant group simultaneously.
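For the group-structured penalties of Equation (2), a minimal sketch with the SGL package (used in Section 3.2) is shown below; 'group' is the hypothetical index vector built earlier and the alpha values are illustrative. In SGL, alpha weighs the lasso part of the penalty, so alpha = 0 gives the gLasso, alpha = 1 the lasso, and intermediate values the sgLasso.

```r
## Sketch: lasso, gLasso and sgLasso fits along the SGL regularization path.
library(SGL)

dat <- list(x = X, y = y)
fit_glasso  <- SGL(dat, index = group, type = "linear", alpha = 0)    # group lasso
fit_sglasso <- SGL(dat, index = group, type = "linear", alpha = 0.5)  # sparse group lasso
fit_lasso   <- SGL(dat, index = group, type = "linear", alpha = 1)    # lasso
dim(fit_sglasso$beta)   # one column of coefficients per lambda value on the path
```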

2.2 Dimension reduction methods

The aforementioned penalized regression methods assume that some predictors contribute individually to the prediction of the outcome. By contrast, dimension reduction approaches assume that only a few latent variables inform the model. For example, each latent variable $T \in \mathbb{R}^n$ of the predictor matrix $X$ ($n \times p$) is constructed as a linear combination of the original predictors, with their weight coefficients stored in a loading vector $u \in \mathbb{R}^p$ such that $T = Xu$. The dimension reduction methods investigated in this work (namely sPLS, gPLS and sgPLS) are designed to relate a matrix of predictors $X$ to a matrix of responses $Y$ ($n \times q$) by maximizing the covariance of their projections onto orthogonal latent variables (also called latent scores). In the present study, we focus on the case of a univariate response ($q = 1$) and one latent dimension, as explained in Section 3.2. Under such conditions, the maximization criterion can be written as:
$$\sum_{l=1}^{L} \left\{ \mathrm{cov}(X_l u_l, Y) - \lambda \left( \alpha \|u_l\|_1 + (1-\alpha) \sqrt{p_l}\, \|u_l\|_2 \right) \right\} \qquad (4)$$
where $u_l$ is the estimated loading vector associated with the $l$th group, $\lambda \geq 0$ is a tuning parameter which determines the amount of penalization, and $\alpha \in [0, 1]$ controls the trade-off between the $L_1$ and $L_2$ penalties. Larger values of $\lambda$ lead to a sparser vector of estimated loadings. The $\sqrt{p_l}$ term accounts for the group size.

2.2.1 Sparse PLS

The sPLS (Chun and Keleş, 2010; Lê Cao et al., 2008) aims at combining variable selection and dimension reduction in a one-step procedure. Indeed, the sPLS performs variable selection to obtain sparse loading vectors by imposing an L1 penalty (α=1 in (4)). This means that only a few original predictors will contribute to each latent variable. Moreover, the sPLS is especially well suited for highly correlated predictors since it considers a contribution from all the relevant predictors when constructing a latent variable. However, if we can structure the data into groups, the sPLS cannot take into account this additional information.

2.2.2 Group PLS

Inspired by the gLasso approach, when the underlying model exhibits a grouping structure, the gPLS (Liquet et al., 2016) aims to select only a few relevant groups of X that are related to Y by imposing an L2 penalty (α=0 in (4)). In gPLS, each latent score is constructed as a linear combination of all the predictors inside the selected groups. However, like the gLasso, the gPLS is not able to select the most predictive variables inside each relevant group.

2.2.3 Sparse group PLS

The sgPLS (Liquet et al., 2016) combines the L1 and the L2 penalties (0<α<1 in (4)). When the objective is to construct latent scores while achieving sparsity at both the group and the individual levels, the sgPLS can be a good alternative to gPLS. Indeed, like the sgLasso, the sgPLS is capable of discriminating important predictors from unimportant ones within each selected group.
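The three dimension reduction methods are implemented in the sgPLS package used in Section 3.2. The sketch below, based on our reading of that package's interface, fits each of them with one latent component on the hypothetical X, y and group sizes pl defined above; keepX, alpha.x and the group boundaries are illustrative values, not the ones tuned in the study.

```r
## Sketch: sPLS, gPLS and sgPLS with a single latent component.
library(sgPLS)

Y <- matrix(y, ncol = 1)
ind.block.x <- cumsum(pl)[-length(pl)]   # cumulative end position of each group but the last

fit_spls  <- sPLS(X, Y, ncomp = 1, mode = "regression", keepX = 8)       # keep 8 predictors
fit_gpls  <- gPLS(X, Y, ncomp = 1, mode = "regression", keepX = 1,
                  ind.block.x = ind.block.x)                             # keep 1 whole group
fit_sgpls <- sgPLS(X, Y, ncomp = 1, mode = "regression", keepX = 1,
                   ind.block.x = ind.block.x, alpha.x = 0.5)             # sparsity within the group
fit_sgpls$loadings$X   # sparse loading vector u of the single component
```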

3 Design of the comparative study

3.1 Real data

The general aim of the BLISAR study is to identify and validate new circulating biomarkers of lipid status that are relevant for retinal aging. In this application, our objective is to predict retinal n-3 PUFA concentrations from circulating biomarkers in post-mortem samples from human donors.

Samples of retina, plasma and red blood cells were collected from human donors free of retinal diseases according to previously published procedures (Acar, 2012; Berdeaux et al., 2010). Retinal n-3 PUFA status was measured using GC. Circulating biomarkers were obtained from 5 sets of analyses (Supplementary Fig. S2): GC applied to lipids from total plasma (PL), cholesteryl esters (CE), phosphatidylcholines (PC) and red blood cells (GR), and structural analyses of red blood cells performed by LCMS as detailed previously (Acar, 2012; Berdeaux et al., 2010). In total, the analyses covered N = 46 subjects and 332 predictors.

3.2 Repeated double cross-validation scheme

We compared the prediction performances of the regression methods on the BLISAR dataset via a repeated double cross-validation scheme. Estimating both the tuning parameters and the prediction errors with a single cross-validation would lead to an overly optimistic estimate of the prediction error (Smit et al., 2007). As an alternative, we designed a double cross-validation scheme that limits overfitting by performing model selection in the internal loop and model assessment in the external loop (Supplementary Fig. S3) (Ambroise and McLachlan, 2002; Baumann and Baumann, 2014).

For all the compared methods, we estimated the tuning parameters in a data-driven fashion. For the dimension reduction methods, we considered a single latent dimension, both because we predict a univariate outcome and to facilitate interpretation of the model. Furthermore, cross-validation can fail to correctly estimate the optimal number of latent variables when the ratio of sample size to number of predictors is very low (Rendall et al., 2017), as in our case. Moreover, the choice of the PLS dimension remains an open research question, as mentioned by several authors (Boulesteix, 2004; Lê Cao et al., 2008).

Our double cross-validation algorithm is as follows (Supplementary Fig. S3); a minimal code sketch is given after the list:

  1. outer cross-validation cycle: randomly split the entire dataset into training (outer train) and test (outer test) sets using 10-fold cross validation (to reduce the sampling dependence and thus better estimate the prediction performance as well as its variability).

  2. inner cross-validation cycle: the outer train portion is used to estimate the optimal tuning parameters using leave-one-out cross-validation and a grid search over the parameter space (Arlot and Celisse, 2010).

  3. using the optimal tuning parameters selected at step 2, estimate the model on the whole outer train set.

  4. predict the outcome values in the outer test set and compute the criteria for evaluating the quality of prediction.
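A minimal sketch of this scheme, using glmnet as the learner for brevity (the study applies the same scheme to all seven methods), on the hypothetical X and y simulated above:

```r
## Sketch of the double cross-validation scheme with glmnet as the learner.
library(glmnet)

K <- 10
outer_fold  <- sample(rep(1:K, length.out = nrow(X)))   # step 1: 10-fold outer split
rmsep_outer <- numeric(K)

for (k in 1:K) {
  train <- outer_fold != k
  ## Step 2: inner cycle on the outer-train set only; here leave-one-out over lambda
  ## (in the study, a grid search over all tuning parameters is used).
  inner <- cv.glmnet(X[train, ], y[train], alpha = 0.5, nfolds = sum(train))
  ## Step 3: the model refitted on the whole outer-train set is stored in `inner`.
  ## Step 4: predict on the untouched outer-test set and compute the error.
  pred <- predict(inner, newx = X[!train, ], s = "lambda.min")
  rmsep_outer[k] <- sqrt(mean((y[!train] - pred)^2))
}
mean(rmsep_outer)   # outer-loop estimate of the prediction error
```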

As recommended, we repeated the double cross-validation procedure 100 times with different random splits into outer train and outer test sets in order to estimate the variance of the prediction performances (Garcia et al., 2014; Martinez et al., 2011; Molinaro et al., 2005). Additionally, Filzmoser et al. reported that repeated double cross-validation is well suited for small datasets (Filzmoser et al., 2012).

We used the CRAN R package SGL to train and test the lasso, the gLasso and the sgLasso. We fitted the sPLS, the gPLS and the sgPLS to our data via the R package sgPLS, which relies heavily on the package mixOmics. For the elastic net, we used the package glmnet.

3.3 Model evaluation criteria

3.3.1 Root-mean-squared error of prediction

The root-mean-squared error of prediction (RMSEP) is frequently used to assess the performance of regressions (Ivanescu et al., 2016; Mevik and Cederkvist, 2004). In the present study, we calculated the RMSEP through cross-validation for both model selection and model assessment by averaging the squared prediction errors of the test sets:
$$\mathrm{RMSEP} = \sqrt{\frac{\sum_{i=1}^{n_{\text{test}}} (y_i - \hat{y}_i)^2}{n_{\text{test}}}}$$
where $n_{\text{test}}$ is the test sample size and $y_i$ (respectively $\hat{y}_i$) is the observed (respectively predicted) value of the outcome for the $i$th individual. Lower values of RMSEP are associated with better performances.
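As a sketch, the RMSEP of a fitted model on a held-out test set can be computed as:

```r
## Sketch: RMSEP between observed and predicted outcome values of a test set.
rmsep <- function(y_obs, y_pred) sqrt(mean((y_obs - y_pred)^2))
```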

3.3.2 Goodness of fit (R2)

We calculated the coefficient of determination (R2) as the square of Pearson correlation coefficient (Feng et al., 2012) between observed and predicted outcome values in the test set. This coefficient evaluates the prediction performance and thus was also used to compare our models. It is noteworthy that the R2, calculated via cross-validation on the test data, assesses the quality of predictions on independent sets (Acharjee, 2013; Rendall et al., 2017).
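Correspondingly, a sketch of the R2 computed as the squared Pearson correlation on the test set:

```r
## Sketch: R2 as the squared Pearson correlation between observed and predicted values.
r2_test <- function(y_obs, y_pred) cor(y_obs, y_pred)^2
```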

4 Results

Table 1 shows the prediction performance of each method according to the RMSEP and the R2, together with the number of predictors selected in at least 60% of the samples. The sgPLS model had the lowest prediction error (RMSEP = 2.27), while selecting only one group (CE) and only 7 predictors inside that group. Interestingly, sPLS behaved very similarly to sgPLS although it does not explicitly consider the grouping structure. Indeed, sPLS had an RMSEP of 2.32 and selected only 8 predictors: 7 lipids from the CE group (identical to those selected by sgPLS) and 1 lipid from the PL group. In comparison, gPLS had a somewhat higher RMSEP (2.43) and also selected only the CE group, but retained all 32 predictors of this group.

Table 1. Comparison of the multivariable regression methods for 10 random divisions with 100 runs (N = 46, P = 332)

Method        Test data R² (SD)   Test data RMSEP (SD)   Number of selected predictors(a)   Selected groups(a)
Lasso         0.14 (0.05)         2.73 (0.14)            4                                  CE, PC
sgLasso       0.20 (0.05)         2.72 (0.16)            143                                CE, PC, LCMS
gLasso        0.21 (0.05)         2.69 (0.15)            285                                CE, PC, PL, LCMS
Elastic net   0.18 (0.05)         2.65 (0.12)            23                                 CE, PC, PL, LCMS
sPLS          0.36 (0.03)         2.32 (0.05)            8                                  CE, PL
gPLS          0.30 (0.03)         2.43 (0.05)            32                                 CE
sgPLS         0.38 (0.02)         2.27 (0.04)            7                                  CE

(a) In at least 60% of the samples.


The Venn diagram in Supplementary Figure S4A displays the intersection between the predictors selected by the three dimension reduction methods.

Without dimension reduction, penalized regression methods exhibited higher RMSEP and lower R2. The lasso had the highest RMSEP and the lowest R2, while selecting only four predictors (from the CE and PC groups). The sgLasso and the gLasso performed similarly to the lasso in terms of RMSEP but retained more groups and many more predictors. Notably, gLasso selected four groups out of five while sgLasso selected three. The groups selected by gLasso included those selected by lasso and sgLasso. The sgLasso selected many predictors inside each group, reaching a total of 143 predictors. The elastic net achieved prediction performances similar to those of gLasso and sgLasso but selected only 23 predictors from four groups. Intersections between the predictors selected by the four penalized regression methods without dimension reduction are displayed in Supplementary Figure S4B. Interestingly, three of the four predictors commonly selected by the penalized regression methods were among the seven commonly selected by the dimension reduction methods.

As a result of the repeated double cross-validation scheme, we observed variability in the tuning parameter estimates and thus in the trained models. Therefore, we also reported the selection frequency of the predictors for each method (see Supplementary Figs S5–S11). Indeed, the most relevant predictors tend to be selected more often during model training. We observed that sgPLS and gPLS were the most stable methods in terms of variable selection frequency. These two methods systematically retained the most frequently selected predictors (i.e. selected over 60% of the time) across the different random splits and over the 100 runs. These findings remained valid even when we slightly lowered the threshold. Nevertheless, sgPLS had the advantage of consistently selecting fewer predictors than gPLS. Furthermore, the standard deviations of the RMSEP and R2 values were lower for sPLS, gPLS and sgPLS than for the penalized regression methods without dimension reduction (Table 1).

To further investigate the variance of the prediction accuracy obtained by each method over the 100 runs, we considered sgPLS as a benchmark. For each of the other methods and for each run, we computed the difference between their RMSEP and that of sgPLS. Supplementary Figure S12 displays the boxplots of these differences and shows that sgPLS outperformed the other methods for all runs (except for sPLS). As mentioned before, although not as good as sgPLS, sPLS performed similarly to sgPLS in terms of prediction accuracy. Of note, the R2 criterion showed similar results (Supplementary Fig. S13).

We also compared the performances of these seven methods on another real dataset (DALIA trial) (Lévy et al., 2014; Liquet et al., 2016). The results are presented in the Supplementary Material. Again, sgPLS reached the best prediction accuracy while consistently selecting only a few relevant predictors from a single group. In summary, adding dimension reduction provided a clear and robust benefit to penalized regression methods in terms of prediction performance and stability of variable selection.

5 Discussion

In this study, we compared the prediction performances of several regression methods, based on their RMSEP and R2, for HDLSS data with a group structure of the predictors. All the compared approaches performed variable selection. The penalized regression methods performed better when combined with dimension reduction. In particular, the lasso had the worst prediction performance. In contrast, sgPLS reached the lowest RMSEP (and the highest R2) and almost systematically yielded a better predictive model than the other approaches. Interestingly, sPLS behaved similarly to sgPLS in terms of prediction performance.

In terms of variable selection, at the group level, sgPLS selected predictors from only one group (CE), whereas sPLS selected predictors from two different groups (CE and PL). Since selecting fewer groups would help to diminish the related costs, we retained sgPLS as the best approach. The fact that sgPLS consistently selected only a few predictors (7) suggests that the signal is relatively sparse, with only a few relevant predictors.

From a biological point of view, since CE was the only group selected by sgPLS (and gPLS), prediction of retinal n-3 PUFA concentrations may rely on this analysis only, thereby greatly simplifying the analytical work. This observation is consistent with some of our preliminary findings, showing that retinal n-3 PUFA correlated strongly with n-3 in CE of the underlying vascular structure (retinal pigment epithelium/choroid) (Bretillon et al., 2008). To our knowledge, this is the first study showing the benefits of dimension reduction in penalized regression methods while accounting for the grouping structure.

All the methods compared in the present study had a common objective: predict the outcome while dealing with different levels of collinearity and sparsity by discarding irrelevant predictors. There is, however, no guarantee that these kinds of approaches will always give the best results in all situations. The best prediction method usually depends on the nature and the underlying structure of the data at hand, which cannot be known beforehand. If the data contain numerous noise predictors that can be discarded, then sparse methods may yield high prediction performances. Otherwise, if the true model is not parsimonious (many predictors driving the response), a method using a linear combination of all the predictors, such as PLS or ridge regression, would likely yield better prediction performances than sparse methods. In the biological domain, the number of subjects is often small and the measured quantities are generally highly correlated. If one has prior information about the group structure of the data and aims at selecting fewer groups, then sgPLS seems to be a good approach.

Some of the considered models did not achieve good prediction performances. This could be due firstly to the linearity assumption of all the compared models. The true relationship between the outcome and the predictors may be nonlinear. However, we could not apply more complex methods (e.g. neural network or support vector machines) because of our small sample size. Such approaches would tend to overfit our data and predict with less accuracy (Boucher et al., 2015). In contrast, linear models tend to be more generalizable and may outperform nonlinear approaches in case of a small training sample size or sparse data (Hastie et al., 2001). Secondly, some of the applied methods may not be consistent in terms of variable selection, which could lower their prediction performances. Particularly, lasso shrinks each regression coefficient by the same amount. Thus, it heavily penalizes large coefficients and could lead to inconsistent model selection (Zou, 2006). As gLasso and sgLasso are built on lasso, they may suffer from similar problems (Fang et al., 2015) and may also tend to select irrelevant predictors in the model. Additionally, when the data is structured into few groups and when each group contains more predictors than observations, the sgLasso is not expected to perform well in terms of variable selection (Simon et al., 2013).

In contrast, adaptive lasso (Zou, 2006), adaptive gLasso (Wei and Huang, 2010) and adaptive sgLasso (Fang et al., 2015) remedy these shortcomings by using adaptive weights for penalizing different regression coefficients. Thus, the adaptive alternatives to lasso, gLasso and sgLasso are selection consistent. However, the adaptive methods’ performances depend on the initial estimator used in their initial selection step. Therefore, there is a high risk of missing important predictors with an inappropriate initial estimator (Benner et al., 2010). Thirdly, it is also possible that the true active set of predictors was not included as input. Indeed, it is very likely that the concentrations of circulating n-3 PUFA measured in blood samples are not sufficient to predict the retinal concentrations of n-3 PUFA with a high accuracy.

Some other techniques not investigated in the present work could also be good alternatives to reach better generalization performances. Stacked generalization, also called stacking or blending, consists in combining the predictions obtained from several models to form a final set of predictions (Wolpert, 1992). This approach has been successfully applied in many domains, especially in machine learning challenges (e.g. the Netflix challenge) (Sill et al., 2009), but it makes interpretation of the selected associations more challenging, and the gain in prediction performance is often not worth the complexity of the final model. Furthermore, interesting interpretation properties could also be obtained with the orthogonal projections to latent structures (OPLS) method, which removes variation from the predictor matrix that is not correlated with the outcome (Féraud et al., 2017; Trygg and Wold, 2002). In particular, OPLS modeling of a univariate outcome requires only one predictive component. However, sparse generalizations of OPLS taking into account the group structure of the data are not yet implemented and could be investigated in future work.

In conclusion, one objective of this study was to assess the benefits of dimension reduction in penalized linear regression approaches for small samples with high-dimensional, group-structured predictors. The other objective was to lend insights into best practices when such methods are needed. Adding dimension reduction while considering both the group structure and the high correlations made it possible to select the most biologically relevant group of predictors and improved the prediction performance.

Acknowledgements

BLISAR Study Group:

Niyazi Acar1, Soufiane Ajana2, Olivier Berdeaux1, Sylvain Bouton3, Lionel Bretillon1, Alain Bron1,4, Benjamin Buaud5, Stéphanie Cabaret1, Audrey Cougnard-Grégoire2, Catherine Creuzot-Garcher1,4, Cécile Delcourt2, Marie-Noelle Delyfer2,6, Catherine Féart-Couret2, Valérie Febvret1, Stéphane Grégoire1, Zhiguo He7, Jean-François Korobelnik2,6, Lucy Martine1, Bénédicte Merle2 and Carole Vaysse5

1Centre des Sciences du Goût et de l'Alimentation, AgroSup Dijon, CNRS, INRA, Université Bourgogne Franche-Comté, Dijon, France,

2Inserm, Bordeaux Population Health Research Center, Team LEHA, UMR 1219, University of Bordeaux, F-33000 Bordeaux, France,

3Laboratoires Théa, Clermont-Ferrand, France,

4Department of Ophthalmology, University Hospital, Dijon, France,

5ITERG—Equipe Nutrition Métabolisme & Santé, Bordeaux, France,

6Service d’Ophtalmologie, CHU de Bordeaux, F-33000 Bordeaux, France and

7Laboratory for Biology, Imaging, and Engineering of Corneal Grafts, EA2521, Faculty of Medicine, University Jean Monnet, Saint-Etienne, France

Funding

This work was supported by the grants from Agence Nationale de la Recherche [ANR-14-CE12-0020-01 BLISAR]; the Conseil Régional Bourgogne, Franche-Comté [PARI grant]; the FEDER (European Funding for Regional Economical Development); the Fondation de France/Fondation de l'œil.

Conflict of Interest: C.D. is a consultant for Allergan, Bausch+Lomb, Laboratoires Théa, Novartis and Roche

References

Acar N. et al. (2012) Lipid composition of the human eye: are red blood cells a good mirror of retinal and optic nerve fatty acids? PLoS One, 7, e35102.
Acharjee A. (2013) Comparison of regularized regression methods for ~omics data. Metabol., 3, 126.
Ambroise C., McLachlan G.J. (2002) Selection bias in gene extraction on the basis of microarray gene-expression data. Proc. Natl. Acad. Sci. USA, 99, 6562–6566.
Arlot S., Celisse A. (2010) A survey of cross-validation procedures for model selection. Stat. Surv., 4, 40–79.
Bastien P. et al. (2015) Deviance residuals-based sparse PLS and sparse kernel PLS regression for censored data. Bioinformatics, 31, 397–404.
Baumann D., Baumann K. (2014) Reliable estimation of prediction errors for QSAR models under model uncertainty using double cross-validation. J. Cheminf., 6, 47.
Benner A. et al. (2010) High-dimensional Cox models: the choice of penalty as part of the model building process. Biom. J., 52, 50–69.
Berdeaux O. et al. (2010) Identification and quantification of phosphatidylcholines containing very-long-chain polyunsaturated fatty acid in bovine and human retina using liquid chromatography/tandem mass spectrometry. J. Chromatogr. A, 1217, 7738–7748.
Boucher T.F. et al. (2015) A study of machine learning regression methods for major elemental analysis of rocks using laser-induced breakdown spectroscopy. Spectrochim. Acta B Atomic Spectr., 107, 1–10.
Boulesteix A.-L. (2004) PLS dimension reduction for classification with microarray data. Stat. Appl. Genet. Mol. Biol., 3, Article 33.
Bretillon L. et al. (2008) Lipid and fatty acid profile of the retina, retinal pigment epithelium/choroid, and the lacrimal gland, and associations with adipose tissue fatty acids in human subjects. Exp. Eye Res., 87, 521–528.
Chun H., Keleş S. (2010) Sparse partial least squares regression for simultaneous dimension reduction and variable selection. J. R. Stat. Soc. Ser. B Stat. Methodol., 72, 3–25.
Clarke R. et al. (2008) The properties of high-dimensional data spaces: implications for exploring gene and protein expression data. Nat. Rev. Cancer, 8, 37–49.
Fang K. et al. (2015) Bi-level variable selection via adaptive sparse group lasso. J. Stat. Comput. Simul., 85, 2750–2760.
Feng Z.Z. et al. (2012) The LASSO and sparse least square regression methods for SNP selection in predicting quantitative traits. IEEE/ACM Trans. Comput. Biol. Bioinform., 9, 629–636.
Féraud B. et al. (2017) Combining strong sparsity and competitive predictive power with the L-sOPLS approach for biomarker discovery in metabolomics. Metabolomics, 13, 130.
Filzmoser P. et al. (2012) Review of sparse methods in regression and classification with application to chemometrics. J. Chemometr., 26, 42–51.
Friedman J. et al. (2010) A note on the group lasso and a sparse group lasso. https://arxiv.org/abs/1001.0736v1.
Garcia T.P. et al. (2014) Identification of important regressor groups, subgroups and individuals via regularization methods: application to gut microbiome data. Bioinformatics, 30, 831–837.
Genuer R. et al. (2010) Variable selection using random forests. Pattern Recogn. Lett., 31, 2225–2236.
Géron A. (2017) Hands-On Machine Learning with Scikit-Learn and TensorFlow: Concepts, Tools, and Techniques to Build Intelligent Systems. O'Reilly Media, Sebastopol, CA, pp. 54–56.
Hastie T. et al. (2001) The Elements of Statistical Learning: Data Mining, Inference, and Prediction. Springer, New York.
Hastie T. et al. (2009) The Elements of Statistical Learning: Data Mining, Inference, and Prediction. 2nd edn. Springer-Verlag New York Inc., New York.
Hastie T. et al. (2015) Statistical Learning with Sparsity: The Lasso and Generalizations. 1st edn. Chapman and Hall/CRC, Boca Raton.
Huang J. et al. (2009) Learning with structured sparsity. In: Proceedings of the 26th Annual International Conference on Machine Learning (ICML '09), pp. 1–8. doi: 10.1145/1553374.1553429.
Ivanescu A.E. et al. (2016) The importance of prediction model validation and assessment in obesity and nutrition research. Int. J. Obes. (Lond.), 40, 887–894.
James G. et al. (2017) An Introduction to Statistical Learning: With Applications in R. 1st edn 2013, corr. 7th printing 2017. Springer, New York.
Lê Cao K.-A. et al. (2008) A sparse PLS for variable selection when integrating omics data. Stat. Appl. Genet. Mol. Biol., 7, Article 35.
Lê Cao K.-A. et al. (2009) integrOmics: an R package to unravel relationships between two omics datasets. Bioinformatics, 25, 2855–2856.
Lévy Y. et al. (2014) Dendritic cell-based therapeutic vaccine elicits polyfunctional HIV-specific T-cell immunity associated with control of viral load: clinical immunology. Eur. J. Immunol., 44, 2802–2810.
Liquet B. et al. (2016) Group and sparse group partial least square approaches applied in genomics context. Bioinformatics, 32, 35–42.
Martinez J.G. et al. (2011) Empirical performance of cross-validation with oracle methods in a genomics context. Am. Stat., 65, 223–228.
Mevik B.-H., Cederkvist H.R. (2004) Mean squared error of prediction (MSEP) estimates for principal component regression (PCR) and partial least squares regression (PLSR). J. Chemometr., 18, 422–429.
Molinaro A.M. et al. (2005) Prediction error estimation: a comparison of resampling methods. Bioinformatics, 21, 3301–3307.
Naes T., Mevik B.-H. (2001) Understanding the collinearity problem in regression and discriminant analysis. J. Chemometr., 15, 413–426.
Rendall R. et al. (2017) Advanced predictive methods for wine age prediction: part I – a comparison study of single-block regression approaches based on variable selection, penalized regression, latent variables and tree-based ensemble methods. Talanta, 171, 341–350.
Sill J. et al. (2009) Feature-weighted linear stacking. https://arxiv.org/abs/0911.0460v2.
Simon N. et al. (2013) A sparse-group lasso. J. Comput. Graph. Stat., 22, 231–245.
Smit S. et al. (2007) Assessing the statistical validity of proteomics based biomarkers. Anal. Chim. Acta, 592, 210–217.
Strang G. (2016) Introduction to Linear Algebra. 5th edn. Wellesley-Cambridge Press, Wellesley, MA.
Tibshirani R. (1994) Regression shrinkage and selection via the lasso. J. R. Stat. Soc. Ser. B, 58, 267–288.
Tropp J.A., Wright S.J. (2010) Computational methods for sparse solution of linear inverse problems. Proc. IEEE, 98, 948–958.
Trygg J., Wold S. (2002) Orthogonal projections to latent structures (O-PLS). J. Chemometr., 16, 119–128.
Wei F., Huang J. (2010) Consistent group selection in high-dimensional linear regression. Bernoulli (Andover), 16, 1369–1384.
Wolpert D.H. (1992) Stacked generalization. Neural Netw., 5, 241–259.
Xu Z. et al. (2014) Gradient boosted feature selection. In: Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD '14), ACM, New York, NY, USA, pp. 522–531.
Yuan M., Lin Y. (2006) Model selection and estimation in regression with grouped variables. J. R. Stat. Soc. Ser. B (Stat. Methodol.), 68, 49–67.
Zeng B. et al. (2017) A link-free sparse group variable selection method for single-index model. J. Appl. Stat., 44, 2388–2400.
Zhang X. et al. (2016) Variable selection for support vector machines in moderately high dimensions. J. R. Stat. Soc. Ser. B Stat. Methodol., 78, 53–76.
Zou H. (2006) The adaptive lasso and its oracle properties. J. Am. Stat. Assoc., 101, 1418–1429.
Zou H., Hastie T. (2005) Regularization and variable selection via the elastic net. J. R. Stat. Soc. Ser. B (Stat. Methodol.), 67, 301–320.

Author notes

The members of the BLISAR Study Group are provided in the Acknowledgements section.
