Skip to main content

Advertisement

Log in

The ridge prediction error sum of squares statistic in linear mixed models

  • Published:
Metrika Aims and scope Submit manuscript

Abstract

In case of multicollinearity, PRESS statistics has been proposed to be used in the selection of the ridge biasing parameter of the ridge estimator which is introduced as an alternative to BLUE. This newly proposed PRESS statistic for the ridge estimator, \(\textit{CPRESS}_{k}\), depends on the conditional ridge residual and can be computed once at a time by fitting the linear mixed model with all the observations. We also define \(R^2_{RidPred}\) statistic to evaluate the predictive ability of the ridge fit. Since the PRESS statistic for the BLUE is a special \(\textit{CPRESS}_{k}\) statistic, we indirectly also give closed form solution of the PRESS statistic for the BLUE. Then, we compared the predictive performance of the linear mixed model via the statistics \( \textit{CPRESS}_{k}\), \(GCV_{k}\) and \(C_{p}\) by considering a real data analysis and a simulation study where the optimal ridge biaisng parameter is obtained by minimizing each statistic. The study shows that the ridge predictors improve the predictive performance of a linear mixed model over BLUE in the presence of multicollinearity and each statistic gives a different optimum ridge biasing value and they show the best predictive performance at their optimum ridge biasing values. In addition, the simulation study has shown that the intensity of variance and multicollinearity is effective in determining the optimum ridge biasing value and this optimum ridge biasing value is effective on the superiority of the predictive performance of ridge estimator over BLUE.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6

Similar content being viewed by others

References

  • Abraham AA (2008) Model selection methods in the linear mixed model for longitudinal data [PhD thesis]. Department of Biostatistics, University of North Carolina

  • Açar TS, Özkale MR (2016) Cross validation of ridge regression estimator in autocorrelated linear regression models. J Statist Comput Simul 86:2429–2440

    Article  MathSciNet  MATH  Google Scholar 

  • Allen DM (1974) The relationship between variable selection and data agumentation and a method for prediction. Technometrics 16:125–127

    Article  MathSciNet  MATH  Google Scholar 

  • Belsley DA, Kuh E, Welsch RE (1980) Regression Diagnostics. Wiley, New York

    Book  MATH  Google Scholar 

  • Christensen R, Pearson LM, Johnson W (1992) Case deletion diagnostics for mixed models. Technometrics 34:38–45

    Article  MathSciNet  MATH  Google Scholar 

  • Edwards LJ, Muller KE, Wolfinger RD, Qaqish BJ, Schabenberger O (2008) An \(R^{2}\) statistic for fixed effects in the linear mixed model. Stat Med 27:6137–6157

    Article  MathSciNet  Google Scholar 

  • Elden L (1984) A note on the computation of the generalized cross-validation function for ill-conditioned least squares problems. BIT 24:462–472

    MathSciNet  MATH  Google Scholar 

  • Eliot M (2011) Tree-based methods and a mixed ridge estimator for analyzing longitudinal data with correlated predictors [Ph. D. Thesis]. Department of Biostatistics, University of Massachusetts Amherst

  • Harville DA (1977) Maximum likelihood approaches to variance component estimation and to related problems. J Amer Statist Assoc 72:320–338

    Article  MathSciNet  MATH  Google Scholar 

  • Hemmerle WJ, Hartley HO (1973) Computing maximum likelihood estimates for the mixed AOV model using the W transformation. Technometrics 15:819–831

    MathSciNet  MATH  Google Scholar 

  • Henderson CR, Kempthorne O, Searle SR, Von Krosig CN (1959) Estimation of environmental and genetic trends from records subject to culling. Biometrics 15:192–218

    Article  MATH  Google Scholar 

  • Hodges JS, Sargent DJ (2001) Counting degrees of freedom in hierarchical and other richly-parameterised models. Biometrika 88:367–379

    Article  MathSciNet  MATH  Google Scholar 

  • Hoerl AE, Kennard RW (1970) Ridge regression: biased estimation for nonorthogonal problems. Technometrics 12:55–67

    Article  MATH  Google Scholar 

  • Krzanowski WJ (1987) Selection of variables to preserve multivariate data structure, using principal components. J R Statist Soc C 36:22–33

    Google Scholar 

  • Kuran Ö, Özkale MR (2016) Gilmour’s approach to mixed and stochastic restricted ridge predictions in linear mixed models. Linear Algeb Appl 508:22–47

    Article  MathSciNet  MATH  Google Scholar 

  • Kuran Ö, Özkale MR (2021) Marginal ridge conceptual predictive model selection criterion in linear mixed models. Comm Statist Simul Comput 50:581–607

    Article  MathSciNet  MATH  Google Scholar 

  • Kuran Ö, Özkale MR (2019) Model selection via conditional conceptual predictive statistic under ridge regression in linear mixed models. J Statist Comput Simul 89:155–187

    Article  MathSciNet  MATH  Google Scholar 

  • Liu H, Weiss RE, Jennrich RI, Wenger NS (1999) PRESS model selection in repeated measures data. Comput Statist Data Anal 30:169–184

    Article  MathSciNet  MATH  Google Scholar 

  • Liu XQ, Jiang HY (2012) Optimal generalized ridge estimator under the generalized cross-validation criterion in linear regression. Linear Algeb Appl 436:1238–1245

    Article  MathSciNet  MATH  Google Scholar 

  • Liu XQ, Hu P (2013) General ridge predictors in a mixed linear model. J Theor Appl Statist 47:363–378

    MathSciNet  MATH  Google Scholar 

  • Mallows CL (1973) Some comments on Cp. Technometrics 15:661–675

    MATH  Google Scholar 

  • McCulloch CE, Searle SR (2001) Generalized, linear, and mixed models. Wiley, New York

    MATH  Google Scholar 

  • McDonald MC, Galarneau DI (1975) A Monte Carlo evaluation of some ridge-type estimators. J Amer Statist Assoc 70:407–416

    Article  MATH  Google Scholar 

  • Montgomery DC, Friedman DJ (1993) Prediction using regression models with multicollinear predictor variables. IIE Trans 25:73–85

    Article  Google Scholar 

  • Montgomery DC, Peck EA, Vining GG (2012) Introduction to linear regression analysis, 5th edn. Wiley, New York

    MATH  Google Scholar 

  • Newhouse JP, Oman SD (1971) An evaluation of ridge estimators. Rand Corporation R-716-PR:1-28

  • Orelien JG, Edwards LJ (2008) Fixed effect variable selection in linear mixed models using \(R^{2}\) statistics. Comput Stat Data Anal 52:1896–1907

    Article  MATH  Google Scholar 

  • Özkale MR (2015) Predictive performance of linear regression models. Statist Papers 56:531–567

    Article  MathSciNet  MATH  Google Scholar 

  • Özkale MR, Can F (2017) An evaluation of ridge estimator in linear mixed models: an example from kidney failure data. J Appl Stat 44:2251–2269

    Article  MathSciNet  MATH  Google Scholar 

  • Özkale MR, Kuran Ö (2018) Principal components regression and r-k class predictions in linear mixed models. Linear Algeb Appl 543:173–204

    Article  MathSciNet  MATH  Google Scholar 

  • Pinherio JC, Bates DM (2000) Mixed-effects in S and S-Plus. Springer-Verlag, New York

    Book  Google Scholar 

  • Potthoff RF, Roy SN (1964) A generalized multivariate analysis of variance model especially useful for growth curve problems. Biometrika 51:313–326

    Article  MathSciNet  MATH  Google Scholar 

  • Robinson GK (1991) That BLUP is a good thing: the estimation of random effects (with discussion). Statist Sci 6:15–51

    MathSciNet  MATH  Google Scholar 

  • Schabenberger O (2004) Mixed model influence diagnostics. Statist Data Anal 1:17

    Google Scholar 

  • Searle SR, Casella G, McCulloch CE (1992) Variance Components. Wiley, New York

    Book  MATH  Google Scholar 

  • Sheather SJ (2009) A Modern Approach to Regression with R. Springer Science and Business Media, New York

    Book  MATH  Google Scholar 

  • Vonesh EF, Chinchilli VM (1997) Linear and Nonlinear Models for the Analysis of Repeated Measurements. Marcel Dekker, New York

    MATH  Google Scholar 

  • Wahba G, Golub GH, Heath M (1979) Generalized cross-validation as a method for choosing a good ridge parameter. Technometrics 21:215–223

    Article  MathSciNet  MATH  Google Scholar 

  • Wang J, Schaalje GB (2009) Model selection for linear mixed models using predictive criteria. Comm Statist Simul Comput 38:788–801

    Article  MathSciNet  MATH  Google Scholar 

  • Wenren C (2014) Mixed model selection based on the conceptual predictive statistic. Ph. D. Thesis, Department of Mathematics and Statistics, University of Bowling Green State

  • Wenren C, Shang J, Pan J (2016) Marginal conceptual predictive statistic for mixed model selection. Open J Statist 6:239–253

    Article  MATH  Google Scholar 

  • Wenren C, Shang J (2016) Conditional conceptual predictive statistic for mixed model selection. J Appl Stat 43:585–603

    Article  MathSciNet  MATH  Google Scholar 

  • Tarpey T (2000) A note on the prediction sum of squares statistic for restricted least squares. Am Stat 54:116–118

    MathSciNet  Google Scholar 

  • Zewotir T, Galphin JS (2005) Influence diagnostics for linear mixed models. J Data Sci 3:153–177

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Özge Kuran.

Ethics declarations

Conflict of interest

No potential conflict of interest was reported by the authors.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Kuran, Ö., Özkale, M.R. The ridge prediction error sum of squares statistic in linear mixed models. Metrika (2023). https://doi.org/10.1007/s00184-023-00927-z

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s00184-023-00927-z

Keywords

Navigation