Function that performs an 'automatic' search for the optimal penalty parameter for the ridgeP call by applying Brent's method to the minimization of a cross-validated negative log-likelihood score.

optPenalty.kCVauto(
  Y,
  lambdaMin,
  lambdaMax,
  lambdaInit = (lambdaMin + lambdaMax)/2,
  fold = nrow(Y),
  cor = FALSE,
  target = default.target(covML(Y)),
  type = "Alt"
)

Arguments

Y

Data matrix. Variables assumed to be represented by columns.

lambdaMin

A numeric giving the minimum value for the penalty parameter.

lambdaMax

A numeric giving the maximum value for the penalty parameter.

lambdaInit

A numeric giving the initial (starting) value for the penalty parameter.

fold

A numeric or integer specifying the number of folds to apply in the cross-validation.

cor

A logical indicating if the evaluation of the cross-validated negative log-likelihood score should be performed on the correlation scale.

target

A target matrix (in precision terms) for Type I ridge estimators.

type

A character indicating the type of ridge estimator to be used. Must be one of: "Alt", "ArchI", "ArchII".

Value

An object of class list:

optLambda

A numeric giving the optimal value for the penalty parameter.

optPrec

A matrix representing the precision matrix of the chosen type (see ridgeP) under the optimal value of the penalty parameter.

Details

The function determines the optimal value of the penalty parameter by applying the Brent algorithm (1971) to the \(K\)-fold cross-validated negative log-likelihood score (using a ridge estimator for the precision matrix; see ridgeP). The search is automatic in the sense that the Brent method requires only a minimum value, a maximum value, and a starting value for the penalty parameter. The penalty value at which the \(K\)-fold cross-validated negative log-likelihood score is minimized is deemed optimal. The function employs the Brent algorithm as implemented in the optim function.
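
To make the mechanics concrete, below is a minimal sketch of this optimization, not the package's internal code: a scoring function evaluates, for a given penalty value, the Gaussian negative log-likelihood of ridge estimates fitted on the training folds against the left-out folds, and the Brent method in optim minimizes that score over the supplied interval. The helper name kcvScore and the deterministic fold assignment are illustrative assumptions.

library(rags2ridges)

## Illustrative K-fold CV score for a given lambda (not package internals):
## for each fold k, fit a ridge precision estimate on the training part and
## score it on the left-out part via the Gaussian negative log-likelihood,
## tr(S_k %*% Omega) - log det(Omega).
kcvScore <- function(lambda, Y, fold, target) {
  ids   <- rep(seq_len(fold), length.out = nrow(Y))  # deterministic split
  score <- 0
  for (k in seq_len(fold)) {
    Omega <- ridgeP(covML(Y[ids != k, , drop = FALSE]),
                    lambda = lambda, target = target)
    Sk    <- covML(Y[ids == k, , drop = FALSE])
    score <- score + sum(Sk * Omega) - determinant(Omega)$modulus
  }
  as.numeric(score) / fold
}

## Brent's method, as implemented in optim, minimizes the score over
## the interval [lower, upper]
set.seed(333)
Y <- matrix(rnorm(10 * 25), nrow = 10)
optim(par = 15, fn = kcvScore, method = "Brent", lower = .001, upper = 30,
      Y = Y, fold = 5, target = default.target(covML(Y)))$par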

Note

When cor = TRUE correlation matrices are used in the computation of the (cross-validated) negative log-likelihood score, i.e., the \(K\)-fold sample covariance matrix is a matrix on the correlation scale. When performing evaluation on the correlation scale, the data are assumed to be standardized. If cor = TRUE and one wishes to use the default target specification, one may consider using target = default.target(covML(Y, cor = TRUE)). This gives a default target under the assumption of standardized data.

Under the default setting of the fold argument, fold = nrow(Y), one performs leave-one-out cross-validation; see the snippet below.
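
For instance (a sketch, with Y a data matrix as described under Arguments), the defaults make the first two calls below equivalent, and the third combines cor = TRUE with its matching default target for standardized data:

## LOOCV: the default fold equals the number of observations
optPenalty.kCVauto(Y, lambdaMin = .001, lambdaMax = 30)
optPenalty.kCVauto(Y, lambdaMin = .001, lambdaMax = 30, fold = nrow(Y))

## Evaluation on the correlation scale with a default target for
## standardized data
Ys <- scale(Y, center = TRUE, scale = TRUE)
optPenalty.kCVauto(Ys, lambdaMin = .001, lambdaMax = 30, cor = TRUE,
                   target = default.target(covML(Ys, cor = TRUE)))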

References

Brent, R.P. (1971). An Algorithm with Guaranteed Convergence for Finding a Zero of a Function. Computer Journal 14: 422-425.

Author

Wessel N. van Wieringen, Carel F.W. Peeters <carel.peeters@wur.nl>

Examples


## Obtain some (high-dimensional) data
p <- 25
n <- 10
set.seed(333)
X <- matrix(rnorm(n*p), nrow = n, ncol = p)
colnames(X) <- letters[1:p]

## Obtain regularized precision under optimal penalty using K = n
OPT <- optPenalty.kCVauto(X, lambdaMin = .001, lambdaMax = 30); OPT
#> $optLambda
#> [1] 13.42467
#> 
#> $optPrec
#> A 25 x 25 ridge precision matrix estimate with lambda = 13.424670
#>              a            b           c            d            e            f …
#> a  0.755139352 -0.003028252 -0.04933164 -0.002421719  0.013977632  0.001114962 …
#> b -0.003028252  0.806519739 -0.01581207 -0.006536546 -0.024427366  0.005763776 …
#> c -0.049331636 -0.015812067  0.75928308 -0.030454558  0.020626619  0.012856311 …
#> d -0.002421719 -0.006536546 -0.03045456  0.801308975  0.020062895  0.001788722 …
#> e  0.013977632 -0.024427366  0.02062662  0.020062895  0.767347416 -0.005726924 …
#> f  0.001114962  0.005763776  0.01285631  0.001788722 -0.005726924  0.811360616 …
#> … 19 more rows and 19 more columns
#> 
OPT$optLambda # Optimal penalty
#> [1] 13.42467
OPT$optPrec   # Regularized precision under optimal penalty
#> A 25 x 25 ridge precision matrix estimate with lambda = 13.424670
#>              a            b           c            d            e            f …
#> a  0.755139352 -0.003028252 -0.04933164 -0.002421719  0.013977632  0.001114962 …
#> b -0.003028252  0.806519739 -0.01581207 -0.006536546 -0.024427366  0.005763776 …
#> c -0.049331636 -0.015812067  0.75928308 -0.030454558  0.020626619  0.012856311 …
#> d -0.002421719 -0.006536546 -0.03045456  0.801308975  0.020062895  0.001788722 …
#> e  0.013977632 -0.024427366  0.02062662  0.020062895  0.767347416 -0.005726924 …
#> f  0.001114962  0.005763776  0.01285631  0.001788722 -0.005726924  0.811360616 …
#> … 19 more rows and 19 more columns

## Another example with standardized data
X <- scale(X, center = TRUE, scale = TRUE)
OPT <- optPenalty.kCVauto(X, lambdaMin = .001, lambdaMax = 30, cor = TRUE,
                          target = default.target(covML(X, cor = TRUE))); OPT
#> $optLambda
#> [1] 1.706836
#> 
#> $optPrec
#> A 25 x 25 ridge precision matrix estimate with lambda = 1.706836
#>              a            b           c           d           e           f …
#> a  0.873376735  0.005931253 -0.10948686  0.02092830  0.02758587 -0.03107073 …
#> b  0.005931253  0.861084085 -0.05863234 -0.06126328 -0.11026791  0.01354590 …
#> c -0.109486857 -0.058632335  0.92644719 -0.10928825  0.04836945  0.03566900 …
#> d  0.020928298 -0.061263276 -0.10928825  0.87328836  0.05739944 -0.00832902 …
#> e  0.027585872 -0.110267905  0.04836945  0.05739944  0.90730968 -0.04183658 …
#> f -0.031070733  0.013545902  0.03566900 -0.00832902 -0.04183658  0.86457495 …
#> … 19 more rows and 19 more columns
#> 
OPT$optLambda # Optimal penalty
#> [1] 1.706836
OPT$optPrec   # Regularized precision under optimal penalty
#> A 25 x 25 ridge precision matrix estimate with lambda = 1.706836
#>              a            b           c           d           e           f …
#> a  0.873376735  0.005931253 -0.10948686  0.02092830  0.02758587 -0.03107073 …
#> b  0.005931253  0.861084085 -0.05863234 -0.06126328 -0.11026791  0.01354590 …
#> c -0.109486857 -0.058632335  0.92644719 -0.10928825  0.04836945  0.03566900 …
#> d  0.020928298 -0.061263276 -0.10928825  0.87328836  0.05739944 -0.00832902 …
#> e  0.027585872 -0.110267905  0.04836945  0.05739944  0.90730968 -0.04183658 …
#> f -0.031070733  0.013545902  0.03566900 -0.00832902 -0.04183658  0.86457495 …
#> … 19 more rows and 19 more columns

## Another example using K = 5
OPT <- optPenalty.kCVauto(X, lambdaMin = .001, lambdaMax = 30, fold = 5); OPT
#> $optLambda
#> [1] 14.40738
#> 
#> $optPrec
#> A 25 x 25 ridge precision matrix estimate with lambda = 14.407379
#>              a            b           c            d            e            f …
#> a  0.691478602 -0.003227029 -0.03222970 -0.002217116  0.009474089  0.001236740 …
#> b -0.003227029  0.691403158 -0.01655489 -0.009966873 -0.027470081  0.010617198 …
#> c -0.032229702 -0.016554888  0.69242538 -0.029799763  0.014524643  0.014598928 …
#> d -0.002217116 -0.009966873 -0.02979976  0.691409558  0.020899754  0.002791406 …
#> e  0.009474089 -0.027470081  0.01452464  0.020899754  0.691973904 -0.006678442 …
#> f  0.001236740  0.010617198  0.01459893  0.002791406 -0.006678442  0.691322457 …
#> … 19 more rows and 19 more columns
#> 
OPT$optLambda # Optimal penalty
#> [1] 14.40738
OPT$optPrec   # Regularized precision under optimal penalty
#> A 25 x 25 ridge precision matrix estimate with lambda = 14.407379
#>              a            b           c            d            e            f …
#> a  0.691478602 -0.003227029 -0.03222970 -0.002217116  0.009474089  0.001236740 …
#> b -0.003227029  0.691403158 -0.01655489 -0.009966873 -0.027470081  0.010617198 …
#> c -0.032229702 -0.016554888  0.69242538 -0.029799763  0.014524643  0.014598928 …
#> d -0.002217116 -0.009966873 -0.02979976  0.691409558  0.020899754  0.002791406 …
#> e  0.009474089 -0.027470081  0.01452464  0.020899754  0.691973904 -0.006678442 …
#> f  0.001236740  0.010617198  0.01459893  0.002791406 -0.006678442  0.691322457 …
#> … 19 more rows and 19 more columns