Functions to find the optimal ridge and fusion penalty parameters via cross-validation. The functions support leave-one-out cross-validation (LOOCV), \(k\)-fold CV, and two forms of approximate LOOCV. Depending on the function used, either general numerical optimization or a grid-based search is employed.
optPenalty.fused.grid(
  Ylist,
  Tlist,
  lambdas = 10^seq(-5, 5, length.out = 15),
  lambdaFs = lambdas,
  cv.method = c("LOOCV", "aLOOCV", "sLOOCV", "kCV"),
  k = 10,
  verbose = TRUE,
  ...
)
optPenalty.fused.auto(
  Ylist,
  Tlist,
  lambda,
  cv.method = c("LOOCV", "aLOOCV", "sLOOCV", "kCV"),
  k = 10,
  verbose = TRUE,
  lambda.init,
  maxit.ridgeP.fused = 1000,
  optimizer = "optim",
  maxit.optimizer = 1000,
  debug = FALSE,
  optim.control = list(trace = verbose, maxit = maxit.optimizer),
  ...
)
optPenalty.fused(
  Ylist,
  Tlist,
  lambda = default.penalty(Ylist),
  cv.method = c("LOOCV", "aLOOCV", "sLOOCV", "kCV"),
  k = 10,
  grid = FALSE,
  ...
)
Ylist: A list of \(G\) data matrices with \(n_g\) samples in the rows and \(p\) variables in the columns, corresponding to \(G\) classes of data.

Tlist: A list of \(G\) positive definite class target matrices of size \(p\) times \(p\).

lambdas: A numeric vector of positive ridge penalties.

lambdaFs: A numeric vector of non-negative fusion penalties.
cv.method: character giving the cross-validation (CV) method to use. The allowed values are "LOOCV", "aLOOCV", "sLOOCV", and "kCV" for leave-one-out cross-validation (LOOCV), approximate LOOCV, special LOOCV, and \(k\)-fold CV, respectively.

k: integer giving the number of approximately equally sized parts each class is partitioned into for \(k\)-fold CV. Only used if cv.method is "kCV".
verbose: logical. If TRUE, progress information is printed to the console.

...: For optPenalty.fused, arguments are passed to optPenalty.fused.grid or optPenalty.fused.auto depending on the value of grid. In optPenalty.fused.grid, arguments are passed to ridgeP.fused. In optPenalty.fused.auto, arguments are passed to the optimizer.
lambda: A symmetric character matrix encoding the class of penalty matrices to cross-validate over. The diagonal elements correspond to the class-specific ridge penalties whereas the off-diagonal elements correspond to the fusion penalties. The unique elements of lambda specify the penalties to be determined by the method given in cv.method. Penalties can be fixed if they are coercible to numeric values, such as "0", "2.71", or "3.14". Fusion between pairs can be left out using any of "", NA, "NA", or "0"; see the sketch following this argument list. See default.penalty for help on the construction hereof and for more details. Unused and can be omitted if grid == TRUE.
lambda.init: A numeric penalty matrix of initial values passed to the optimizer. If omitted, the function selects starting values using a common ridge penalty (determined by 1D optimization) and sets all fusion penalties to zero.
maxit.ridgeP.fused: An integer giving the maximum number of iterations allowed for each fused ridge fit.

optimizer: character. Either "optim" or "nlm", determining which optimizer to use.

maxit.optimizer: An integer giving the maximum number of iterations allowed in the optimization procedure.
debug: logical. If TRUE, additional output from the optimizer is appended to the output as an attribute.

optim.control: A list of control arguments for optim.

grid: logical. Should a grid-based search be used? Default is FALSE.
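As an illustration of the lambda argument, consider the following sketch of a penalty matrix for three classes (hypothetical; the entry names ridgeA, ridgeB, and fusAB are arbitrary placeholders):

# Hypothetical penalty matrix for G = 3 classes; must be symmetric.
lambda <- matrix(c("ridgeA", "fusAB",  "",
                   "fusAB",  "ridgeB", "2.71",
                   "",       "2.71",   "ridgeB"),
                 3, 3, byrow = TRUE)
# ridgeA and ridgeB are CV-determined ridge penalties (classes 2 and 3
# share ridgeB), fusAB is a CV-determined fusion penalty, "2.71" fixes
# the fusion between classes 2 and 3, and "" leaves out the fusion
# between classes 1 and 3.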
optPenalty.fused.auto returns a list:

Plist: A list of the precision estimates for the optimal parameters.

lambda: The estimated optimal fused penalty matrix.

lambda.unique: The unique entries of lambda; a more concise overview of lambda.

value: The value of the loss function in the estimated optimum.
optPenalty.fused.grid returns a list:

lambda: A numeric vector of grid values for the ridge penalty.

lambdaF: A numeric vector of grid values for the fusion penalty.

fcvl: A numeric matrix of evaluations of the loss function.
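For instance, components of the returned lists can be accessed as in the following sketch (assuming fitted objects opt.auto and opt.grid from the respective functions, with the component names listed above):

# From optPenalty.fused.auto:
opt.auto$lambda         # estimated optimal fused penalty matrix
opt.auto$lambda.unique  # only the unique, CV-determined penalties
# From optPenalty.fused.grid: locate the grid point with minimal loss
idx <- which.min(opt.grid$fcvl)
expand.grid(lambda = opt.grid$lambda, lambdaF = opt.grid$lambdaF)[idx, ]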
optPenalty.fused.auto utilizes optim for identifying the optimal fused parameters and works for general classes of penalty graphs.

optPenalty.fused.grid gives a grid-based evaluation of the (approximate) LOOCV loss.
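The wrapper optPenalty.fused dispatches to one of these two work-horses via its grid argument; a minimal sketch of the two invocation styles (assuming Ylist and Tlist as in the examples below):

# grid = FALSE (default): numerical optimization via optPenalty.fused.auto
optPenalty.fused(Ylist, Tlist, cv.method = "LOOCV", grid = FALSE)
# grid = TRUE: grid search via optPenalty.fused.grid; extra arguments
# such as lambdas are forwarded through ...
optPenalty.fused(Ylist, Tlist, cv.method = "LOOCV", grid = TRUE,
                 lambdas = 10^seq(-3, 3, length.out = 7))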
Bilgrau, A.E., Peeters, C.F.W., Eriksen, P.S., Boegsted, M., and van Wieringen, W.N. (2020). Targeted Fused Ridge Estimation of Inverse Covariance Matrices from Multiple High-Dimensional Data Classes. Journal of Machine Learning Research, 21(26): 1-52.
See also default.penalty, optPenalty.LOOCV.
if (FALSE) {
# Generate some (not so) high-dimensional data with (not so) many samples
ns <- c(4, 5, 6)
Ylist <- createS(n = ns, p = 6, dataset = TRUE)
Slist <- lapply(Ylist, covML)
Tlist <- default.target.fused(Slist, ns, type = "DIAES")
# Grid-based
lambdas <- 10^seq(-5, 3, length.out = 7)
a <- optPenalty.fused.grid(Ylist, Tlist,
lambdas = lambdas,
cv.method = "LOOCV", maxit = 1000)
b <- optPenalty.fused.grid(Ylist, Tlist,
lambdas = lambdas,
cv.method = "aLOOCV", maxit = 1000)
c <- optPenalty.fused.grid(Ylist, Tlist,
lambdas = lambdas,
cv.method = "sLOOCV", maxit = 1000)
d <- optPenalty.fused.grid(Ylist, Tlist,
lambdas = lambdas,
cv.method = "kCV", k = 2, maxit = 1000)
# Numerical optimization (uses the default "optim" optimizer with method "BFGS")
aa <- optPenalty.fused.auto(Ylist, Tlist, cv.method = "LOOCV", method = "BFGS")
print(aa)
bb <- optPenalty.fused.auto(Ylist, Tlist, cv.method = "aLOOCV", method = "BFGS")
print(bb)
cc <- optPenalty.fused.auto(Ylist, Tlist, cv.method = "sLOOCV", method = "BFGS")
print(cc)
dd <- optPenalty.fused.auto(Ylist, Tlist, cv.method = "kCV", k = 3, method = "BFGS")
print(dd)
#
# Plot the results
#
# LOOCV
# Get minima and plot
amin <- log(expand.grid(a$lambda, a$lambdaF))[which.min(a$fcvl), ]
aamin <- c(log(aa$lambda[1,1]), log(aa$lambda[1,2]))
# Plot
filled.contour(log(a$lambda), log(a$lambdaF), log(a$fcvl), color = heat.colors,
plot.axes = {points(amin[1], amin[2], pch = 16);
points(aamin[1], aamin[2], pch = 16, col = "purple");
axis(1); axis(2)},
xlab = "lambda", ylab = "lambdaF", main = "LOOCV")
# Approximate LOOCV
# Get minima and plot
bmin <- log(expand.grid(b$lambda, b$lambdaF))[which.min(b$fcvl), ]
bbmin <- c(log(bb$lambda[1,1]), log(unique(bb$lambda[1,2])))
filled.contour(log(b$lambda), log(b$lambdaF), log(b$fcvl), color = heat.colors,
plot.axes = {points(bmin[1], bmin[2], pch = 16);
points(bbmin[1], bbmin[2], pch = 16, col ="purple");
axis(1); axis(2)},
xlab = "lambda", ylab = "lambdaF", main = "Approximate LOOCV")
#
# Arbitrary penalty graphs
#
# Generate some new high-dimensional data and a 2 by 2 factorial design
ns <- c(6, 5, 3, 2)
df <- expand.grid(Factor1 = LETTERS[1:2], Factor2 = letters[3:4])
Ylist <- createS(n = ns, p = 4, dataset = TRUE)
Tlist <- lapply(lapply(Ylist, covML), default.target, type = "Null")
# Construct penalty matrix
lambda <- default.penalty(df, type = "CartesianUnequal")
# Find optimal parameters
# using optim with method "Nelder-Mead" and "special" LOOCV
ans1 <- optPenalty.fused(Ylist, Tlist, lambda = lambda,
cv.method = "sLOOCV", verbose = FALSE)
print(ans1$lambda.unique)
# By approximate LOOCV using optim with method "BFGS"
ans2 <- optPenalty.fused(Ylist, Tlist, lambda = lambda,
cv.method = "aLOOCV", verbose = FALSE,
method = "BFGS")
print(ans2$lambda.unique)
# By LOOCV using nlm
lambda.init <- matrix(1, 4, 4)
lambda.init[cbind(1:4,4:1)] <- 0
ans3 <- optPenalty.fused(Ylist, Tlist, lambda = lambda,
lambda.init = lambda.init,
cv.method = "LOOCV", verbose = FALSE,
optimizer = "nlm")
print(ans3$lambda.unique)
# Quite different results!
#
# Arbitrary penalty graphs with fixed penalties!
#
# Generate some new high-dimensional data and a 2 by 2 factorial design
ns <- c(6, 5, 5, 5)
df <- expand.grid(DS = LETTERS[1:2], ER = letters[3:4])
Ylist <- createS(n = ns, p = 4, dataset = TRUE)
Tlist <- lapply(lapply(Ylist, covML), default.target, type = "Null")
lambda <- default.penalty(df, type = "Tensor")
print(lambda) # Say we want to fix the penalty of the pair (1,2) at strength 2.1:
lambda[2,1] <- lambda[1,2] <- 2.1
print(lambda)
# Specifying starting values is also possible:
init <- diag(length(ns))
init[2,1] <- init[1,2] <- 2.1
res <- optPenalty.fused(Ylist, Tlist, lambda = lambda, lambda.init = init,
cv.method = "aLOOCV", optimizer = "nlm")
print(res)
}