Statistical Modeling of Covariate-Varying Networks (CVN)
Final Report Abstract
High-dimensional networks play a key role in understanding complex relationships. These relationships are often dynamic in nature and can change with multiple external factors such as time or disease status. Existing methods for estimating high-dimensional graphical models are often limited to static graphs or those that can change with a single covariate. This project developed a novel model class called covariate-varying networks (CVNs) that can change depending on multiple external covariates. We introduce sparsity by applying a L1 penalty on the precision matrices of m >= 2 graphs we aim to estimate. These graphs often exhibit a level of similarity, termed smoothness, which is represented by a meta-graph with m nodes, each corresponding to a graph one wants to estimate. The (weighted) adjacency matrix of the meta-graph represents the strength of similarity enforced among the m graphs. The resulting optimization problem is solved by using an alternating direction method of multipliers (ADMM). A crucial step within the ADMM involves solving a weighted fused signal approximator (wFLSA), which, to the best of our knowledge, has not been previously addressed. We do this by reformulating it as a Generalized LASSO problem and solving it with an ADMM tailored explicitly for this task. The wFLSA method also has valuable other applications, such as in image smoothing and change-point detection in time series. Both methods, CVN and wFLSA, were implemented in R and were released as R packages to facilitate broader use and application. Since the CVN model estimation requires the selection of regularization parameters, we compared appropriate methods for this task in simulation studies. Results show that current methods, including StARS (Stability Approach to Regularization Selection), often perform poorly, underscoring the need for further research. Further insights were gained in the performance of LASSO-type variable selection procedures that we compared with 'best subset' selection. It was shown that penalized variable selection performs better, especially in cases with correlated variables or low signal-to-noise ratio.
Publications
-
A novel class of graphs: Covariate-varying networks. Presentation at the Broad Institute of the Massachusetts Institute of Technology (MIT) & Harvard University, 9 Dec 2022, Cambridge, USA
Dijkstra L., Godt A. & Foraita R.
-
A novel class of graphs: Covariate-varying networks. 5th Conference of the Central European Network (CEN), 3-7 Sep 2023, Basel, Switzerland
Dijkstra L., Godt A. & Foraita R.
-
Variable selection in linear regression models: Choosing the best subset is not always the best choice. Biometrical Journal, 66(1).
Hanke, Moritz; Dijkstra, Louis; Foraita, Ronja & Didelez, Vanessa
-
Estimating sparse graphical models in high dimensions. 70th Biometric Colloquium, Lübeck, 29 Feb 2024
Foraita R., Mose K., Hanke M. &Didelez V.
-
High-dimensional graphical models varying with multiple external covariates and its application to Alzheimer’s disease. International Biometric Conference, Atlanta, USA, 10 Dec 2024
Foraita R., Hüls A. & Dijkstra L.
