Project Details
Projekt Print View

dCortools: Distance Correlation Methods for Detecting Nonlinear Associations in High-Dimensional Molecular Data

Subject Area Medical Informatics and Medical Bioinformatics
Term from 2019 to 2022
Project identifier Deutsche Forschungsgemeinschaft (DFG) - Project number 417754611
 
Final Report Year 2022

Final Report Abstract

The project dealt with the concept of distance correlation, which is a novel and powerful dependence criterion. Contrary to Pearson or Spearman correlation, distance correlation measures any kind of dependence between random vectors, including nonlinear or even nonmonotone associations. In this project, the concept of distance correlation was extended to survival data allowing to test for independence between any kind of predictor variable (such as e.g. clinical or demographical data) and a time-to-event response. For this purpose, two approaches were derived. The first approach is based on the distance correlation of the predictor variable and the martingale residuals in an empty Cox model, the second approach is based on inverse-probability of censoring-weighted U-statistics. Both approaches have been shown to perform well in practice. Moreover, we created an R package dcortools, which is freely available on the Comprehensive R Archive Network (CRAN). Different to other R packages that provide resampling-based tests, this R package features asymptotic testing procedures that are typically substantially faster. Moreover, a highly efficient computation of the distance correlation coefficient itself is implemented. Finally, the R package also features methods for generalized distance correlation and the distance correlation methods for survival data developed in the course of the project. One of the original goals of the project was further the development of iterative variable screening methods for high-dimensional data. This goal was dropped for the moment, since we could establish a unifying results between the distance correlation and other measures of dependence. Using this result, we obtain a regression interpretation for distance correlation, which provides new insights on how to establish iterative variable screening methods. We plan to pursue this direction in future research.

Publications

 
 

Additional Information

Textvergrößerung und Kontrastanpassung