Project Details
Projekt Print View

The role of the theoretical covariance between SNPs in the design of experiments in genomic evaluations

Subject Area Animal Breeding, Animal Nutrition, Animal Husbandry
Term from 2016 to 2021
Project identifier Deutsche Forschungsgemeinschaft (DFG) - Project number 320694892
 
Final Report Year 2021

Final Report Abstract

The aim of genome‐based association studies is to identify and localise genomic markers that are associated with trait expression. The estimated size of a marker effect give rise for further investigation of the biological functionality of the corresponding DNA region. But only if a marker effect is large enough, it can be detected by some statistical testing procedure. Hence, the precision of parameter estimates is highly important in order to increase the number of truly detected DNA variants and to reduce the number of false‐positive detections. To increase the chance (i.e. power) for discovering a trait‐associated variant with genome‐based association studies, a sufficiently large sample is required. An optimal sample size can be estimated under given framework conditions. An experimenter can specify most of the parameters required for an experimental design but the distribution of marker genotypes is actually available after the experiment had taken place. Hence, to design a future experiment, we suggest approximating this information from genetic theory. The approximation depends on the underlying breeding populations (half‐ or full‐sib families) because non‐random mating influences the extent of dependence between markers. Then, knowing the genetic information of selected parents, the number of progeny can be derived to guarantee a certain power of association analysis later on. We could show that, if the dependence between markers is taken into account, the experimental design leads to a more efficient use of animal resources than a classical method used so far. Not only sample size influences the precision of parameter estimates but also the ability of a statistical approach to cope with the dependence among markers which occurs due to the proximity of markers on the genome. We investigated two regression approaches that were able to take information on the proximity of markers into account. Especially, when markers were grouped according to the strength of association among each other, genetic effects captured by markers were estimated more precisely. With this project, we underpinned the importance of employing the dependence among markers, either by grouping them accordingly or by including a kind of similarity matrix in a statistical approach for genome‐based association studies. Furthermore, we contributed statistical and computational tools to design future experiments for fine‐mapping of trait‐associated variants in breeding populations.

Publications

 
 

Additional Information

Textvergrößerung und Kontrastanpassung