Project Details
Projekt Print View

Bridging the gap between data generation and data comprehension in seismology: Towards new insights and improved understandings in seismology via Intelligent Data Analysis

Subject Area Geophysics
Term from 2010 to 2014
Project identifier Deutsche Forschungsgemeinschaft (DFG) - Project number 170386084
 
Final Report Year 2013

Final Report Abstract

Intelligent Data Analysis (IDA) provides a wealth of non-trivial ("intelligent") algorithms, techniques and methodologies aimed at analyzing empirical data. IDA is successfully being applied in various domains, resulting in new insights and improved understanding of many processes, in particular when uncertainty is inherently associated with the problem at hand. We applied IDA to various earth scientific problems, such as in the field of natural hazards, e.g., earthquakes, tsunamis, flood events, landslides, volcano eruptions. These domains all share (despite of their differences in causes and effects) many of the same model- and decision theoretic questions which can be dealt with using IDA methods. Usually the natural processes are complex and the number of potential influencing factors is large. The single and joint effects of the driving forces are not necessarily completely understood, potentially introducing a large degree of uncertainty which impacts the overall analysis and results. Additionally the observation on the basis of which inference is made is often sparse, inaccurate and/or incomplete, adding yet another layer of uncertainty on top. From an IDA point of view, many methods were applied where a principled handling of uncertainty is taken explicit into account: (Dynamic) Bayesian networks (D)BNs, learning (D)BNs from observations, Bayesian statistics, modeling and eliciting prior knowledge, mixture modeling, transfer learning, Bayesian non-parametrics, hierarchical modeling, visualization and dimensionality reduction. In particular worth mentioning is our use of socalled Bayesian Networks (BNs) in natural hazards, allowing for an intuitive, consistent, and rigorous way of quantifying uncertainties. The (in)dependencies between the involved variables relevant to a particular hazard domain are translated into a graph structure enabling for improved understandings and direct insights into the relationships and workings of a natural hazard system. Moreover, as BNs capture the joint distribution of all variables of a given domain, they may be used for expressing any conditional probability distribution of interest, thereby helping to answer quantitative questions on specific scenarios or process response chains. More specifically, we have looked at BNs in the context of tsunami early warning, probabilistic seismic hazard analysis (the ground-motion modeling part), landslide prediction and for flood damage assessments. For all domains, we took a data-driven stance, and were able to develop predictive models, where not only uncertainty of a prediction is taken into account, but we may also perform instant prediction with incomplete evidence/observations (useful for tsunami early warning) and at the same obtain insight into the workings of natural system system itself. Moreover, in comparative studies we showed that BNs perform better or on par with existing methods (usually based on simple predictive models, e.g.. regression models). The BNs were developed from data, however, often data in unavailable or scarce. In such situations prior knowledge needs to be incorporated in typical data-driven IDA methods. To this end we have investigated and developed a platform for eliciting expert knowledge, taking into account the typical biases and other problems with regard to consistently obtaining trustworthy domain knowledge. For this we have focused on probabilistic seismic hazard analysis, where expert knowledge of various ground motion models is required. Additionally, we have developed transfer-learning approaches for "borrowing" information form related ground motion models, hence augmenting data sets for regions where hardly any observations have been made, e.g., for geographically adjacent regions, where for one region we have plenty of data to derive a ground motion model, but for the other region we don't; in that case we implicitly borrow observations (via an intelligent mechanism) to derive a model even for the region with few observations.

Publications

 
 

Additional Information

Textvergrößerung und Kontrastanpassung