Accelerating Matrix Computations for Mining Large Dynamic Complex Networks
Final Report Abstract
Graph mining and machine learning methods often apply linear algebra routines on big data sets. Such routines can be computationally expensive and may exceed available resources. In addition, dynamic changes over time in such data sets are rarely addressed in stateof-the-art software libraries. It was our motivation to overcome several of these aspects with AL-MACOM, specifically to accelerate algebraic algorithms for large-scale and dynamic data sets. To facilitate dynamic graph operations, we published Dynamic Hashed Blocks (DHB), a graph data structure that outperforms state-of-the-art competitors when executing common static and dynamic operations. DHB served as a foundation for several contributions that followed. To improve basic algebraic kernels, we proposed a new MPI-based dynamic sparse matrix multiplication (SpGEMM) algorithm in case of edge updates. This could then be extended to an MPI-based algorithm for dynamic closeness centrality. Using mostly combinatorial techniques, we proposed a new algorithm for approximating the diagonal of the Laplacian pseudoinverse of graphs. This resulted in a more accurate, faster, and more memory-efficient approximation compared to the state of the art. Moreover, we adapted the algorithm to the related problem of approximating the diagonal of the so-called forest matrix. Also this adapted algorithm has the favorable properties mentioned above. The performance of graph analytics kernels also depend on their cache efficiency, which can be optimised by graph reordering. One of our efforts resulted in a multilevel algorithm for mapping graph applications onto hierarchical architectures – it shows a better quality to speed trade-off compared to the competition and can be extended easily to reordering. In addition, we proposed a static and dynamic algorithm for computing a vertex ranking according to Katz centrality. The algorithm improves lower and upper bounds on the centrality values iteratively. This way, it can give (approximate) guarantees on the ranking quality – as opposed to previous methods. Additionally, we proposed a dynamic version of the Suitor approximation algorithm for weighted matchings in graphs. As one further application of our work on approximating (pseudo)inverse matrices, we investigated graph robustness, resulting in algebraic and combinatorial techniques to speed up an established greedy algorithm.
Publications
-
Approximation of the Diagonal of a Laplacian’s Pseudoinverse for Complex Network Analysis. 28th Europ. Symp. on Algorithms (ESA), 6:1–6:24, 2020.
E. Angriman, M. Predari, A. v. d. Grinten & H. Meyerhenke
-
Distributing Sparse Matrix/Graph Applications in Heterogeneous Clusters -an Experimental Study. 2020 IEEE 27th International Conference on High Performance Computing, Data, and Analytics (HiPC), 72-81. IEEE.
Tzovas, Charilaos; Predari, Maria & Meyerhenke, Henning
-
An MPI-based Algorithm for Mapping Complex Networks onto Hierarchical Architectures. Lecture Notes in Computer Science, 167-182. Springer International Publishing.
Predari, Maria; Tzovas, Charilaos; Schulz, Christian & Meyerhenke, Henning
-
New Approximation Algorithms for Forest Closeness Centrality – for Individual Vertices and Vertex Groups. Proceedings of the 2021 SIAM International Conference on Data Mining (SDM), 136-144. Society for Industrial and Applied Mathematics.
van der Grinten, Alexander; Angriman, Eugenio; Predari, Maria & Meyerhenke, Henning
-
A Batch-dynamic Suitor Algorithmfor Approximating Maximum Weighted Matching. ACM Journal of Experimental Algorithmics, 27, 1-41.
Angriman, Eugenio; Boroń, Michał & Meyerhenke, Henning
-
A Fast Data Structure for Dynamic Graphs Based on Hash-Indexed Adjacency Blocks. 20th Intl. Symp. on Exp. Alg. (SEA), 11:1–11:18, 2022
A. v. d. Grinten, M. Predari & F. Willich
-
An MPI-Parallel Algorithm for Static and Dynamic Top-k Harmonic Centrality. 2022 IEEE 34th International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD), 100-109. IEEE.
van der Grinten, Alexander; Custers, Geert; Thanh, Duy Le & Meyerhenke, Henning
-
Fast Dynamic Updates and Dynamic SpGEMM on MPI-Distributed Graphs. 2022 IEEE International Conference on Cluster Computing (CLUSTER), 429-439. IEEE.
van der Grinten, Alexander; Custers, Geert; Thanh, Duy Le & Meyerhenke, Henning
-
Faster Greedy Optimization of Resistance-based Graph Robustness. 2022 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM), 1-8. IEEE.
Predari, Maria; Kooij, Robert & Meyerhenke, Henning
-
Scalable Katz Ranking Computation in Large Static and Dynamic Graphs. ACM Journal of Experimental Algorithmics, 27, 1-16.
Grinten, van der Alexander; Bergamini, Elisabetta; Green, Oded; Bader, David A. & Meyerhenke, Henning
-
Greedy optimization of resistance-based graph robustness with global and local edge insertions. Social Network Analysis and Mining, 13(1).
Predari, Maria; Berner, Lukas; Kooij, Robert & Meyerhenke, Henning
