Coded Distributed Computation with Application to Machine Learning

Applicant Professorin Dr.-Ing. Antonia Wachter-Zeh

Subject Area Communication Technology and Networks, High-Frequency Technology and Photonic Systems, Signal Processing and Machine Learning for Information Technology

Term from 2021 to 2025

Project identifier Deutsche Forschungsgemeinschaft (DFG) - Project number 470027923

Final Report Year 2025

Final Report Abstract

In this project, our goal was to utilize channel coding tools to enhance the speed, security, and privacy of distributed computing and federated learning. In order to process a large amount of data for training good machine learning models (like deep neural networks, large language models, etc.), distributing computations to capable servers becomes necessary. However, distributed computing introduces issues such as stragglers (nodes with lower computation and communication capabilities) and privacy and security concerns (due to the usage of external computing servers). Federated learning is a machine learning paradigm where the data are a property of decentralized users, hence imposing similar issues as distributed computing. Our goals in this project were as follows. i) constructing coding schemes that are fast, private, and secure for distributed computing, ii) deriving fundamental bounds and limitations for coding schemes for distributed computing, and iii) constructing coding schemes for federated learning to account for its straggler-tolerance, privacy, and security concerns. To achieve these goals, we utilized techniques from algebraic channel codes and from codes on graphs (also known as modern channel coding) to provide resiliency over straggling computation nodes and to ensure privacy and security. Moreover, we adopted well-known techniques related to channel coding, such as group testing, to achieve our goals. Among our results are the first private rateless matrix multiplication scheme, where we use Fountain codes in order to construct a rateless scheme to speed up matrix multiplication and use Lagrange coded computing in order to preserve the privacy of the matrices being multiplied. Moreover, we upgraded the scheme by using verification techniques in order to ensure resiliency against malicious attackers. We focused on the problem of privacy-preserving techniques over data stored in sparse matrices, since existing techniques did not account for this type of matrix. In this project, we also focused on speeding up federated learning (also known as coded federated learning) by replicating the data in a private way across different clients. We were the first to investigate the use of computationally private techniques based on codebased cryptography to preserve privacy in coded federated learning. An important aspect in federated learning is also the combination of privacy-preserving techniques and security-guaranteeing ones. To ensure that, we adopted techniques from group testing in order to allow a private identification of malicious clients in federated learning, using secure aggregation as a privacy-preserving technique.

Publications

Adaptive Private Distributed Matrix Multiplication. IEEE Transactions on Information Theory, 68(4), 2653-2673.
Bitar, Rawad; Xhemrishi, Marvin & Wachter-Zeh, Antonia
Computational Code-Based Privacy in Coded Federated Learning. 2022 IEEE International Symposium on Information Theory (ISIT), 2034-2039. IEEE.
Xhemrishi, Marvin; Graell i. Amat, Alexandre; Rosnes, Eirik & Wachter-Zeh, Antonia
Distributed Matrix-Vector Multiplication with Sparsity and Privacy Guarantees. 2022 IEEE International Symposium on Information Theory (ISIT), 1028-1033. IEEE.
Xhemrishi, Marvin; Bitar, Rawad & Wachter-Zeh, Antonia
Efficient Distributed Machine Learning via Combinatorial Multi-Armed Bandits. 2022 IEEE International Symposium on Information Theory (ISIT), 1653-1658. IEEE.
Egger, Maximilian; Bitar, Rawad; Wachter-Zeh, Antonia & Gündüz, Deniz
Efficient Private Storage of Sparse Machine Learning Data. 2022 IEEE Information Theory Workshop (ITW), 214-219. IEEE.
Xhemrishi, Marvin; Egger, Maximilian & Bitar, Rawad
Secure Private and Adaptive Matrix Multiplication Beyond the Singleton Bound. IEEE Journal on Selected Areas in Information Theory, 3(2), 275-285.
Hofmeister, Christoph; Bitar, Rawad; Xhemrishi, Marvin & Wachter-Zeh, Antonia
Fast and Straggler-Tolerant Distributed SGD with Reduced Computation Load. 2023 IEEE International Symposium on Information Theory (ISIT), 1336-1341. IEEE.
Egger, Maximilian; Hanna, Serge Kas & Bitar, Rawad
Maximal-Capacity Discrete Memoryless Channel Identification. 2023 IEEE International Symposium on Information Theory (ISIT), 2248-2253. IEEE.
Egger, Maximilian; Bitar, Rawad; Wachter-Zeh, Antonia; Gündüz, Deniz & Weinberger, Nir
Private Aggregation in Wireless Federated Learning with Heterogeneous Clusters. 2023 IEEE International Symposium on Information Theory (ISIT), 54-59. IEEE.
Egger, Maximilian; Hofmeister, Christoph; Wachter-Zeh, Antonia & Bitar, Rawad
Sparse and Private Distributed Matrix Multiplication with Straggler Tolerance. 2023 IEEE International Symposium on Information Theory (ISIT), 2535-2540. IEEE.
Egger, Maximilian; Xhemrishi, Marvin; Wachter-Zeh, Antonia & Bitar, Rawad
Byzantine-Resilient Secure Aggregation for Federated Learning Without Privacy Compromises. 2024 IEEE Information Theory Workshop (ITW), 223-228. IEEE.
Xia, Yue; Hofmeister, Christoph; Egger, Maximilian & Bitar, Rawad
Capacity-Maximizing Input Symbol Selection for Discrete Memoryless Channels. 2024 IEEE International Symposium on Information Theory (ISIT), 723-728. IEEE.
Egger, Maximilian; Bitar, Rawad; Wachter-Zeh, Antonia; Gündüz, Deniz & Weinberger, Nir
Scalable and Reliable Over-the-Air Federated Edge Learning. GLOBECOM 2024 - 2024 IEEE Global Communications Conference, 3932-3937. IEEE.
Egger, Maximilian; Hofmeister, Christoph; Kaya, Cem; Bitar, Rawad & Wachter-Zeh, Antonia
Self-Duplicating Random Walks for Resilient Decentralized Learning on Graphs. GLOBECOM 2024 - 2024 IEEE Global Communications Conference, 2960-2965. IEEE.
Egger, Maximilian; Ayache, Ghadir; Bitar, Rawad; Wachter-Zeh, Antonia & Rouayheb, Salim El
Sparsity and Privacy in Secret Sharing: A Fundamental Trade-Off. IEEE Transactions on Information Forensics and Security, 19, 5136-5150.
Bitar, Rawad; Egger, Maximilian; Wachter-Zeh, Antonia & Xhemrishi, Marvin
BI- CompFL: Stochastic Federated Learning with Bi-Directional Compression
Maximilian Egger, Rawad Bitar, Antonia Wachter-Zeh, Nir Weinberger & Deniz Gündüz
Byzantine-Resilient Zero-Order Optimization for Communication-Eﬀicient Heterogeneous Federated Learning
Maximilian Egger, Mayank Bakshi & Rawad Bitar
Cost-Efficient Distributed Learning via Combinatorial Multi-Armed Bandits. Entropy, 27(5), 541.
Egger, Maximilian; Bitar, Rawad; Wachter-Zeh, Antonia & Gündüz, Deniz
Detect & Score: Privacy-Preserving Misbehavior Detection and Contribution Evaluation in Federated Learning. Proceedings of the International Workshop on Secure and Efficient Federated Learning, 1-6. ACM.
Xhemrishi, Marvin; Graell i. Amat, Alexandre & Pejo, Balazs
Efficient Machine Unlearning by Model Splitting and Core Sample Selection. 2025 IEEE Information Theory Workshop (ITW), 1-6. IEEE.
Egger, Maximilian; Bitar, Rawad & Urbanke, Rüdiger
Federated One-Shot Learning with Data Privacy and Objective-Hiding. 2025 IEEE International Symposium on Information Theory (ISIT), 1-6. IEEE.
Egger, Maximilian; Urbanke, Rüdiger & Bitar, Rawad
Federated One-Shot Learning With Data Privacy and Objective-Hiding. IEEE Transactions on Information Forensics and Security, 20, 5166-5180.
Egger, Maximilian; Urbanke, Rüdiger L. & Bitar, Rawad
FedGT: Identification of Malicious Clients in Federated Learning With Secure Aggregation. IEEE Transactions on Information Forensics and Security, 20, 2577-2592.
Xhemrishi, Marvin; Östman, Johan; Wachter-Zeh, Antonia & Amat, Alexandre Graell i.
LoByITFL: Low Communication Secure and Private Federated Learning. Proceedings of the International Workshop on Secure and Efficient Federated Learning, 1-6. ACM.
Xia, Yue; Egger, Maximilian; Hofmeister, Christoph & Bitar, Rawad
Maximal-Capacity Discrete Memoryless Channel Identification. IEEE Transactions on Information Theory, 71(2), 1248-1265.
Egger, Maximilian; Bitar, Rawad; Wachter-Zeh, Antonia; Gündüz, Deniz & Weinberger, Nir
Multi-Terminal Remote Generation and Estimation Over a Broadcast Channel with Correlated Priors. 2025 IEEE International Symposium on Information Theory (ISIT), 1-6. IEEE.
Egger, Maximilian; Bitar, Rawad; Wachter-Zeh, Antonia; Weinberger, Nir & Gündüz, Deniz
Perfect Privacy for Discriminator-Based Byzantine-Resilient Federated Learning
Yue Xia, Christoph Hofmeister, Maximilian Egger & Rawad Bitar
Private Aggregation for Byzantine-Resilient Heterogeneous Federated Learning. 2025 IEEE Information Theory Workshop (ITW), 1-6. IEEE.
Egger, Maximilian & Bitar, Rawad
Private Aggregation in Hierarchical Wireless Federated Learning With Partial and Full Collusion. IEEE Transactions on Information Theory, 71(11), 8977-8992.
Egger, Maximilian; Hofmeister, Christoph; Wachter-Zeh, Antonia & Bitar, Rawad
Self-Duplicating Random Walks for Resilient Decentralized Learning on Graphs
Egger, Maximilian; Ayache, Ghadir; Bitar, Rawad; Wachter-Zeh, Antonia & Rouayheb, Salim El
Soft-Decision Decoding for LDPC Code-Based Quantitative Group Testing. 2025 14th International ITG Conference on Systems, Communications and Coding (SCC), 1-6. IEEE.
Xhemrishi, Marvin; Östman, Johan & Graell i. Amat, Alexandre
Source Anonymity for Private Random Walk Decentralized Learning. 2025 IEEE Information Theory Workshop (ITW), 1-6. IEEE.
Egger, Maximilian; Lage, Svenja; Bitar, Rawad & Wachter-Zeh, Antonia

Servicenavigation

Hauptnavigation

Coded Distributed Computation with Application to Machine Learning

Final Report Abstract

Publications

Additional Information

Servicenavigation

Hauptnavigation

Coded Distributed Computation with Application to Machine Learning

Final Report Abstract

Publications

Additional Information

Textvergrößerung und Kontrastanpassung