Fairness, Integrity, and Privacy in a Scalable Blockchain-based Federated Learning System

Rückel, Timon, Sedlmeir, Johannes, Hofmann, Peter

Nov-11-2021–arXiv.org Artificial Intelligence

This is the accepted version of an article with the same name, published in the Special Issue "Federated Learning and Blockchain Supported Smart Networking in Beyond 5G (B5G) Wireless Communication" in Computer Networks. Abstract Federated machine learning (FL) allows to collectively train models on sensitive data as only the clients' models and not their training data need to be shared. However, despite the attention that research on FL has drawn, the concept still lacks broad adoption in practice. One of the key reasons is the great challenge to implement FL systems that simultaneously achieve fairness, integrity, and privacy preservation for all participating clients. To contribute to solving this issue, our paper suggests a FL system that incorporates blockchain technology, local differential privacy, and zero-knowledge proofs. Our implementation of a proof-of-concept with multiple linear regression illustrates that these state-of-the-art technologies can be combined to a FL system that aligns economic incentives, trust, and confidentiality requirements in a scalable and transparent system. A Blockchain blockchain eliminates the need for a centralized authority, provides transparency, enforces the federated learning protocol, and provides a decentralized infrastructure for the collection of fees and the distribution of rewards. The reward payment is calculated based on the client's clients' Federated learning enables multiple clients FIM Research Center 1. Introduction The application of machine learning (ML) promises far-reaching potentials across industries [1]. ML has already proven successful in many areas, such as web search or recommender systems in e-commerce, in which a lot of high-quality data exists [2]. While researchers address ML's growing demand for compute power and use of data with, e.g., distributed ML approaches where multiple computing nodes share their resources [3, 4, 5] and quality issues with data processing, access to data is not only a technical issue. Both traditional ML and distributed ML approaches assume that their training data is centralized by nature, preventing the applicability of ML approaches to domains in which data is sensitive and distributed at the same time. To avoid that ML approaches must rely on data to which only a centralized organization or individual has full access, federated machine learning (FL) can aggregate the less sensitive ML models that were independently and locally trained by individual clients [6, 7].

blockchain, privacy, signal input, (15 more...)

arXiv.org Artificial Intelligence

Nov-11-2021

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - Hawaii (0.04)
  - Virginia (0.04)
  - California (0.04)
- Europe
  - Slovenia > Drava
    - Municipality of Benedikt > Benedikt (0.04)
  - Germany > Bavaria
    - Upper Franconia > Bayreuth (0.04)
- Africa > Middle East
  - Egypt > Cairo Governorate > Cairo (0.04)

Genre:
- Research Report (1.00)
- Overview > Innovation (0.34)

Industry:
- Banking & Finance (1.00)
- Information Technology
  - Security & Privacy (1.00)
  - Services > e-Commerce Services (0.87)

Technology:
- Information Technology
  - Data Science > Data Mining
    - Big Data (0.67)
  - Artificial Intelligence
    - Machine Learning (1.00)
    - Representation & Reasoning > Optimization (0.46)