Is Aggregation the Only Choice? Federated Learning via Layer-wise Model Recombination

Hu, Ming; Yue, Zhihao; Xie, Xiaofei; Chen, Cheng; Huang, Yihao; Wei, Xian; Lian, Xiang; Liu, Yang; Chen, Mingsong

doi:10.1145/3637528.3671722

Computer Science > Machine Learning

arXiv:2305.10730 (cs)

[Submitted on 18 May 2023 (v1), last revised 4 Jul 2024 (this version, v2)]

Title:Is Aggregation the Only Choice? Federated Learning via Layer-wise Model Recombination

Authors:Ming Hu, Zhihao Yue, Xiaofei Xie, Cheng Chen, Yihao Huang, Xian Wei, Xiang Lian, Yang Liu, Mingsong Chen

View PDF HTML (experimental)

Abstract:Although Federated Learning (FL) enables global model training across clients without compromising their raw data, due to the unevenly distributed data among clients, existing Federated Averaging (FedAvg)-based methods suffer from the problem of low inference performance. Specifically, different data distributions among clients lead to various optimization directions of local models. Aggregating local models usually results in a low-generalized global model, which performs worse on most of the clients. To address the above issue, inspired by the observation from a geometric perspective that a well-generalized solution is located in a flat area rather than a sharp area, we propose a novel and heuristic FL paradigm named FedMR (Federated Model Recombination). The goal of FedMR is to guide the recombined models to be trained towards a flat area. Unlike conventional FedAvg-based methods, in FedMR, the cloud server recombines collected local models by shuffling each layer of them to generate multiple recombined models for local training on clients rather than an aggregated global model. Since the area of the flat area is larger than the sharp area, when local models are located in different areas, recombined models have a higher probability of locating in a flat area. When all recombined models are located in the same flat area, they are optimized towards the same direction. We theoretically analyze the convergence of model recombination. Experimental results show that, compared with state-of-the-art FL methods, FedMR can significantly improve the inference accuracy without exposing the privacy of each client.

Comments:	arXiv admin note: substantial text overlap with arXiv:2208.07677
Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2305.10730 [cs.LG]
	(or arXiv:2305.10730v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2305.10730
Related DOI:	https://doi.org/10.1145/3637528.3671722

Submission history

From: Ming Hu [view email]
[v1] Thu, 18 May 2023 05:58:24 UTC (2,808 KB)
[v2] Thu, 4 Jul 2024 18:22:01 UTC (1,650 KB)

Computer Science > Machine Learning

Title:Is Aggregation the Only Choice? Federated Learning via Layer-wise Model Recombination

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Is Aggregation the Only Choice? Federated Learning via Layer-wise Model Recombination

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators