MMVFL: A Simple Vertical Federated Learning Framework for Multi-Class Multi-Participant Scenarios
Abstract
:1. Introduction
- A novel and simple multi-participant multi-class VFL framework is proposed. By using this framework, a data owner can borrow information from other data owners to help improve task performance without raw data being disclosed.
- A feature importance evaluation scheme based on MMVFL is proposed, which aims to assess the significance of different features contributed by each participant in the federated learning (FL) model. By identifying and eliminating redundant and harmful features during the initial training stages, the subsequent learning process can be conducted with reduced communication, computation, and storage costs.
- The proposed framework is flexible because different schemes that aim at enhancing communication efficiency and security can be incorporated into the framework. In addition, MMVFL can be easily extended by combining it with deep neural networks.
- The proposed MMVFL framework has been evaluated extensively based on two datasets. The results show that MMVFL is comparable with, and in some cases even superior to, methods that provide label information to each client.
2. Related Work
2.1. Vertical Federated Learning
2.2. Embedded Feature Selection
2.3. Multi-View Learning
3. Proposed Method
3.1. Notations and Problem Definition
3.2. Sparse-Learning-Based Unsupervised Feature Selection
3.3. Privacy-Preserving Label Sharing
3.4. Optimization
- When and are fixed, can be solved locally. Equation (3) becomes:Though is convex, its derivative does not exist when . Following [43], by denoting , its derivative with respect to isis a small constant to avoid overflow. Thus, is nonzero for every i. Therefore, Equation (4) can be rewritten as:We employ an alternating optimization scheme to solve Equation (5). When is fixed, the optimal value of can be obtained throughWhen is fixed, we can update through Equation (4). Note that is initialized with random values before the optimization process begins.
- When is fixed, the optimization problem for solving and isThe optimization of Equation (7) is performed in an alternating manner. When , and are fixed, can be solved locally throughIt is straightforward to obtain the optimal by simply taking the derivative asWhen and are fixed, the optimization of for can be carried out in a similar way, and the optimal is:Likewise, when are fixed, the optimal value of is:
4. Analysis
4.1. Convergence
4.2. Time Complexity
4.3. Privacy Preservation
5. Experimental Evaluation
5.1. Dataset Information
5.2. Comparison Baselines
- supFL [41]: which performs independent supervised feature selection on each of the K participants assuming that they all have access to label information. It optimizes the following objective function:Note that notation in Equation (9) refers to the one-hot matrix that contains the label information as defined in Section 3.3, which is different from the same notation used in [41].
- supMVLFL: which performs supervised multi-view feature selection under a linear transformation framework. It is a direct extension of supFL [41] into an MVL architecture, which optimizes the following objective function:
5.3. Experiment Settings
5.4. Results and Discussion
6. Conclusions and Future Work
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- Ahmed, U.; Srivastava, G.; Lin, J.C.W. A Federated Learning Approach to Frequent Itemset Mining in Cyber-Physical Systems. J. Netw. Syst. Manag. 2021, 29, 1–17. [Google Scholar]
- Qin, Y.; Kondo, M. Federated Learning-Based Network Intrusion Detection with a Feature Selection Approach. In Proceedings of the 2021 International Conference on Electrical, Communication, and Computer Engineering (ICECCE), Kuala Lumpur, Malaysia, 12–13 June 2021; pp. 1–6. [Google Scholar]
- Ahmed, U.; Lin, J.C.W.; Srivastava, G. 5G-Empowered Drone Networks in Federated and Deep Reinforcement Learning Environments. IEEE Commun. Stand. Mag. 2021, 5, 55–61. [Google Scholar] [CrossRef]
- Bakopoulou, E.; Tillman, B.; Markopoulou, A. Fedpacket: A Federated Learning Approach to Mobile Packet Classification. IEEE Trans. Mob. Comput. 2021, 21, 609–3628. [Google Scholar] [CrossRef]
- Ahmed, U.; Srivastava, G.; Lin, J.C.W. Reliable Customer Analysis Using Federated Learning and Exploring Deep-Attention Edge Intelligence. Future Gener. Comput. Syst. 2022, 127, 70–79. [Google Scholar] [CrossRef]
- Li, Q.; Wen, Z.; Wu, Z.; Hu, S.; Wang, N.; Li, Y.; Liu, X.; He, B. A Survey on Federated Learning Systems: Vision, Hype and Reality for Data Privacy and Protection. IEEE Trans. Knowl. Data Eng. 2021, 35, 3347–3366. [Google Scholar]
- Xu, J.; Glicksberg, B.S.; Su, C.; Walker, P.; Bian, J.; Wang, F. Federated Learning for Healthcare Informatics. J. Health Inform. Res. 2021, 5, 1–19. [Google Scholar]
- Byrd, D.; Polychroniadou, A. Differentially Private Secure Multi-Party Computation for Federated Learning in Financial Applications. In Proceedings of the First ACM International Conference on AI in Finance, New York, NY, USA, 15–16 October 2020; pp. 1–9. [Google Scholar]
- Li, Y.; Tao, X.; Zhang, X.; Liu, J.; Xu, J. Privacy-Preserved Federated Learning for Autonomous Driving. IEEE Trans. Intel. Transport. Sys. 2021, 23, 8423–8434. [Google Scholar] [CrossRef]
- Lin, G.; Liang, F.; Pan, W.; Ming, Z. Fedrec: Federated Recommendation with Explicit Feedback. IEEE Intel. Sys. 2020, 36, 21–30. [Google Scholar] [CrossRef]
- Smith, V.; Chiang, C.K.; Sanjabi, M.; Talwalkar, A.S. Federated Multi-Task Learning. In Proceedings of the 31st International Conference on Neural Information Processing Systems, Long Beach, CA, USA, 4–9 December 2017; pp. 4424–4434. [Google Scholar]
- Gascón, A.; Schoppmann, P.; Balle, B.; Raykova, M.; Doerner, J.; Zahur, S.; Evans, D. Secure Linear Regression on Vertically Partitioned Datasets. IACR Cryptol. EPrint Arch. 2016, 2016, 892. [Google Scholar]
- Yang, Q.; Liu, Y.; Chen, T.; Tong, Y. Federated Machine Learning: Concept and Applications. ACM Trans. Intell. Syst. Technol. 2019, 10, 12. [Google Scholar]
- Marfoq, O.; Neglia, G.; Bellet, A.; Kameni, L.; Vidal, R. Federated Multi-Task Learning under a Mixture of Distributions. Adv. Inf. Process. Syst. 2021, 34, 15434–15447. [Google Scholar]
- Yoon, J.; Park, G.; Jeong, W.; Hwang, S.J. Bitwidth Heterogeneous Federated Learning with Progressive Weight Dequantization. In Proceedings of the 39th International Conference on Machine Learning, PMLR 162, Baltimore, MD, USA, 7–23 July 2022; pp. 25552–25565. [Google Scholar]
- Xu, Y.Y.; Lin, C.S.; Wang, Y.C.F. Bias-Eliminating Augmentation Learning for Debiased Federated Learning. In Proceedings of the P2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Vancouver, BC, Canada, 17–24 June 2023; pp. 20442–20452. [Google Scholar]
- McMahan, H.B.; Moore, E.; Ramage, D.; Hampson, S.; y Arcas, B.A. Communication-Efficient Learning of Deep Networks from Decentralized Data. arXiv 2016, arXiv:1602.05629. [Google Scholar]
- Konečnỳ, J.; McMahan, H.B.; Ramage, D.; Richtárik, P. Federated Optimization: Distributed Machine Learning for On-Device Intelligence. arXiv 2016, arXiv:1610.02527. [Google Scholar]
- Hardy, S.; Henecka, W.; Ivey-Law, H.; Nock, R.; Patrini, G.; Smith, G.; Thorne, B. Private Federated Learning on Vertically Partitioned Data via Entity Resolution and Additively Homomorphic Encryption. arXiv 2017, arXiv:1711.10677. [Google Scholar]
- Nock, R.; Hardy, S.; Henecka, W.; Ivey-Law, H.; Patrini, G.; Smith, G.; Thorne, B. Entity Resolution and Federated Learning Get a Federated Resolution. arXiv 2018, arXiv:1803.04035. [Google Scholar]
- Yang, K.; Fan, T.; Chen, T.; Shi, Y.; Yang, Q. A Quasi-Newton Method Based Vertical Federated Learning Framework for Logistic Regression. arXiv 2019, arXiv:1912.00513. [Google Scholar]
- Yang, S.; Ren, B.; Zhou, X.; Liu, L. Parallel Distributed Logistic Regression for Vertical Federated Learning without Third-Party Coordinator. arXiv 2019, arXiv:1911.09824. [Google Scholar]
- Liu, Y.; Zhang, X.; Kang, Y.; Li, L.; Chen, T.; Hong, M.; Yang, Q. FedBCD: A Communication-Efficient Collaborative Learning Framework for Distributed Features. IEEE Trans. Signal Process. 2022, 70, 4277–4290. [Google Scholar] [CrossRef]
- Chen, T.; Jin, X.; Sun, Y.; Yin, W. VAFL: A Method of Vertical Asynchronous Federated Learning. arXiv 2020, arXiv:2007.06081. [Google Scholar]
- Zou, T.; Liu, Y.; Kang, Y.; Liu, W.; He, Y.; Yi, Z.; Yang, Q.; Zhang, Y.Q. Defending Batch-Level Label Inference and Replacement Attacks in Vertical Federated Learning. IEEE Trans. Big Data, 2022; early access. [Google Scholar] [CrossRef]
- Feng, S. Vertical Federated Learning-Based Feature Selection with Non-Overlapping Sample Utilization. Expert Syst. Appl. 2022, 208, 118097. [Google Scholar]
- Li, S.; Yao, D.; Liu, J. FedVS: Straggler-Resilient and Privacy-Preserving Vertical Federated Learning for Split Models. In Proceedings of the 40th International Conference on Machine Learning, Honolulu, HI, USA, 23–29 July 2023. [Google Scholar]
- Zhu, H.; Xu, J.; Liu, S.; Jin, Y. Federated Learning on Non-IID Data: A Survey. Neurocomputing 2021, 465, 371–390. [Google Scholar] [CrossRef]
- Kairouz, P.; McMahan, H.B.; Avent, B.; Bellet, A.; Bennis, M.; Bhagoji, A.N.; Bonawitz, K.; Charles, Z.; Cormode, G.; Cummings, R.; et al. Advances and Open Problems in Federated Learning. arXiv 2019, arXiv:1912.04977. [Google Scholar]
- Wang, G.; Dang, C.X.; Zhou, Z. Measure Contribution of Participants in Federated Learning. In Proceedings of the 2019 IEEE International Conference on Big Data (Big Data), Los Angeles, CA, USA, 9–12 December 2019; pp. 2597–2604. [Google Scholar]
- Ghorbani, A.; Zou, J. Data Shapley: Equitable Valuation of Data for Machine Learning. In Proceedings of the 22nd International Conference on Artificial Intelligence and Statistics, Naha, Japan, 6–18 April 2019; pp. 2242–2251. [Google Scholar]
- Luo, X.; Wu, Y.; Xiao, X.; Ooi, B.C. Feature Inference Attack on Model Predictions in Vertical Federated Learning. In Proceedings of the 2021 IEEE 37th International Conference on Data Engineering (ICDE), Chania, Greece, 19–22 April 2021; pp. 181–192. [Google Scholar]
- Zhang, Q.; Gu, B.; Deng, C.; Huang, H. Secure Bilevel Asynchronous Vertical Federated Learning with Backward Updating. Proc. AAAI Conf. Artif. Intell. 2021, 35, 10896–10904. [Google Scholar] [CrossRef]
- Jin, X.; Chen, P.Y.; Hsu, C.Y.; Yu, C.M.; Chen, T. Cafe: Catastrophic Data Leakage in Vertical Federated Learning. Adv. Neural Inf. Process. Syst. 2021, 34, 994–1006. [Google Scholar]
- Jiang, X.; Zhou, X.; Grossklags, J. Comprehensive Analysis of Privacy Leakage in Vertical Federated Learning During Prediction. Proc. Priv. Enhancing Technol. 2022, 2022, 263–281. [Google Scholar] [CrossRef]
- Fu, C.; Zhang, X.; Ji, S.; Chen, J.; Wu, J.; Guo, S.; Zhou, J.; Liu, A.X.; Wang, T. Label Inference Attacks against Vertical Federated Learning. In Proceedings of the 31st USENIX Security Symposium, Boston, MA, USA, 10–12 August 2022; pp. 1397–1414. [Google Scholar]
- Gu, B.; Xu, A.; Huo, Z.; Deng, C.; Huang, H. Privacy-Preserving Asynchronous Vertical Federated Learning Algorithms for Multiparty Collaborative Learning. IEEE Trans. Neural Netw. Learn. Syst. 2021, 33, 6103–6115. [Google Scholar]
- Kang, Y.; Liu, Y.; Liang, X. FedCVT: Semi-Supervised Vertical Federated Learning with Cross-View Training. ACM Trans. Intell. Syst. Technol. 2022, 13, 1–16. [Google Scholar] [CrossRef]
- Xu, C.; Tao, D.; Xu, C. A Survey on Multi-View Learning. arXiv 2013, arXiv:1304.5634. [Google Scholar]
- Liu, D.C.; Nocedal, J. On the Limited Memory BFGS Method for Large Scale Optimization. Math. Program. 1989, 45, 501–528. [Google Scholar] [CrossRef]
- Zhao, Z.; Wang, L.; Liu, H. Efficient Spectral Feature Selection with Minimum Redundancy. Proc. AAAI Conf. Artif. Intell. 2010, 24, 673–678. [Google Scholar] [CrossRef]
- Gu, Q.; Li, Z.; Han, J. Joint Feature Selection and Subspace Learning. In Proceedings of the Twenty-Second International Joint Conference on Artificial Intelligence, Barcelona, Spain, 16–22 July 2011; pp. 1294–1299. [Google Scholar]
- Hou, C.; Nie, F.; Li, X.; Yi, D.; Wu, Y. Joint Embedding Learning and Sparse Regression: A Framework for Unsupervised Feature Selection. IEEE Trans. Cybern. 2014, 44, 793–804. [Google Scholar] [PubMed]
- Li, Z.; Liu, J.; Tang, J.; Lu, H. Robust structured subspace learning for data representation. IEEE Trans. Pattern Anal. Mach. Intell. 2015, 37, 2085–2098. [Google Scholar] [PubMed]
- Feng, S.; Duarte, M.F. Graph Autoencoder-Based Unsupervised Feature Selection with Broad and Local Data Structure Preservation. Neurocomputing 2018, 312, 310–323. [Google Scholar] [CrossRef]
- Feng, Y.; Xiao, J.; Zhuang, Y.; Liu, X. Adaptive Unsupervised Multi-view Feature Selection for Visual Concept Recognition. In Proceedings of the 11th Asian Conference on Computer Vision, Daejeon, Republic of Korea, 5–9 November 2012; pp. 343–357. [Google Scholar]
- Hou, C.; Nie, F.; Tao, H.; Yi, D. Multi-View Unsupervised Feature Selection with Adaptive Similarity and View Weight. IEEE Trans. Knowl. Data Eng. 2017, 29, 1998–2011. [Google Scholar] [CrossRef]
- Tang, C.; Zhu, X.; Liu, X.; Wang, L. Cross-View Local Structure Preserved Diversity and Consensus Learning for Multi-View Unsupervised Feature Selection. Proc. AAAI Conf. Artif. Intell. 2019, 33, 5101–5108. [Google Scholar] [CrossRef]
- Ye, X.; Li, H.; Imakura, A.; Sakurai, T. Distributed Collaborative Feature Selection Based on Intermediate Representation. In Proceedings of the 28th International Joint Conference on Artificial Intelligence, Macao, China, 10–16 August 2019; pp. 4142–4149. [Google Scholar]
- Du, L.; Shen, Y.D. Unsupervised Feature Selection with Adaptive Structure Learning. In Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Sydney, NSW, Australia, 10–13 August 2015; pp. 209–218. [Google Scholar]
- Yang, Y.; Shen, H.T.; Ma, Z.; Huang, Z.; Zhou, X. ℓ2,1-norm Regularized Discriminative Feature Selection for Unsupervised Learning. In Proceedings of the Twenty-Second International Joint Conference on Artificial Intelligence, Barcelona, Spain, 16–22 July 2011; pp. 1589–1594. [Google Scholar]
- Tang, J.; Hu, X.; Gao, H.; Liu, H. Unsupervised Feature Selection for Multi-View Data in Social Media. In Proceedings of the 2013 SIAM International Conference on Data Mining, Austin, TX, USA, 2–4 May 2013; pp. 270–278. [Google Scholar]
- Liu, Y.; Kang, Y.; Zhang, X.; Li, L.; Cheng, Y.; Chen, T.; Hong, M.; Yang, Q. A Communication Efficient Vertical Federated Learning Framework. arXiv 2019, arXiv:1912.11187. [Google Scholar]
- Li, Y.; Nie, F.; Huang, H.; Huang, J. Large-scale multi-view spectral clustering via bipartite graph. In Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, Austin, TX, USA, 25–30 January 2015. [Google Scholar]
Notation | Definition |
---|---|
Matrix | |
-norm | |
Dataset | |
K | Number of participants |
Number of classes | |
N | Number of samples |
Data dimensionality at client k |
Handwritten | Caltech7 | Isolet | Relathe | |
---|---|---|---|---|
Data dimensionalities of all views | 240, 76, 216, 47, 64 | 48, 40, 254, 1984, 912, 528 | 200, 200, 217 | 1400, 1400, 1522 |
Training samples/class | 120 | 20 | 40 | 400 |
Validation samples/class | 40 | 5 | 20 | 200 |
Number of classes | 10 | 7 | 26 | 2 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Feng, S.; Yu, H.; Zhu, Y. MMVFL: A Simple Vertical Federated Learning Framework for Multi-Class Multi-Participant Scenarios. Sensors 2024, 24, 619. https://doi.org/10.3390/s24020619
Feng S, Yu H, Zhu Y. MMVFL: A Simple Vertical Federated Learning Framework for Multi-Class Multi-Participant Scenarios. Sensors. 2024; 24(2):619. https://doi.org/10.3390/s24020619
Chicago/Turabian StyleFeng, Siwei, Han Yu, and Yuebing Zhu. 2024. "MMVFL: A Simple Vertical Federated Learning Framework for Multi-Class Multi-Participant Scenarios" Sensors 24, no. 2: 619. https://doi.org/10.3390/s24020619
APA StyleFeng, S., Yu, H., & Zhu, Y. (2024). MMVFL: A Simple Vertical Federated Learning Framework for Multi-Class Multi-Participant Scenarios. Sensors, 24(2), 619. https://doi.org/10.3390/s24020619