[en] Federated learning (FL) is capable of performing large distributed machine learning tasks across multiple edge users by periodically aggregating trained local parameters. To address key challenges of enabling FL over a wireless fogcloud system (e.g., non-i.i.d. data, users’ heterogeneity), we first propose an efficient FL algorithm based on Federated Averaging (called FedFog) to perform the local aggregation of gradient parameters at fog servers and global training update at the cloud. Next, we employ FedFog in wireless fog-cloud systems by investigating a novel network-aware FL optimization problem that strikes the balance between the global loss and completion time. An iterative algorithm is then developed to obtain a precise measurement of the system performance, which helps design an efficient stopping criteria to output an appropriate number of global rounds. To mitigate the straggler effect, we propose a flexible user aggregation strategy that trains fast users first to obtain a certain level of accuracy before allowing slow users to join the global training updates. Extensive numerical results using several real-world FL tasks are provided to verify the theoretical convergence of FedFog. We also show that the proposed co-design of FL and communication is essential to substantially improve resource utilization while achieving comparable accuracy of the learning model.
Centre de recherche :
Interdisciplinary Centre for Security, Reliability and Trust (SnT) > SIGCOM
Disciplines :
Ingénierie électrique & électronique
Auteur, co-auteur :
NGUYEN, van Dinh ; University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT) > SigCom
CHATZINOTAS, Symeon ; University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT) > SigCom
OTTERSTEN, Björn ; University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT)
Duong, Trung Q.
Co-auteurs externes :
yes
Langue du document :
Anglais
Titre :
FedFog: Network-Aware Optimization of Federated Learning over Wireless Fog-Cloud System
Date de publication/diffusion :
2022
Titre du périodique :
IEEE Transactions on Wireless Communications
ISSN :
1536-1276
eISSN :
1558-2248
Maison d'édition :
Institute of Electrical and Electronics Engineers, New York, Etats-Unis - New York
Titre particulier du numéro :
Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC)
Volume/Tome :
21
Fascicule/Saison :
10
Peer reviewed :
Peer reviewed vérifié par ORBi
Projet européen :
H2020 - 742648 - AGNOSTIC - Actively Enhanced Cognition based Framework for Design of Complex Systems
M. Chiang and T. Zhang, "Fog and IoT: An overview of research opportunities," IEEE Internet Things J., vol. 3, no. 6, pp. 854-864, Dec. 2016.
Cisco. (2019). Demystifying 5G in Industrial IoT. [Online]. Available: www.cisco.com/c/dam/en-us/solutions/iot/demystifying-5g-industrialiot.%pdf
J. Park, S. Samarakoon, M. Bennis, and M. Debbah, "Wireless network intelligence at the edge," Proc. IEEE, vol. 107, no. 11, pp. 2204-2239, Nov. 2019.
J. Park et al., "Communication-efficient and distributed learning over wireless networks: Principles and applications," Proc. IEEE, vol. 109, no. 5, pp. 796-819, May 2021.
B. McMahan, E. Moore, D. Ramage, S. Hampson, and B. A. Y. Arcas, "Communication-efficient learning of deep networks from decentralized data," in Proc. Int. Conf. Artif. Intell. Stat., Fort Lauderdale, FL, USA, Apr. 2017, pp. 1273-1282.
J. Konecný, H. B. McMahan, F. X. Yu, P. Richtárik, A. T. Suresh, and D. Bacon, "Federated learning: Strategies for improving communication efficiency," 2016, arXiv:1610.05492.
J. Wang, S. Wang, R.-R. Chen, and M. Ji, "Demystifying why local aggregation helps: Convergence analysis of hierarchical SGD," 2020, arXiv:2010.12998.
P. Kairouz et al., "Advances and open problems in federated learning," 2019, arXiv:1912.04977.
S. U. Stich, "Local SGD converges fast and communicates little," in Proc. Int. Conf. Learn. Represent., 2019.
Y. Zhang, J. C. Duchi, and M. J. Wainwright, "Communication-efficient algorithms for statistical optimization," J. Mach. Learn. Res., vol. 14, pp. 3321-3363, Nov. 2013.
S. Wang et al., "Adaptive federated learning in resource constrained edge computing systems," IEEE J. Sel. Areas Commun., vol. 37, no. 3, pp. 1205-1221, Jun. 2019.
J. Kang, Z. Xiong, D. Niyato, S. Xie, and J. Zhang, "Incentive mech-anism for reliable federated learning: A joint optimization approach to combining reputation and contract theory," IEEE Internet Things J., vol. 6, no. 6, pp. 10700-10714, Dec. 2019.
J. C. Duchi, M. I. Jordan, and M. J. Wainwright, "Privacy aware learning," J. ACM, vol. 61, pp. 1-57, Dec. 2014, doi: 10.1145/2666468.
H. B. McMahan, D. Ramage, K. Talwar, and L. Zhang, "Learning dif-ferentially private recurrent language models," 2017, arXiv:1710.06963.
T. Li, M. Sanjabi, A. Beirami, and V. Smith, "Fair resource allocation in federated learning," in Proc. Int. Conf. Learn. Represent. (ICLR), 2020.
C. Xie, S. Koyejo, and I. Gupta. (2019). SLSGD: Secure and Effi-cient Distributed on-Device Machine Learning. [Online]. Available: https://128.84.21.199/abs/1903.06996v1
H. H. Yang, Z. Liu, T. Q. S. Quek, and H. V. Poor, "Scheduling policies for federated learning in wireless networks," IEEE Trans. Commun., vol. 68, no. 1, pp. 317-333, Jan. 2020.
S. Zheng, C. Shen, and X. Chen, "Design and analysis of uplink and downlink communications for federated learning," IEEE J. Sel. Areas Commun., vol. 39, no. 7, pp. 2150-2167, Jul. 2021.
M. Chen, Z. Yang, W. Saad, C. Yin, H. V. Poor, and S. Cui, "A joint learning and communications framework for federated learning over wireless networks," IEEE Trans. Wireless Commun., vol. 20, no. 1, pp. 269-283, Jan. 2021.
C. T. Dinh et al., "Federated learning over wireless networks: Con-vergence analysis and resource allocation," IEEE/ACM Trans. Netw., vol. 29, no. 1, pp. 398-409, Feb. 2021.
K. Yang, T. Jiang, Y. Shi, and Z. Ding, "Federated learning via over-the-air computation," IEEE Trans. Wireless Commun., vol. 19, no. 3, pp. 2022-2035, Mar. 2020.
T. T. Vu, D. T. Ngo, N. H. Tran, H. Q. Ngo, M. N. Dao, and R. H. Middleton, "Cell-free massive MIMO for wireless federated learn-ing," IEEE Trans. Wireless Commun., vol. 19, no. 10, pp. 6377-6392, Oct. 2020.
V.-D. Nguyen, S. K. Sharma, T. X. Vu, S. Chatzinotas, and B. Ottersten, "Efficient federated learning algorithm for resource allocation in wireless IoT networks," IEEE Internet Things J., vol. 8, no. 5, pp. 3394-3409, Mar. 2021.
Z. Yang, M. Chen, W. Saad, C. S. Hong, and M. Shikh-Bahaei, "Energy efficient federated learning over wireless communication net-works," IEEE Trans. Wireless Commun., vol. 20, no. 3, pp. 1935-1949, Mar. 2021.
A. Mahmoudi, H. S. Ghadikolaei, and C. Fischione, "Cost-efficient distributed optimization in machine learning over wireless net-works," in Proc. IEEE Int. Conf. Commun. (ICC), Jun. 2020, pp. 1-7.
L. Liu, J. Zhang, S. Song, and K. B. Letaief, "Client-edge-cloud hierarchical federated learning," in Proc. IEEE Int. Conf. Commun. (ICC), Jun. 2020, pp. 1-6.
S. Hosseinalipour, C. G. Brinton, V. Aggarwal, H. Dai, and M. Chiang, "From federated to fog learning: Distributed machine learning over heterogeneous wireless networks," IEEE Commun. Mag., vol. 58, no. 12, pp. 41-47, Dec. 2020.
S. Hosseinalipour et al., "Multi-stage hybrid federated learning over large-scale D2D-enabled fog networks," 2020, arXiv:2007.09511.
R. Saha, S. Misra, and P. K. Deb, "FogFL: Fog-assisted federated learning for resource-constrained IoT devices," IEEE Internet Things J., vol. 8, no. 10, pp. 8456-8463, May 2021.
Y. Tu, Y. Ruan, S. Wagle, C. G. Brinton, and C. Joe-Wong, "Network-aware optimization of distributed learning for fog comput-ing," in Proc. IEEE INFOCOM Conf. Comput. Commun., Jul. 2020, pp. 2509-2518.
J. Feng, L. Liu, Q. Pei, and K. Li, "Min-max cost optimization for efficient hierarchical federated learning in wireless edge networks," IEEE Trans. Parallel Distrib. Syst., early access, Nov. 30, 2022, doi: 10.1109/TPDS.2021.3131654.
S. Luo, X. Chen, Q. Wu, Z. Zhou, and S. Yu, "HFEL: Joint edge association and resource allocation for cost-efficient hierarchical federated edge learning," IEEE Trans. Wireless Commun., vol. 19, no. 10, pp. 6535-6548, Oct. 2020.
W. Wen, Z. Chen, H. H. Yang, W. Xia, and T. Q. S. Quek, "Joint scheduling and resource allocation for hierarchical federated edge learning," IEEE Trans. Wireless Commun., early access, Jan. 26, 2022, doi: 10.1109/TWC.2022.3144140.
C. Zhou, A. Fu, S. Yu, W. Yang, H. Wang, and Y. Zhang, "Privacypreserving federated learning in fog computing," IEEE Internet Things J., vol. 7, no. 11, pp. 10782-10793, Nov. 2020.
X. Li, K. Huang, W. Yang, S. Wang, and Z. Zhang, "On the convergence of FedAvg on non-IID data," in Proc. Int. Conf. Learn. Represent. (ICLR), 2020, pp. 1-26.
M. Chen, H. V. Poor, W. Saad, and S. Cui, "Convergence time optimization for federated learning over wireless networks," IEEE Trans. Wireless Commun., vol. 20, no. 4, pp. 2457-2471, Apr. 2021.
T. T. Vu, D. T. Ngo, H. Q. Ngo, M. N. Dao, N. H. Tran, and R. H. Middleton, "User selection approaches to mitigate the straggler effect for federated learning on cell-free massive MIMO networks," 2020, arXiv:2009.02031.
W. Xia, T. Q. S. Quek, K. Guo, W. Wen, H. H. Yang, and H. Zhu, "Multi-armed bandit-based client scheduling for federated learning," IEEE Trans. Wireless Commun., vol. 19, no. 11, pp. 7108-7123, Nov. 2020.
B. R. Marks and G. P. Wright, "A general inner approximation algorithm for nonconvex mathematical programs," Oper. Res., vol. 26, no. 4, pp. 681-683, 1978.
G. Lee, W. Saad, and M. Bennis, "An online optimization framework for distributed fog network formation with minimal latency," IEEE Trans. Wireless Commun., vol. 18, no. 4, pp. 2244-2258, Apr. 2019.
S. Shalev-Shwartz and S. Ben-David, Understanding Machine Learning: From Theory to Algorithms. Cambridge, U.K.: Cambridge Univ. Press, 2014.
Y. Ruan, X. Zhang, S.-C. Liang, and C. Joe-Wong, "Towards flexible device participation in federated learning," 2020, arXiv:2006.06954.
U. Dotsch, M. Doll, H.-P. Mayer, F. Schaich, J. Segel, and P. Sehier, "Optimal and successive approaches to signal design for multiple antenna physical layer multicasting," Bell Labs Tech. J., vol. 18, pp. 105-128, 2014.
M.-H. Chen, B. Liang, and M. Dong, "Multi-user multi-task offloading and resource allocation in mobile cloud systems," IEEE Wireless Commun., vol. 17, no. 10, pp. 6790-6805, Oct. 2018.
J. Du, L. Zhao, J. Feng, and X. Chu, "Computation offloading and resource allocation in mixed fog/cloud computing systems with min-max fairness guarantee," IEEE Trans. Commun., vol. 66, no. 4, pp. 1594-1608, Apr. 2018.
H. Kim, D. J. Love, and S. Y. Park, "Optimal and successive approaches to signal design for multiple antenna physical layer multicasting," IEEE Trans. Commun., vol. 59, no. 8, pp. 2316-2327, Aug. 2011.
A. P. Miettinen and J. K. Nurminen, "Energy efficiency of mobile clients in cloud computing," in Proc. USENIX HotCloud, Berkeley, CA, USA, 2010, pp. 1-7.
T. D. Burd and R. W. Brodersen, "Processor design for portable systems," J. VLSI Signal Process. Syst., vol. 13, nos. 2-3, pp. 203-221, 1996.
M. E. T. Gerards, J. L. Hurink, and J. Kuper, "On the interplay between global DVFS and scheduling tasks with precedence constraints," IEEE Trans. Comput., vol. 64, no. 6, pp. 1742-1754, Jun. 2015.
R. T. Marler and J. S. Arora, "Survey of multi-objective optimization methods for engineering," Struct. Multidisciplinary Optim., vol. 26, no. 6, pp. 369-395, Apr. 2004.
V.-D. Nguyen, H. V. Nguyen, O. A. Dobre, and O.-S. Shin, "A new design paradigm for secure full-duplex multiuser systems," IEEE J. Sel. Areas Commun., vol. 36, no. 7, pp. 1480-1498, Jul. 2018.
A. Beck, A. Ben-Tal, and L. Tetruashvili, "A sequential parametric convex approximation method with applications to nonconvex truss topology design problems," J. Global Optim., vol. 47, no. 1, pp. 29-51, Jul. 2010.
A. Ben-Tal and A. Nemirovski, Lectures on Modern Convex Optimiza-tion. Philadelphia: MPS-SIAM Series on Optimi., SIAM, 2001.
Y. LeCun, L. Bottou, Y. Bengio, and P. Haffner, "Gradient-based learning applied to document recognition," Proc. IEEE, vol. 86, no. 11, pp. 2278-2324, Nov. 1998.
A. Krizhevsky and G. Hinton. (2009). Learning Multiple Layers of Features From Tiny Images. [Online]. Available: https://www.cs.toronto. edu/kriz/cifar.html
Y. Mao, J. Zhang, and K. B. Letaief, "Dynamic computation offloading for mobile-edge computing with energy harvesting devices," IEEE J. Sel. Areas Commun., vol. 34, no. 12, pp. 3590-3605, Sep. 2016.