FedFog: Network-Aware Optimization of Federated Learning over Wireless Fog-Cloud System

[en] Federated learning (FL) is capable of performing large distributed machine learning tasks across multiple edge users by periodically aggregating trained local parameters. To address key challenges of enabling FL over a wireless fogcloud system (e.g., non-i.i.d. data, users’ heterogeneity), we first propose an efficient FL algorithm based on Federated Averaging (called FedFog) to perform the local aggregation of gradient parameters at fog servers and global training update at the cloud. Next, we employ FedFog in wireless fog-cloud systems by investigating a novel network-aware FL optimization problem that strikes the balance between the global loss and completion time. An iterative algorithm is then developed to obtain a precise measurement of the system performance, which helps design an efficient stopping criteria to output an appropriate number of global rounds. To mitigate the straggler effect, we propose a flexible user aggregation strategy that trains fast users first to obtain a certain level of accuracy before allowing slow users to join the global training updates. Extensive numerical results using several real-world FL tasks are provided to verify the theoretical convergence of FedFog. We also show that the proposed co-design of FL and communication is essential to substantially improve resource utilization while achieving comparable accuracy of the learning model.

Research center :

Interdisciplinary Centre for Security, Reliability and Trust (SnT) > SIGCOM

Disciplines :

Electrical & electronics engineering

Author, co-author :

NGUYEN, van Dinh ; University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT) > SigCom

CHATZINOTAS, Symeon ; University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT) > SigCom

OTTERSTEN, Björn ; University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT)

Duong, Trung Q.

External co-authors :

yes

Language :

English

Title :

FedFog: Network-Aware Optimization of Federated Learning over Wireless Fog-Cloud System

Publication date :

2022

Journal title :

IEEE Transactions on Wireless Communications

ISSN :

1536-1276

eISSN :

1558-2248

Publisher :

Institute of Electrical and Electronics Engineers, New York, United States - New York

Special issue title :

Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC)

Volume :

Issue :

Peer reviewed :

Peer Reviewed verified by ORBi

European Projects :

H2020 - 742648 - AGNOSTIC - Actively Enhanced Cognition based Framework for Design of Complex Systems

Funders :

H2020
CE - Commission Européenne
European Union

Data Set :

https://arxiv.org/abs/2107.02755

Available on ORBilu :

since 11 April 2022

Statistics

Number of views

211 (27 by Unilu)

Number of downloads

101 (11 by Unilu)

More statistics

Scopus citations^®

Scopus citations^®
without self-citations

OpenCitations

OpenAlex citations

WoS citations^™

publications

supporting

mentioning

contrasting

Smart Citations

Citing PublicationsSupportingMentioningContrasting

View Citations

See how this article has been cited at scite.ai

scite shows how a scientific paper has been cited by providing the context of the citation, a classification describing whether it supports, mentions, or contrasts the cited claim, and a label indicating in which section the citation was made.

Bibliography

M. Chiang and T. Zhang, "Fog and IoT: An overview of research opportunities," IEEE Internet Things J., vol. 3, no. 6, pp. 854-864, Dec. 2016.
Cisco. (2019). Demystifying 5G in Industrial IoT. [Online]. Available: www.cisco.com/c/dam/en-us/solutions/iot/demystifying-5g-industrialiot.%pdf
J. Park, S. Samarakoon, M. Bennis, and M. Debbah, "Wireless network intelligence at the edge," Proc. IEEE, vol. 107, no. 11, pp. 2204-2239, Nov. 2019.
J. Park et al., "Communication-efficient and distributed learning over wireless networks: Principles and applications," Proc. IEEE, vol. 109, no. 5, pp. 796-819, May 2021.
B. McMahan, E. Moore, D. Ramage, S. Hampson, and B. A. Y. Arcas, "Communication-efficient learning of deep networks from decentralized data," in Proc. Int. Conf. Artif. Intell. Stat., Fort Lauderdale, FL, USA, Apr. 2017, pp. 1273-1282.
J. Konecný, H. B. McMahan, F. X. Yu, P. Richtárik, A. T. Suresh, and D. Bacon, "Federated learning: Strategies for improving communication efficiency," 2016, arXiv:1610.05492.
J. Wang, S. Wang, R.-R. Chen, and M. Ji, "Demystifying why local aggregation helps: Convergence analysis of hierarchical SGD," 2020, arXiv:2010.12998.
P. Kairouz et al., "Advances and open problems in federated learning," 2019, arXiv:1912.04977.
S. U. Stich, "Local SGD converges fast and communicates little," in Proc. Int. Conf. Learn. Represent., 2019.
Y. Zhang, J. C. Duchi, and M. J. Wainwright, "Communication-efficient algorithms for statistical optimization," J. Mach. Learn. Res., vol. 14, pp. 3321-3363, Nov. 2013.
S. Wang et al., "Adaptive federated learning in resource constrained edge computing systems," IEEE J. Sel. Areas Commun., vol. 37, no. 3, pp. 1205-1221, Jun. 2019.
J. Kang, Z. Xiong, D. Niyato, S. Xie, and J. Zhang, "Incentive mech-anism for reliable federated learning: A joint optimization approach to combining reputation and contract theory," IEEE Internet Things J., vol. 6, no. 6, pp. 10700-10714, Dec. 2019.
J. C. Duchi, M. I. Jordan, and M. J. Wainwright, "Privacy aware learning," J. ACM, vol. 61, pp. 1-57, Dec. 2014, doi: 10.1145/2666468.
H. B. McMahan, D. Ramage, K. Talwar, and L. Zhang, "Learning dif-ferentially private recurrent language models," 2017, arXiv:1710.06963.
T. Li, M. Sanjabi, A. Beirami, and V. Smith, "Fair resource allocation in federated learning," in Proc. Int. Conf. Learn. Represent. (ICLR), 2020.
C. Xie, S. Koyejo, and I. Gupta. (2019). SLSGD: Secure and Effi-cient Distributed on-Device Machine Learning. [Online]. Available: https://128.84.21.199/abs/1903.06996v1
H. H. Yang, Z. Liu, T. Q. S. Quek, and H. V. Poor, "Scheduling policies for federated learning in wireless networks," IEEE Trans. Commun., vol. 68, no. 1, pp. 317-333, Jan. 2020.
S. Zheng, C. Shen, and X. Chen, "Design and analysis of uplink and downlink communications for federated learning," IEEE J. Sel. Areas Commun., vol. 39, no. 7, pp. 2150-2167, Jul. 2021.
M. Chen, Z. Yang, W. Saad, C. Yin, H. V. Poor, and S. Cui, "A joint learning and communications framework for federated learning over wireless networks," IEEE Trans. Wireless Commun., vol. 20, no. 1, pp. 269-283, Jan. 2021.
C. T. Dinh et al., "Federated learning over wireless networks: Con-vergence analysis and resource allocation," IEEE/ACM Trans. Netw., vol. 29, no. 1, pp. 398-409, Feb. 2021.
K. Yang, T. Jiang, Y. Shi, and Z. Ding, "Federated learning via over-the-air computation," IEEE Trans. Wireless Commun., vol. 19, no. 3, pp. 2022-2035, Mar. 2020.
T. T. Vu, D. T. Ngo, N. H. Tran, H. Q. Ngo, M. N. Dao, and R. H. Middleton, "Cell-free massive MIMO for wireless federated learn-ing," IEEE Trans. Wireless Commun., vol. 19, no. 10, pp. 6377-6392, Oct. 2020.
V.-D. Nguyen, S. K. Sharma, T. X. Vu, S. Chatzinotas, and B. Ottersten, "Efficient federated learning algorithm for resource allocation in wireless IoT networks," IEEE Internet Things J., vol. 8, no. 5, pp. 3394-3409, Mar. 2021.
Z. Yang, M. Chen, W. Saad, C. S. Hong, and M. Shikh-Bahaei, "Energy efficient federated learning over wireless communication net-works," IEEE Trans. Wireless Commun., vol. 20, no. 3, pp. 1935-1949, Mar. 2021.
A. Mahmoudi, H. S. Ghadikolaei, and C. Fischione, "Cost-efficient distributed optimization in machine learning over wireless net-works," in Proc. IEEE Int. Conf. Commun. (ICC), Jun. 2020, pp. 1-7.
L. Liu, J. Zhang, S. Song, and K. B. Letaief, "Client-edge-cloud hierarchical federated learning," in Proc. IEEE Int. Conf. Commun. (ICC), Jun. 2020, pp. 1-6.
S. Hosseinalipour, C. G. Brinton, V. Aggarwal, H. Dai, and M. Chiang, "From federated to fog learning: Distributed machine learning over heterogeneous wireless networks," IEEE Commun. Mag., vol. 58, no. 12, pp. 41-47, Dec. 2020.
S. Hosseinalipour et al., "Multi-stage hybrid federated learning over large-scale D2D-enabled fog networks," 2020, arXiv:2007.09511.
R. Saha, S. Misra, and P. K. Deb, "FogFL: Fog-assisted federated learning for resource-constrained IoT devices," IEEE Internet Things J., vol. 8, no. 10, pp. 8456-8463, May 2021.
Y. Tu, Y. Ruan, S. Wagle, C. G. Brinton, and C. Joe-Wong, "Network-aware optimization of distributed learning for fog comput-ing," in Proc. IEEE INFOCOM Conf. Comput. Commun., Jul. 2020, pp. 2509-2518.
J. Feng, L. Liu, Q. Pei, and K. Li, "Min-max cost optimization for efficient hierarchical federated learning in wireless edge networks," IEEE Trans. Parallel Distrib. Syst., early access, Nov. 30, 2022, doi: 10.1109/TPDS.2021.3131654.
S. Luo, X. Chen, Q. Wu, Z. Zhou, and S. Yu, "HFEL: Joint edge association and resource allocation for cost-efficient hierarchical federated edge learning," IEEE Trans. Wireless Commun., vol. 19, no. 10, pp. 6535-6548, Oct. 2020.
W. Wen, Z. Chen, H. H. Yang, W. Xia, and T. Q. S. Quek, "Joint scheduling and resource allocation for hierarchical federated edge learning," IEEE Trans. Wireless Commun., early access, Jan. 26, 2022, doi: 10.1109/TWC.2022.3144140.
C. Zhou, A. Fu, S. Yu, W. Yang, H. Wang, and Y. Zhang, "Privacypreserving federated learning in fog computing," IEEE Internet Things J., vol. 7, no. 11, pp. 10782-10793, Nov. 2020.
X. Li, K. Huang, W. Yang, S. Wang, and Z. Zhang, "On the convergence of FedAvg on non-IID data," in Proc. Int. Conf. Learn. Represent. (ICLR), 2020, pp. 1-26.
M. Chen, H. V. Poor, W. Saad, and S. Cui, "Convergence time optimization for federated learning over wireless networks," IEEE Trans. Wireless Commun., vol. 20, no. 4, pp. 2457-2471, Apr. 2021.
T. T. Vu, D. T. Ngo, H. Q. Ngo, M. N. Dao, N. H. Tran, and R. H. Middleton, "User selection approaches to mitigate the straggler effect for federated learning on cell-free massive MIMO networks," 2020, arXiv:2009.02031.
W. Xia, T. Q. S. Quek, K. Guo, W. Wen, H. H. Yang, and H. Zhu, "Multi-armed bandit-based client scheduling for federated learning," IEEE Trans. Wireless Commun., vol. 19, no. 11, pp. 7108-7123, Nov. 2020.
B. R. Marks and G. P. Wright, "A general inner approximation algorithm for nonconvex mathematical programs," Oper. Res., vol. 26, no. 4, pp. 681-683, 1978.
G. Lee, W. Saad, and M. Bennis, "An online optimization framework for distributed fog network formation with minimal latency," IEEE Trans. Wireless Commun., vol. 18, no. 4, pp. 2244-2258, Apr. 2019.
S. Shalev-Shwartz and S. Ben-David, Understanding Machine Learning: From Theory to Algorithms. Cambridge, U.K.: Cambridge Univ. Press, 2014.
Y. Ruan, X. Zhang, S.-C. Liang, and C. Joe-Wong, "Towards flexible device participation in federated learning," 2020, arXiv:2006.06954.
U. Dotsch, M. Doll, H.-P. Mayer, F. Schaich, J. Segel, and P. Sehier, "Optimal and successive approaches to signal design for multiple antenna physical layer multicasting," Bell Labs Tech. J., vol. 18, pp. 105-128, 2014.
M.-H. Chen, B. Liang, and M. Dong, "Multi-user multi-task offloading and resource allocation in mobile cloud systems," IEEE Wireless Commun., vol. 17, no. 10, pp. 6790-6805, Oct. 2018.
J. Du, L. Zhao, J. Feng, and X. Chu, "Computation offloading and resource allocation in mixed fog/cloud computing systems with min-max fairness guarantee," IEEE Trans. Commun., vol. 66, no. 4, pp. 1594-1608, Apr. 2018.
H. Kim, D. J. Love, and S. Y. Park, "Optimal and successive approaches to signal design for multiple antenna physical layer multicasting," IEEE Trans. Commun., vol. 59, no. 8, pp. 2316-2327, Aug. 2011.
A. P. Miettinen and J. K. Nurminen, "Energy efficiency of mobile clients in cloud computing," in Proc. USENIX HotCloud, Berkeley, CA, USA, 2010, pp. 1-7.
T. D. Burd and R. W. Brodersen, "Processor design for portable systems," J. VLSI Signal Process. Syst., vol. 13, nos. 2-3, pp. 203-221, 1996.
M. E. T. Gerards, J. L. Hurink, and J. Kuper, "On the interplay between global DVFS and scheduling tasks with precedence constraints," IEEE Trans. Comput., vol. 64, no. 6, pp. 1742-1754, Jun. 2015.
R. T. Marler and J. S. Arora, "Survey of multi-objective optimization methods for engineering," Struct. Multidisciplinary Optim., vol. 26, no. 6, pp. 369-395, Apr. 2004.
V.-D. Nguyen, H. V. Nguyen, O. A. Dobre, and O.-S. Shin, "A new design paradigm for secure full-duplex multiuser systems," IEEE J. Sel. Areas Commun., vol. 36, no. 7, pp. 1480-1498, Jul. 2018.
A. Beck, A. Ben-Tal, and L. Tetruashvili, "A sequential parametric convex approximation method with applications to nonconvex truss topology design problems," J. Global Optim., vol. 47, no. 1, pp. 29-51, Jul. 2010.
A. Ben-Tal and A. Nemirovski, Lectures on Modern Convex Optimiza-tion. Philadelphia: MPS-SIAM Series on Optimi., SIAM, 2001.
Y. LeCun, L. Bottou, Y. Bengio, and P. Haffner, "Gradient-based learning applied to document recognition," Proc. IEEE, vol. 86, no. 11, pp. 2278-2324, Nov. 1998.
A. Krizhevsky and G. Hinton. (2009). Learning Multiple Layers of Features From Tiny Images. [Online]. Available: https://www.cs.toronto. edu/kriz/cifar.html
Y. Mao, J. Zhang, and K. B. Letaief, "Dynamic computation offloading for mobile-edge computing with energy harvesting devices," IEEE J. Sel. Areas Commun., vol. 34, no. 12, pp. 3590-3605, Sep. 2016.