Evolutionary algorithms; Maintenance; Reinforcement Learning; Scheduling; Sustainable manufacturing; Evolutionary Multi-objectives; Hypervolume; Maintenance process; Maintenance scheduling; Manufacturing sector; Multi agent; Production process; Reinforcement learnings; Sustainable maintenance; Control and Systems Engineering; Electrical and Electronic Engineering; Artificial Intelligence
Abstract :
[en] In recent years, sustainability has emerged as a major priority for businesses across various industries, and the manufacturing sector is no exception. Production and maintenance processes now need to be economically profitable while also adopting practices that adhere to the principles of environmental integrity and social responsibility. This article explores an innovative approach aimed at optimizing maintenance scheduling from an economic perspective (considering maintenance, breakdown, downtime costs), an environmental perspective (considering the carbon footprint produced during production) and a social perspective (considering the fatigue experienced by technicians during maintenance activities). To the best of our knowledge, this is the first study to propose a manufacturing scheduling approach that considers all three pillars of sustainability. Another significant contribution of this research is the innovative way in which the optimization problem is addressed. We propose an evolutionary multi-objective multi-agent Deep Q-network-based approach, where multiple agents explore the preference space to maximize the hypervolume of these sustainable objectives. Our methodology uses industrially representative data that incorporate realistic machine degradation signals, carbon intensity indicators, and technician constraints. The results demonstrate the trade-offs between these objectives when compared to traditional maintenance policies such as corrective and condition-based maintenance, as well as different Deep Q-network policies trained with various preferences. Our approach demonstrates superior performance compared to both baselines. Specifically, we observe an 11.6% improvement in hypervolume over Deep Q-network and an 18.9% improvement over Proximal Policy Optimization, resulting in significantly increased profitability within the system.
Disciplines :
Computer science
Author, co-author :
RUIZ RODRIGUEZ, Marcelo Luis ; University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT) > SerVal
KUBLER, Sylvain ; University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT) > SerVal
Robert, Jérémy; Cebi International S.A., 30 rue J.F. Kennedy, Steinsel, Luxembourg
Voisin, Alexandre; Université de Lorraine, CNRS, CRAN, Nancy, France
LE TRAON, Yves ; University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT)
External co-authors :
yes
Language :
English
Title :
Evolutionary multi-objective multi-agent deep reinforcement learning for sustainable maintenance scheduling
Publication date :
15 September 2025
Journal title :
Engineering Applications of Artificial Intelligence
ISSN :
0952-1976
Publisher :
Elsevier Ltd
Volume :
156
Pages :
111126
Peer reviewed :
Peer Reviewed verified by ORBi
Development Goals :
8. Decent work and economic growth 9. Industry, innovation and infrastructure
This research was funded in whole or in part by the Luxembourg National Research Fund (FNR), grant reference 16756339. For the purpose of open access, and in fulfilment of the obligations arising from the grant agreement, the author has applied a Creative Commons Attribution 4.0 International (CC BY 4.0) license to any Author Accepted Manuscript version arising from this submission.
Abbas, A.N., Chasparis, G.C., Kelleher, J.D., Hierarchical framework for interpretable and specialized deep reinforcement learning-based predictive maintenance. Data Knowl. Eng., 149, 2024.
Aissani, N., Beldjilali, B., Trentesaux, D., Dynamic scheduling of maintenance tasks in the petroleum industry: A reinforcement approach. Eng. Appl. Artif. Intell. 22:7 (2009), 1089–1103.
Al-Hourani, S., Rescheduling preventive maintenance for utilities equipment using criticality analysis. 2020 Industrial and Systems Engineering Conference, ISEC 2020, 2020, Institute of Electrical and Electronics Engineers Inc.
An, Y., Chen, X., Zhang, J., Li, Y., A hybrid multi-objective evolutionary algorithm to integrate optimization of the production scheduling and imperfect cutting tool maintenance considering total energy consumption. J. Clean. Prod., 268, 2020.
Arena, S., Florian, E., Sgarbossa, F., Sølvsberg, E., Zennaro, I., A conceptual framework for machine learning algorithm selection for predictive maintenance. Eng. Appl. Artif. Intell., 133, 2024, 108340.
Arık, O.A., Population-based tabu search with evolutionary strategies for permutation flow shop scheduling problems under effects of position-dependent learning and linear deterioration. Soft Comput. 25:2 (2021), 1501–1518.
Asghar, I., Sarkar, B., jun Kim, S., Economic analysis of an integrated production–inventory system under stochastic production capacity and energy consumption. Energies, 12(16), 2019.
Attia, A.M., Alatwi, A.O., Al Hanbali, A., Alsawafy, O.G., Joint maintenance planning and production scheduling optimization model for green environment. J. Qual. Maint. Eng. 30:1 (2024), 153–174.
Baykasoğlu, A., Madenoğlu, F.S., Greedy randomized adaptive search procedure for simultaneous scheduling of production and preventive maintenance activities in dynamic flexible job shops. Soft Comput. 25:23 (2021), 14893–14932.
Ben Abdellafou, K., Hadda, H., Korbaa, O., Heuristic algorithms for scheduling intrees on m machines with non-availability constraints. Oper. Res. 21:1 (2021), 55–71.
Bencheikh, G., Letouzey, A., Desforges, X., An approach for joint scheduling of production and predictive maintenance activities. J. Manuf. Syst. 64 (2022), 546–560.
Bhatta, K., Chang, Q., An integrated control strategy for simultaneous robot assignment, tool change and preventive maintenance scheduling using heterogeneous graph neural network. Robot. Comput.-Integr. Manuf., 84, 2023.
Bhatta, K., Chang, Q., Integrating robot assignment and maintenance management: A multi-agent reinforcement learning approach for holistic control. IEEE Robot. Autom. Lett. 8:9 (2023), 5338–5344.
Bodnar, C., Day, B., Lió, P., Proximal distilled evolutionary reinforcement learning. Proceedings of the AAAI Conference on Artificial Intelligence, vol. 34, 2020, 3283–3290.
Boufellouh, R., Belkaid, F., Multi-objective approach to optimize production and maintenance scheduling in flow shop environment under non-renewable resources constraints. 2019 International Conference on Advanced Electrical Engineering ICAEE 2019, 2019, Institute of Electrical and Electronics Engineers Inc.
Cacereño, A., Greiner, D., Galván, B.J., Multi-objective optimum design and maintenance of safety systems: An in-depth comparison study including encoding and scheduling aspects with nsga-ii. Mathematics, 9(15), 2021.
Chang, C.C., Optimal preventive replacement last policy for a successive random works system with random lead time. Comm. Statist. Theory Methods 52:4 (2023), 1202–1216.
Chen, R., Yang, B., Li, S., Wang, S., A self-learning genetic algorithm based on reinforcement learning for flexible job-shop scheduling problem. Comput. Ind. Eng., 149, 2020, 106778.
Cui, W., Lu, B., A bi-objective approach to minimize makespan and energy consumption in flow shops with peak demand constraint. Sustain. (Switzerland), 12(10), 2020.
Cui, W., Lu, B., Energy-aware operations management for flow shops under TOU electricity tariff. Comput. Ind. Eng., 151, 2021.
Cui, W., Sun, H., Xia, B., Integrating production scheduling, maintenance planning and energy controlling for the sustainable manufacturing systems under TOU tariff. J. Oper. Res. Soc. 71:11 (2020), 1760–1779.
Detti, P., Nicosia, G., Pacifici, A., Zabalo Manrique de Lara, G., Robust single machine scheduling with a flexible maintenance activity. Comput. Oper. Res. 107 (2019), 19–31.
Diaz Cazanas, R., Delgado Sobrino, D.R., Caganova, D., Kostal, P., Velisek, K., Joint programming of production-maintenance tasks: A simulated annealing-based method. Int. J. Simul. Model. 18:4 (2019), 666–677.
Djassemi, M., Seifoddini, H., Analysis of critical machine reliability in manufacturing cells. J. Ind. Eng. Manag. 12:1 (2019), 70–82.
Do, P., Nguyen, V.T., Voisin, A., Iung, B., Neto, W.A.F., Multi-agent deep reinforcement learning-based maintenance optimization for multi-dependent component systems. Expert Syst. Appl., 245, 2024.
Dong, J., Ye, C., Research on two-stage joint optimization problem of green manufacturing and maintenance for semiconductor wafer. Math. Probl. Eng., 2020, 2020.
Drugan, M.M., Reinforcement learning versus evolutionary computation: A survey on hybrid algorithms. Swarm Evol. Comput. 44 (2019), 228–246.
Electricity Maps, Data portal. 2024 (Accessed: 17 July 2024).
European Commission, The European green deal. 2019 https://eur-lex.europa.eu/legal-content/EN/TXT/?uri=CELEX%3A52019DC0640Communication from the Commission to the European Parliament, the Council, the European Economic and Social Committee and the Committee of the Regions.
Farahani, A., Tohidi, H., Shoja, A., An integrated optimization of quality control chart parameters and preventive maintenance using Markov chain. Adv. Prod. Eng. Manag. 14:1 (2019), 5–14.
Feng, M., Li, Y., Predictive maintenance decision making based on reinforcement learning in multistage production systems. IEEE Access 10 (2022), 18910–18921.
Fu, T., Gao, W., Coley, C., Sun, J., Reinforced genetic algorithm for structure-based drug design. Adv. Neural Inf. Process. Syst. 35 (2022), 12325–12338.
Ghaleb, M., Taghipour, S., Sharifi, M., Zolfagharinia, H., Integrated production and maintenance scheduling for a single degrading machine with deterioration-based failures. Comput. Ind. Eng., 143(October 2019), 2020, 106432.
Giner, J., Lamprecht, R., Gallina, V., Laflamme, C., Sielaff, L., Sihn, W., Demonstrating reinforcement learning for maintenance scheduling in a production environment. IEEE International Conference on Emerging Technologies and Factory Automation, Vol. 2021-Septe, ETFA, 2021, Institute of Electrical and Electronics Engineers Inc.
Gupta, S., Jain, A., Assessing the effect of reliability-based maintenance approach in job shop scheduling with setup time and energy consideration using simulation; a simulation study. Smart Sci. 9:4 (2021), 283–304.
Gupta, S., Jain, A., Analysis of integrated preventive maintenance and machine failure in stochastic flexible job shop scheduling with sequence-dependent setup time. Smart Sci. 10:3 (2022), 175–197.
Gupta, S., Jain, A., Chan, F.T., Phanden, R.K., A study on simulation-based optimization of a stochastic flexible job shop scheduling undergoing preventive maintenance with sequence-dependent setup time. Int. J. Interact. Des. Manuf., 2023.
Gusenbauer, M., Haddaway, N.R., Which academic search systems are suitable for systematic reviews or meta-analyses? Evaluating retrieval qualities of google scholar, PubMed, and 26 other resources. Res. Synth. Methods 11:2 (2020), 181–217.
Hayes, C.F., Rădulescu, R., Bargiacchi, E., Källström, J., Macfarlane, M., Reymond, M., Verstraeten, T., Zintgraf, L.M., Dazeley, R., Heintz, F., et al. A practical guide to multi-objective reinforcement learning and planning. Auton. Agents Multi- Agent Syst., 36(1), 2022, 26.
Hedjazi, D., Layachi, F., Boubiche, D.E., A multi-agent system for distributed maintenance scheduling. Comput. Electr. Eng. 77 (2019), 1–11.
Hidri, L., Alqahtani, K., Gazdar, A., Badwelan, A., Integrated scheduling of tasks and preventive maintenance periods in a parallel machine environment with single robot server. IEEE Access 9 (2021), 74454–74470.
Huang, J., Chang, Q., Arinez, J., Deep reinforcement learning based preventive maintenance policy for serial production lines. Expert Syst. Appl., 160, 2020.
Jaber, M.Y., Givi, Z., Neumann, W.P., Incorporating human fatigue and recovery into the learning–forgetting process. Appl. Math. Model. 37:12–13 (2013), 7287–7299.
Jawahir, I., Dillon, O., Rouch, K., Joshi, K.J., Venkatachalam, A., Jaafar, I.H., Total life-cycle considerations in product design for sustainability: A framework for comprehensive evaluation. Proceedings of the 10th International Research/Expert Conference, Barcelona, Spain, vol. 1, 2006, Citeseer.
Jayasuriya, R.P., Amarasinghe, P.A., Abeygunawardane, S.K., Application of artificial intelligence for maintenance modelling of critical machines in solid tire manufacturing. 2021 International Conference on Innovative Trends in Information Technology, ICITIIT 2021, 2021, Institute of Electrical and Electronics Engineers Inc.
Kedy, G.S.M., Penda, M.C., Nneme Nneme, L., Mayi, O.T.S., Lehman, L.G., Enhancing the effectiveness of joint production and maintenance scheduling based on a multi-agent system and a pigouvian approach of externalities. Prod. Eng.(0123456789), 2024.
Köksal Ahmed, E., Li, Z., Veeravalli, B., Ren, S., Reinforcement learning-enabled genetic algorithm for school bus scheduling. J. Intell. Transp. Syst. 26:3 (2022), 269–283.
Lee, C.-Y., Machine scheduling with an availability constraint. J. Global Optim. 9:3 (1996), 395–416.
Mi, S., Feng, Y., Zheng, H., Li, Z., Gao, Y., Tan, J., Integrated intelligent green scheduling of predictive maintenance for complex equipment based on information services. IEEE Access 8 (2020), 45797–45812.
Mirahmadi, N., Taghipour, S., 2019. Energy-efficient optimization of flexible job shop scheduling and preventive maintenance. In: Proceedings - Annual Reliability and Maintainability Symposium, vol. 2019-Janua, Ryerson Univ, Dept Mech & Ind Engn, Reliabil Risk & Maintenance Res Lab, Toronto, ON M5B 2K3, Canada, (ISSN: 0149144X) ISBN: 9781538665541.
Mnih, V., Kavukcuoglu, K., Silver, D., Graves, A., Antonoglou, I., Wierstra, D., Riedmiller, M., Playing atari with deep reinforcement learning. 2013.
Muhuri, P.K., Shukla, A.K., Abraham, A., Industry 4.0: A bibliometric analysis and detailed overview. Eng. Appl. Artif. Intell. 78 (2019), 218–235.
Nguyen, V.T., Do, P., Vosin, A., Iung, B., Artificial-intelligence-based maintenance decision-making and optimization for multi-state component systems. Reliab. Eng. Syst. Saf., 228, 2022.
Ochella, S., Shafiee, M., Dinmohammadi, F., Artificial intelligence in prognostics and health management of engineering systems. Eng. Appl. Artif. Intell., 108, 2022, 104552.
Paraschos, P.D., Koulinas, G.K., Koulouriotis, D.E., Reinforcement learning for combined production-maintenance and quality control of a manufacturing system with deterioration failures. J. Manuf. Syst. 56 (2020), 470–483.
Paraschos, P.D., Koulinas, G.K., Koulouriotis, D.E., A reinforcement learning/ad-hoc planning and scheduling mechanism for flexible and sustainable manufacturing systems. Flex. Serv. Manuf. J., 2023.
Perera, Y.S., Ratnaweera, D., Dasanayaka, C.H., Abeykoon, C., The role of artificial intelligence-driven soft sensors in advanced sustainable process industries: A critical review. Eng. Appl. Artif. Intell., 121, 2023, 105988.
Qamhan, A.A., Ahmed, A., Al-Harkan, I.M., Badwelan, A., Al-Samhan, A.M., Hidri, L., An exact method and ant colony optimization for single machine scheduling problem with time window periodic maintenance. IEEE Access 8 (2020), 44836–44845.
Qin, W., Zhuang, Z., Liu, Y., Xu, J., Sustainable service oriented equipment maintenance management of steel enterprises using a two-stage optimization approach. Robot. Comput.-Integr. Manuf., 75, 2022, 102311.
Ran, Y., Zhou, X., Lin, P., Wen, Y., Deng, R., A survey of predictive maintenance: Systems, purposes and approaches. 2019.
Roijers, D.M., Vamplew, P., Whiteson, S., Dazeley, R., A survey of multi-objective sequential decision-making. J. Artificial Intelligence Res. 48 (2013), 67–113.
Rokhforoz, P., Fink, O., Maintenance scheduling of manufacturing systems based on optimal price of the network. Reliab. Eng. Syst. Saf., 217, 2022.
Ruiz Rodríguez, M.L., Kubler, S., de Giorgio, A., Cordy, M., Robert, J., Le Traon, Y., Multi-agent deep reinforcement learning based predictive maintenance on parallel machines. Robot. Comput.-Integr. Manuf., 78, 2022.
Ruiz-Rodríguez, M.L., Kubler, S., Robert, J., Le Traon, Y., Dynamic maintenance scheduling approach under uncertainty: Comparison between reinforcement learning, genetic algorithm simheuristic, dispatching rules. Expert Syst. Appl., 248, 2024, 123404.
Seidgar, H., Fazlollahtabar, H., Zandieh, M., Scheduling two-stage assembly flow shop with random machines breakdowns: integrated new self-adapted differential evolutionary and simulation approach. Soft Comput. 24:11 (2020), 8377–8401.
Sharifi, M., Taghipour, S., Optimal production and maintenance scheduling for a degrading multi-failure modes single-machine production environment. Appl. Soft Comput., 106, 2021, 107312.
Shen, J., Zhu, Y., A parallel-machine scheduling problem with periodic maintenance under uncertainty. J. Ambient. Intell. Humaniz. Comput. 10:8 (2019), 3171–3179.
Sin, I.H., Chung, B.D., Bi-objective optimization approach for energy aware scheduling considering electricity cost and preventive maintenance using genetic algorithm. J. Clean. Prod., 244, 2020.
Sutton, R.S., Barto, A.G., Reinforcement learning: An introduction. 2018, MIT Press.
United Nations, Transforming our world: the 2030 agenda for sustainable development. 2015, 16301.
Wang, Z., Deng, Q., Zhang, L., Li, H., Li, F., Joint optimization of integrated mixed maintenance and distributed two-stage hybrid flow-shop production for multi-site maintenance requirements. Expert Syst. Appl., 215, 2023.
Wang, J., Du, H., Xing, J., Qiao, F., Ma, Y., Novel energy- and maintenance-aware collaborative scheduling for a hybrid flow shop based on dual memetic algorithms. IEEE Robot. Autom. Lett. 5:4 (2020), 5613–5620.
Widiastuti, H., Sulistyani, A., Utami, E.R., Do environmental issues matter to investors?. International Conference on Sustainable Innovation Track Accounting and Management Sciences (ICOSIAMS 2021), 2022, Atlantis Press, 228–234.
Wu, C.H., Yao, Y.C., Dauzère-Pérès, S., Yu, C.J., Dynamic dispatching and preventive maintenance for parallel machines with dispatching-dependent deterioration. Comput. Oper. Res., 113, 2020, 104779.
Xia, T., Shi, G., Si, G., Du, S., Xi, L., Energy-oriented joint optimization of machine maintenance and tool replacement in sustainable manufacturing. J. Manuf. Syst. 59 (2021), 261–271.
Xia, T., Si, G., Shi, G., Zhang, K., Xi, L., Optimal selective maintenance scheduling for series–parallel systems based on energy efficiency optimization. Appl. Energy, 314, 2022, 118927.
Yan, Q., Wang, H., Wu, F., Digital twin-enabled dynamic scheduling with preventive maintenance using a double-layer Q-learning algorithm. Comput. Oper. Res., 144, 2022.
Yan, Q., Wang, H., Yang, S., A learning-assisted bi-population evolutionary algorithm for distributed flexible job-shop scheduling with maintenance decisions. IEEE Trans. Evol. Comput., 2024.
Yazdani, R., Alipour-Vaezi, M., Kabirifar, K., Salahi Kojour, A., Soleimani, F., A lion optimization algorithm for an integrating maintenance planning and production scheduling problem with a total absolute deviation of completion times objective. Soft Comput. 26:24 (2022), 13953–13968.
Ye, Z., Cai, Z., Yang, H., Si, S., Zhou, F., Joint optimization of maintenance and quality inspection for manufacturing networks based on deep reinforcement learning. Reliab. Eng. Syst. Saf., 236, 2023.
Yu, T.S., Han, J.H., Scheduling proportionate flow shops with preventive machine maintenance. Int. J. Prod. Econ., 231, 2021, 107874.