Dynamic Scheduling of Crane by Embedding Deep Reinforcement Learning into a Digital Twin Framework
Abstract
1. Introduction
2. An Integrated DT Application Framework for Crane Transportation Dynamic Scheduling
2.1. Workshop Physical Space
2.2. Twin Data Centre
2.3. Crane Scheduling Digital Twin Space
3. Problem Description and MDP Modelling
3.1. Crane Dynamic Scheduling Problem Description
3.2. Problem as MDP
3.2.1. Basic Knowledge of MDP
3.2.2. State Definition
3.2.3. Action Space
3.2.4. Reward Function
4. Methodology
4.1. Double DQN Algorithm
4.2. Crane Dynamic Scheduling Based on DRL-DT
5. Case Study
5.1. Test Platform and Case Data
5.2. Results Analysis and Discussion
6. Conclusions
- An integrated application framework (DRL-DT) that combines deep reinforcement learning with a digital twin is designed for the dynamic scheduling of cranes.
- The crane dynamic scheduling problem in the workshop is modelled as a Markov decision process (MDP), and detailed definitions of the state, action and reward are given.
- A method that couples the Double DQN (DDQN) algorithm with a logic simulation environment is developed under DRL-DT to schedule crane transportation dynamically, and a case study demonstrates its effectiveness.
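As a reminder of what distinguishes DDQN from plain DQN, the following is a minimal sketch of the standard Double DQN target computation, not the authors' implementation. The function name `double_dqn_targets`, the list-based batch layout and the default discount factor are assumptions made for illustration; in this setting each action index could stand for one dispatching rule, which is also an assumption.

```python
from typing import List, Sequence


def double_dqn_targets(
    rewards: Sequence[float],
    next_q_online: Sequence[Sequence[float]],
    next_q_target: Sequence[Sequence[float]],
    dones: Sequence[bool],
    gamma: float = 0.95,  # hypothetical discount factor
) -> List[float]:
    """Return one Double DQN regression target per transition.

    next_q_online[i][a]: online-network Q-value of action a in next state i
    next_q_target[i][a]: target-network Q-value of action a in next state i
    """
    targets = []
    for r, q_on, q_tg, done in zip(rewards, next_q_online, next_q_target, dones):
        # The online network selects the greedy action ...
        best_action = max(range(len(q_on)), key=lambda a: q_on[a])
        # ... and the target network evaluates it; terminal states do not bootstrap.
        bootstrap = 0.0 if done else gamma * q_tg[best_action]
        targets.append(r + bootstrap)
    return targets


if __name__ == "__main__":
    batch_targets = double_dqn_targets(
        rewards=[1.0, 0.0],
        next_q_online=[[0.2, 0.9], [0.5, 0.1]],
        next_q_target=[[0.4, 0.3], [0.7, 0.6]],
        dones=[False, True],
        gamma=0.9,
    )
    print(batch_targets)
```

Separating action selection (online network) from action evaluation (target network) is what reduces the overestimation bias of the single-network max operator in DQN.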
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
Rule | Description |
---|---|
FIFO | Select the job that arrived first |
SPT | Select the job with the shortest processing time |
STD | Select the job with the shortest transportation distance |
SRPT | Select the job with the shortest remaining processing time |
NVF | Select the job whose loading machine is nearest to the crane (Euclidean distance) |
LWT | Select the job with the longest waiting time |
FRO | Select the job with the fewest remaining operations |
MPI | Select the job with minimal physical interference from other cranes |
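The rules in the table can be read as interchangeable selection functions over the set of waiting jobs. The sketch below illustrates that reading under stated assumptions: the `Job` record, its field names and the `select_job` helper are hypothetical, and the NVF and MPI rules are left out because they need crane positions that this example does not model.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass
class Job:
    """Hypothetical job record; the field names are illustrative only."""
    name: str
    arrival_time: float        # when the job entered the workshop
    waiting_since: float       # when the job started waiting for a crane
    processing_time: float     # processing time of the next operation
    transport_distance: float  # distance of the pending crane move
    remaining_time: float      # total processing time still to be done
    remaining_ops: int         # number of operations still to be done


Rule = Callable[[List[Job], float], Job]

# Each rule maps (waiting jobs, current time) to the job served next.
RULES: Dict[str, Rule] = {
    "FIFO": lambda jobs, now: min(jobs, key=lambda j: j.arrival_time),
    "SPT": lambda jobs, now: min(jobs, key=lambda j: j.processing_time),
    "STD": lambda jobs, now: min(jobs, key=lambda j: j.transport_distance),
    "SRPT": lambda jobs, now: min(jobs, key=lambda j: j.remaining_time),
    "LWT": lambda jobs, now: max(jobs, key=lambda j: now - j.waiting_since),
    "FRO": lambda jobs, now: min(jobs, key=lambda j: j.remaining_ops),
}


def select_job(rule: str, jobs: List[Job], now: float) -> Job:
    """Apply the named dispatching rule to the waiting jobs."""
    if not jobs:
        raise ValueError("no waiting jobs")
    return RULES[rule](jobs, now)


if __name__ == "__main__":
    queue = [
        Job("A", 0.0, 8.0, 30.0, 12.0, 90.0, 3),
        Job("B", 2.0, 3.0, 20.0, 20.0, 60.0, 2),
        Job("C", 5.0, 6.0, 25.0, 5.0, 25.0, 1),
    ]
    for rule_name in RULES:
        print(rule_name, select_job(rule_name, queue, now=10.0).name)
```

Keeping the rules in one dictionary means a learning agent could treat the dictionary keys as a discrete action set, which is one plausible way to connect this table to the action space of the MDP.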
Job | Operation 1 | Operation 2 | Operation 3 | Operation 4 | Operation 5 |
---|---|---|---|---|---|
Job 1 | 1:37 | 3:54 | 5:57 | 8:36 | 9:31 |
Job 2 | 1:56 | 3:66 | 5:33 | 7:25 | 9:68 |
Job 3 | 1:43 | 4:48 | 6:58 | 8:72 | 9:22 |
Job 4 | 2:31 | 3:75 | 6:48 | 7:74 | 9:63 |
Job 5 | 1:74 | 4:41 | 5:80 | 8:43 | 9:44 |
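The case data can be loaded into a simple structure for inspection. The sketch below assumes that each cell reads "machine:processing time", which is a common convention but is not stated in the table itself; the names `CASE`, `parse_route`, `total_time` and `machine_load` are hypothetical.

```python
from collections import defaultdict
from typing import Dict, List, Tuple

# Cell values copied from the table; the "machine:time" reading is an assumption.
CASE: Dict[str, List[str]] = {
    "Job 1": ["1:37", "3:54", "5:57", "8:36", "9:31"],
    "Job 2": ["1:56", "3:66", "5:33", "7:25", "9:68"],
    "Job 3": ["1:43", "4:48", "6:58", "8:72", "9:22"],
    "Job 4": ["2:31", "3:75", "6:48", "7:74", "9:63"],
    "Job 5": ["1:74", "4:41", "5:80", "8:43", "9:44"],
}


def parse_route(entries: List[str]) -> List[Tuple[int, int]]:
    """Turn ["1:37", ...] into [(machine, processing_time), ...]."""
    route = []
    for entry in entries:
        machine, duration = entry.split(":")
        route.append((int(machine), int(duration)))
    return route


routes = {job: parse_route(entries) for job, entries in CASE.items()}

# Total processing time per job (crane transport is ignored here).
total_time = {job: sum(d for _, d in route) for job, route in routes.items()}

# Total processing time assigned to each machine.
machine_load: Dict[int, int] = defaultdict(int)
for route in routes.values():
    for machine, duration in route:
        machine_load[machine] += duration

if __name__ == "__main__":
    print(total_time)
    print(dict(sorted(machine_load.items())))
```

Under the same assumption, the largest per-job total and the largest per-machine load each give a simple lower bound on the makespan before any transport time is added.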
Methods | Makespan (No Failure) | Makespan (Failure) | Gap |
---|---|---|---|
DRL-DT | 1213 | 1533 | 320 |
FIFO | 1715 | 2420 | 705 |
SPT | 1688 | 2312 | 624 |
STD | 1805 | 2587 | 782 |
SRPT | 1732 | 2644 | 912 |
NVF | 1583 | 2331 | 748 |
LWT | 1611 | 2170 | 559 |
FRO | 1499 | 2198 | 699 |
MPI | 1536 | 2161 | 625 |
NVF-FRO | 1322 | 1785 | 463 |
NVF-MPI | 1443 | 1838 | 395 |
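The Gap column is the failure makespan minus the no-failure makespan, and the table also allows a relative comparison between DRL-DT and the best-performing rule. The short script below recomputes both from the values in the table; the variable names are illustrative, and the percentage comparison is a derived figure rather than one reported in the table.

```python
# (makespan without failure, makespan with failure), copied from the table.
RESULTS = {
    "DRL-DT": (1213, 1533),
    "FIFO": (1715, 2420),
    "SPT": (1688, 2312),
    "STD": (1805, 2587),
    "SRPT": (1732, 2644),
    "NVF": (1583, 2331),
    "LWT": (1611, 2170),
    "FRO": (1499, 2198),
    "MPI": (1536, 2161),
    "NVF-FRO": (1322, 1785),
    "NVF-MPI": (1443, 1838),
}

# Gap = makespan under failure - makespan without failure.
gaps = {method: fail - ok for method, (ok, fail) in RESULTS.items()}

rules = [m for m in RESULTS if m != "DRL-DT"]
best_rule_ok = min(rules, key=lambda m: RESULTS[m][0])
best_rule_fail = min(rules, key=lambda m: RESULTS[m][1])


def relative_reduction(baseline: int, value: int) -> float:
    """Fraction by which `value` is smaller than `baseline`."""
    return (baseline - value) / baseline


reduction_ok = relative_reduction(RESULTS[best_rule_ok][0], RESULTS["DRL-DT"][0])
reduction_fail = relative_reduction(RESULTS[best_rule_fail][1], RESULTS["DRL-DT"][1])

if __name__ == "__main__":
    print("smallest gap:", min(gaps, key=gaps.get), gaps[min(gaps, key=gaps.get)])
    print(f"vs {best_rule_ok} (no failure): {reduction_ok:.1%}")
    print(f"vs {best_rule_fail} (failure): {reduction_fail:.1%}")
```

With these values, DRL-DT has both the shortest makespan in each scenario and the smallest gap between scenarios, which is the pattern the table is meant to show.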
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.
© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Xu, Z.; Chang, D.; Sun, M.; Luo, T. Dynamic Scheduling of Crane by Embedding Deep Reinforcement Learning into a Digital Twin Framework. Information 2022, 13, 286. https://doi.org/10.3390/info13060286