Introduction

Nowadays, big data attain much attention from many scholars and enterprises, because it has huge potential in making high-quality decisions for various complex systems [1]. To be specific, it is intractable to collect and acquire large amounts of data from systems in a short time owing to diverse factors such as limited human and financial resources [2]. To predict the true data as accurately as possible, a large amount of obtained data is usually fitted into approximate distributions of parameters with big data technology [3,4,5]. As a result, a great deal of data collected from practical systems can be formulated as approximate expressions such as stochastic and fuzzy distributions, which makes modeling and optimization of real-world systems more challenging [6]. Intelligent optimization algorithms have been widely and resoundingly employed to handle diverse complicated optimization problems because of their powerful performance, e.g., strong search ability and high computation efficiency [7,8,9,10,11,12]. Recently, remanufacturing systems have received great concern from governments, because they play positive roles in environment protection and resource conservation [13]. Remanufacturing end-of-life (EOL) products usually faces much uncertainty because of their diverse usage processes in various kinds of customers [14]. In practice, managers and engineers of remanufacturing enterprises often collect a great deal of data which can be formulated as the approximate distributions as the above-mentioned approach.

The recent years have witnessed that much attention is given to efficiently promoting operation efficiency of manufacturing and remanufacturing systems via various approaches, e.g., planning and scheduling [15,16,17,18,19]. Disassembly is a very critical step in remanufacturing systems [20]. In a disassembly process, the disassembly line is usually employed to perform disassembly tasks [21, 22]. Currently, disassembly line balancing problems (DLBPs) are regarded as an essential issue in improving disassembly efficiency [23]. They aim at assigning a group of disassembly tasks to workstations while meeting precedence relationships of disassembly tasks to optimize some specified criteria [24]. Recently, DLBPs have attained wide concern from academic and industrial areas [25, 26]. To solve DLBPs, we have to determine a set of performed tasks and their assignment among workstations while obeying complex precedence constraints of disassembly tasks. Thus, it is difficult to effectively solve them, and they have proven to be NP-complete [27]. As a consequence, many meta-heuristic approaches have been used to handle DLBPs owing to their powerful search ability [28]. Zhang et al. [29] present an improved artificial fish swarm algorithm to minimize the number of workstations, total idle time, the sum of the maximum disassembly costs and disassembly direction change frequencies in a DLBP. Agrawal and Tiwari [30] adopt a collaborative ant colony optimization algorithm to deal with a stochastic disassembly line balancing and sequencing problem with minimizing the number of workstations, idle time and the probability of line stoppage. Wang et al. [31] aim at maximizing disassembly profit and minimizing the number of workstations, workload smoothness and energy consumption in a DLBP with uncertainty. They present a multi-objective discrete flower pollination algorithm to solve it. Zhu et al. [32] propose a parallel DLBP with minimizing the cycle time, number of workstations, idle index and quantity of disassembly resource. They use a multi-objective hybrid group neighborhood search algorithm to cope with it. Kalayci and Gupta [33] study a sequence-dependent DLBP with consideration of efficiency, safety and revenue. They design a hybrid approach combining a particle swarm optimization (PSO) and a neighborhood-based mutation operator to deal with it. Wang et al. [34] provide a genetic simulated annealing (SA) algorithm to achieve disassembly profit maximization as well as reach number of workstations and smoothness index minimization in a parallel DLBP. Çil et al. [35] design an ant colony optimization method to minimize the cycle time in a robotic DLBP. Xiao et al. [36] consider maximizing the total profit in a DLBP under uncertainty. An improved migrating birds optimization algorithm is developed to address it. Wang et al. [37] aim at maximizing the profit and minimizing the number of stations, the difference in workstation load and the energy consumption in a DLBP. They design an improved genetic algorithm to tackle it.

Through analyzing the existing studies, it can be found that most of them concentrate on disassembling only one EOL product on a disassembly line. Nevertheless, in practice, multiple products usually need to be disassembled together on a disassembly line in a planning horizon. Consequently, it is of great significance to deal with a multi-product disassembly line balancing problem to realize a global optimization of disassembling them together. We give an example with disassembling two products, i.e., \({\mathrm{A}}_{1}\) and \({\mathrm{A}}_{2}\), to describe the multi-product disassembly. Let \({x}_{1}\) and \({x}_{2}\) denote their disassembly plans that are made independently, while \({x}_{3}\) and \({x}_{4}\) represent their disassembly plans that are made together. Then, this work formulates their concise disassembly line balancing (DLB) models in Table 1, where \(f\left({x}_{i}\right)\) indicates the disassembly profit of \({x}_{i}\), \({t}_{j}\left({x}_{i}\right)\) denotes the disassembly time of \({x}_{i}\), \(i\in \left\{\mathrm{1, 2},\mathrm{3, 4}\right\}\), \(j\in \left\{\mathrm{1, 2}\right\}\), \({T}_{1}\) and \({T}_{2}\) are two given disassembly time limitations, and \({\Omega }_{1}\) and \({\Omega }_{2}\) are the constraints regarding product structures of \({\mathrm{A}}_{1}\) and \({\mathrm{A}}_{2}\).

Table 1 Models of DLB for disassembling \({\mathrm{A}}_{1}\) and \({\mathrm{A}}_{2}\)

By modeling the two DLBs for disassembling two products independently and together, respectively, we observe that the latter is difficult to solve and is more likely to achieve a globally optimal solution, because it has a larger solution space than the former. As a result, we prefer to deal with a DLB model by considering simultaneously disassembling multiple products in a planning horizon to reach a globally optimal solution. To this end, this work studies a multi-product disassembly line balancing problem (named as MDLBP for short). Additionally, it is necessary to take the uncertainty into consideration, because the disassembly process usually suffers from it as the result of diverse unpredictable and uncontrollable factors. Specifically, it is hard for decision-makers to accurately evaluate the disassembly time and set-up time of disassembly tasks in advance, since the various usage processes of EOL products result in the intractability of exactly knowing their information. To overcome this difficulty, the disassembly time and set-up time of disassembly tasks are expressed as random numbers obeying their known stochastic distributions. Moreover, a decision-maker needs to concern profit-desired and time-related criteria in practice. Hence, this work considers a stochastic MDLBP with the aim to reach the expected disassembly profit maximization while satisfying disassembly time constraints. By making comparisons between this work and the existing studies, we have mainly made the following three contributions:

  1. 1.

    This work proposes a novel stochastic MDLBP to achieve a global optimization for disassembling multiple products simultaneously. Then, a chance-constrained programming model with the objective of maximal disassembly profit is established to describe this problem mathematically.

  2. 2.

    We particularly design an enhanced group teaching optimization algorithm (EGTOA) incorporating a stochastic simulation approach to deal with the formulated model. In this method, EGTOA with some special strategies such as solution representation, decoding and local-best search approaches is designed to search candidate solutions from a solution domain, while the stochastic simulation approach is developed to assess the performance and feasibility of the obtained solutions.

  3. 3.

    By performing simulation experiments on several real-life cases and comparing the proposed method with five popularly known algorithms, i.e., group teaching optimization algorithm (GTOA) [38], PSO [39], improved gravitational search algorithm (IGSA) [40], variable neighborhood search (VNS) [41] and SA [42], we validate the effectiveness of the proposed method in addressing the studied problem.

The rest of this paper is organized as follows. The investigated problem is described in the section “Problem formulation”. Then, the section “Proposed method” introduces the basic GTOA and illustrates the solution algorithm. Next, simulation experiments are conducted and the results are analyzed in the section “Experimental study”. Finally, we conclude this paper and explore our future research in the section “Conclusion”.

Problem formulation

Problem statement

This work considers simultaneously disassembling multiple EOL products on a disassembly line in a planning period, and we propose a stochastic MDLBP with maximizing the expected disassembly profit subject to a given disassembly time restriction. MDLBP aims at assigning a series of disassembly tasks of each EOL product to workstations. The disassembly process must obey the precedence relationships of disassembly tasks. In the past years, AND/OR graphs have been triumphantly applied to describe the precedence constraints [43,44,45]. This work also employs AND/OR graphs to formulate the complex relationships of disassembly tasks. Let \(i\) indicate a component index, and \(j\) and \(k\) represent disassembly tasks indices. To construct an AND/OR graph for a product, this work defines the following three matrices:

  1. 1.

    Precedence matrix \(C=\left[{c}_{jk}\right]\): It demonstrates precedence and conflict relations of two disassembly tasks as

    $${c}_{jk}=\left\{\begin{array}{l}1, \quad \text{ if task } k \text{ can be executed after task } j\\ -1, \quad \text{ if task }j\text{ and task }k\text{ conflict with each other}\\ 0,\quad \text{otherwise}\end{array}.\right.$$
  2. 2.

    Succession matrix \(S=\left[{s}_{jk}\right]\): It denotes the relations between two adjacent disassembly tasks as

    $${s}_{jk}=\left\{\begin{array}{l}1,\quad \text{ if task }k\text{ can be executed next to task }j\\ 0,\text{ otherwise}\end{array}.\right.$$
  3. 3.

    Incidence matrix \(D=\left[{d}_{ij}\right]\): It describes the relations between components and disassembly tasks as

    $${d}_{ij}=\left\{\begin{array}{l}1,\quad\text{ if component }i\text{ is obtained by task }j \\ -1,\quad \text{if component }i\text{ is disassembled via task }j \\ 0,\quad \text{ otherwise}\end{array}.\right.$$

This work intends to simultaneously disassemble multiple products on a disassembly line. Thus, multiple AND/OR graphs are supposed to be adopted to describe their corresponding products. By employing AND/OR graphs, we can formulate the precedence constraints for all the disassembled products. Besides, the workload of each workstation is not allowed to exceed a given value, i.e., cycle time. To depict this constraint, the total disassembly time of the assigned disassembly tasks to a workstation cannot be larger than the cycle time.

Mathematical model

To formulate the mathematical model for the concerned problem, we define the notations as:

(1) Parameters

\(p\)

product index, \(p\in \{\mathrm{1,2},\dots ,P\}\), where \(P\) indicates the number of disassembled products

\(i\)

component index, \(i\in \{\mathrm{1, 2}, \dots ,{N}_{p}\}\), where \({N}_{p}\) represents the number of components of product \(p\)

\(j,k\)

task index, \(j,k \in \{\mathrm{0, 1},\mathrm{2}\dots ,{J}_{p}\}\), where \({J}_{p}\) means the number of tasks of product \(p\), and 0 is a dummy task

\(m\)

workstation index, \(m\in \{\mathrm{1,2}, \dots ,M\}\), where \(M\) denotes the number of workstations

\({t}_{pj}^{^{\prime}}\)

executing time of task \(j\) on product \(p\)

\({t}_{pjk}^{{{\prime}}{{\prime}}}\)

set-up time of executing task \(k\) if it is executed after task \(j\) on product \(p.\)

\({c}_{pj}^{^{\prime}}\)

cost per time unit of executing task \(j\) on product \(p.\)

\({c}_{pjk}^{{{\prime}}{{\prime}}}\)

cost per time unit of setting up task \(k\) if it is executed after task \(j\) on product \(p\)

\(F\)

fixed cost of opening a workstation

\({r}_{pi}\)

recycling value of component \(i\) of product \(p\)

\(CT\)

cycle time

\(T\)

a given disassembly time limitation

\(\alpha \)

confidence level of the disassembly time regarding \(CT\)

\(\beta \)

confidence level of the disassembly time regarding \(T\)

\({D}_{p}\)

incidence matrix of an AND/OR graph of product \(p\)

\({S}_{p}\)

succession matrix of an AND/OR graph of product \(p\)

\({d}_{pij}\)

an element in row \(i\) and column \(j\) of \({D}_{p}\)

\({s}_{pjk}\)

an element in row \(j\) and column \(k\) of \({S}_{p}\)

It is worth mentioning here that \({t}_{pj}^{^{\prime}}\) and \({t}_{pjk}^{{{\prime}}{{\prime}}}\) are random.

(2) Decision variables

\({w}_{pj}\)

task indication. If task \(j\) is executed on product \(p\), \({w}_{pj}=1.\) Otherwise, \({w}_{pj}=0\)

\({x}_{pjm}\)

assignment indication. If task \(j\) is assigned to workstation \(m\) on product \(p\), \({x}_{pjm}=1.\) Otherwise, \({x}_{pjm}=0\)

\({y}_{pjk}\)

adjacent indication. If task \(k\) is executed after task \(j\) on product \(p\), \({y}_{pjk}=1.\) Otherwise, \({y}_{pjk}=0\)

\({z}_{m}\)

open indication. If workstation \(m\) is opened,\({ z}_{m}=1\). Otherwise, \({z}_{m}=0\).

By adopting the above-mentioned indices, a chance-constrained programming model for the considered MDLBP can be established as

$$\mathrm{max}\,E\,\left(\sum_{p=1}^{P}\sum_{j=1}^{{J}_{p}}\sum_{i=1}^{{N}_{p}}{{d}_{pij}r}_{pi}{w}_{pj}-\sum_{p=1}^{P}\sum_{j=1}^{{J}_{p}}{t}_{pj}^{^{\prime}}{c}_{pj}^{^{\prime}}{w}_{pj}-\sum_{p=1}^{P}\sum_{j=1}^{{J}_{p}}\sum_{k=1}^{{J}_{p}}{t}_{pjk}^{{{\prime}}{{\prime}}}{c}_{pjk}^{{{\prime}}{{\prime}}}{y}_{pjk}-\sum_{m=1}^{M}{z}_{m}F\right).$$
(1)

s.t.

$$\sum_{m=1}^{M}{x}_{pjm}=1,\quad j=\mathrm{1, 2},\dots ,{J}_{p},\quad p=\mathrm{1, 2},\dots ,P.$$
(2)
$$\sum_{m=1}^{M}{x}_{pjm}\ge \sum_{m=1}^{M}{x}_{pkm} ,\quad \forall {y}_{pjk}=1,\,\,p=\mathrm{1, 2},\dots ,P.$$
(3)
$${s}_{pjk}-{y}_{pjk}\ge 0,\quad j,k=\mathrm{1, 2},\dots {J}_{p} ,\,\,p=\mathrm{1, 2},\dots ,P.$$
(4)
$$0\le \sum_{j=0}^{{J}_{p}}{d}_{pij}{w}_{pj}\le 1,\quad i=\mathrm{1, 2},\dots ,{N}_{p},\,\,p=\mathrm{1, 2},\dots ,P.$$
(5)
$$\mathit{Pr}\left\{E(\sum_{j=1}^{{J}_{p}}{{t}_{pj}^{^{\prime}}x}_{pjm}{w}_{pj}+\sum_{j=1}^{{J}_{p}}\sum_{k=1}^{{J}_{p}}{t}_{pjk}^{{{\prime}}{{\prime}}}{x}_{pkm}{y}_{pjk})\le CT\right\}\ge \alpha ,\quad m=\mathrm{1, 2},\dots ,M,\,\,p =\mathrm{1, 2},\dots ,P.$$
(6)
$$\mathit{Pr}\left\{E(\sum_{p=1}^{P}\sum_{j=1}^{{J}_{p}}{t}_{pj}^{^{\prime}}{w}_{pj}+\sum_{p=1}^{P}\sum_{j=1}^{{J}_{p}}\sum_{k=1}^{{J}_{p}}{t}_{pjk}^{{{\prime}}{{\prime}}}{y}_{pjk})\le T\right\}\ge \beta .$$
(7)
$${w}_{pj},{x}_{pjm},{y}_{pjk},{z}_{m} \in \left\{\mathrm{0, 1}\right\},\,\,j,k=\mathrm{1, 2},\dots ,{J}_{p},\,\,m=\mathrm{1, 2},\dots ,M,\,\,p=\mathrm{1, 2},\dots ,P.$$
(8)

Here, (1) demonstrates that the objective is to achieve the expected disassembly profit maximization. The disassembly profit is random, since it relates to random disassembly time and set-up time. \(E(\eta )\) indicates an expected value of a random variable \(\eta \). (2) guarantees that a disassembly task can be allocated to a workstation at most. (3) represents that the precedence relationships among tasks must be satisfied. (4) reveals that a feasible disassembly task sequence of an EOL product should meet precedence and conflict relationships. (5) denotes the incidence relationships between tasks and components of an EOL product. (6) indicates a chance constraint for the actual workload per workstation, where \(\alpha \) is the probability that the actual workload of a workstation is not larger than \(CT\). (7) illustrates a chance constraint for the disassembly time and \(\beta \) is the probability that the disassembly time is less than or equal to \(T\). (8) gives the range of decision variables.

Proposed method

Compared with disassembling only one product on a disassembly line in the literature [40], the studied problem has some unique characteristics. For example, it has a larger decision space, leading to the difficulty of finding the optimal solutions. Moreover, acquiring feasible solutions is very hard because of the disassembly time requirements, workstation availability, and complicated precedence relationships of disassembly tasks. Thus, we have to design an effective optimizer to search promising and feasible solutions as quickly as possible. Furthermore, the disassembly process usually faces a great deal of uncertainty, because it is intractable to accurately know the information of EOL products beforehand as a result of their various usage processes in diverse customers [46,47,48,49,50]. Therefore, we need to cope with the noise from random disturbance. To overcome the above challenges, we introduce an intelligent optimization algorithm named as EGTOA incorporating a stochastic simulation approach.

Basic framework of GTOA

GTOA is a novel population-based meta-heuristic algorithm to solve continuous optimization problems [38]. By simulating the group teaching mechanism in which students are taught according to their aptitude, it can effectively deal with varied optimization problems [38]. In GTOA, a student represents an individual, i.e., a solution, and his knowledge denotes the fitness value of this solution. GTOA mainly contains four phases, i.e., teacher allocation, ability grouping, teacher, and student phases. These four phases are repeated until a given termination criterion is reached. In teacher allocation phase, the teacher is determined and allocated. In ability grouping phase, all students are divided into two groups, i.e., outstanding and average groups, based on their ability of accepting knowledge. In teacher phase, a student can learn knowledge from the teacher and teaching plans are different between outstanding group and average group. In student phase, a student can acquire knowledge by self-learning and interaction with other students. This work sums up the procedure of basic GTOA as [38]:

  • Step 1: Set algorithm parameters including the maximum number of fitness evaluations \({T}_{m}\), the current number of fitness evaluations \({T}_{c}\) and population size \(Q\).

  • Step 2: Randomly generate and evaluate an initial population.

  • Step 3: Allocate the teacher \({T}^{t}\) by selecting the top three individuals and calculate \({T}^{t}\).

  • Step 4: Divide the population into an outstanding group \({v}_{g}^{t}\) and an average group \({v}_{b}^{t}\) based on their fitness values.

  • Step 5: Conduct teaching and learning phases by using the following two steps:

    • Step 5.1: Perform the teacher and student phases on \({v}_{g}^{t}\), and then obtain \({v}_{g}^{t+1}\).

    • Step 5.2: Perform the teacher and student phases on \({v}_{b}^{t}\), and then obtain \({v}_{b}^{t+1}\).

  • Step 6: Construct a new population which comprises of \({v}_{g}^{t+1}\) and \({v}_{b}^{t+1}\).

  • Step 7: Stop this iteration process if a given termination criterion is met. Otherwise, it goes to Step 3.

GTOA has many merits such as utilizing fewer parameters than most of other meta-heuristic algorithms, easy realization and great computation efficiency. It has shown admirable performance in handling varied complex optimization problems such as shading problems in photovoltaic systems [51] and engineering design problems [52]. However, as far as we know, GTOA has not been adopted to address DLBPs and its potential in industrial applications needs to be further investigated. Thus, to extend it to handle the proposed problem and explore its performance, this work designs an enhanced GTOA with some special strategies such as the solution representation and a local-best search method according to the features of the concerned problem. Its detailed design is demonstrated below.

The framework of EGTOA

Solution representation and population initialization

In EGTOA, a double-linked list structure with real values is taken as an encoding form based on the features of the concerned problem. An individual, i.e., a solution, can be represented as \(v=({v}_{1},{v}_{2},\dots ,{v}_{P})\), where \({v}_{p}\) indicates a disassembly decision of product \(p\), and it is expressed as \(\left({v}_{p}^{^{\prime}}, {v}_{p}^{{{\prime}}{{\prime}}}\right), p=\mathrm{1, 2},\dots ,P\). \({v}_{p}^{^{\prime}}=\left\{{o}_{p1},{o}_{p2},\dots ,{o}_{p{J}_{p}}\right\}\) denotes a disassembly task sequence decision of product \(p\), while \({v}_{p}^{{{\prime}}{{\prime}}}=\left\{{x}_{p1},{x}_{p2},\dots ,{x}_{p{J}_{p}}\right\}\) indicates a disassembly task execution decision of product \(p\). Notice that \(\left({v}_{p}^{^{\prime}}, {v}_{p}^{{{\prime}}{{\prime}}}\right)\) is defined in continuous decision space. It must be transformed into corresponding \(\left({\varphi }_{p}^{^{\prime}},{\varphi }_{p}^{{{\prime}}{{\prime}}}\right)\) locating in the discrete decision space for evaluations. To do it, we design a decoding method as: 1) For the disassembly task sequence decision of product \(p\), we map the \(j\)-th smallest value in \({v}_{p}^{^{\prime}}\) to the \(j\)-th task in \({\varphi }_{p}^{^{\prime}}\). 2) For the disassembly task execution decision of product \(p\), if a value in \({v}_{p}^{{{\prime}}{{\prime}}}\) is larger than 0.5, the value on the corresponding position in \({\varphi }_{p}^{{{\prime}}{{\prime}}}\) is 1, and otherwise 0. According to this method, we can transform \(\left({v}_{p}^{^{\prime}}, {v}_{p}^{{{\prime}}{{\prime}}}\right)\) into \(\left({\varphi }_{p}^{^{\prime}},{\varphi }_{p}^{{{\prime}}{{\prime}}}\right)\) effectively. Note that \({\varphi }_{p}^{{{\prime}}{{\prime}}}\) is a binary vector, and each element in \({\varphi }_{p}^{{{\prime}}{{\prime}}}\) indicates whether the disassembly task on the corresponding position in \({\varphi }_{p}^{^{\prime}}\) is executed. If it is equal to 1, this disassembly task is executed, and otherwise, it is not executed. To describe the above-mentioned transformation process more clearly, we give an example with 5 disassembly tasks of product \(p\), as shown in Table 2.

Table 2 Decoding method of disassembly task sequence and disassembly task execution decision

In addition, to guarantee the search efficiency of EGTOA in the tremendous decision space, we have to generate feasible solutions that meet precedence and conflict relationships. When infeasible solutions arise, this work uses the following method to adjust them: (1) The disassembly task sequence \({\varphi }_{p}^{^{\prime}}\) is repaired to satisfy precedence relationships based on precedence matrix \(C\), \(p=\mathrm{1, 2},\dots ,P\). (2) The disassembly task execution decision \({\varphi }_{p}^{{{\prime}}{{\prime}}}\) is modified to meet the precedence and conflict relationships according to precedence matrix \(C\) and succession matrix \(S\), \(p=\mathrm{1, 2},\dots ,P\).

Notice that the above solution representation does not give the assignment of disassembly tasks among workstations. This work proposes the decoding rules as: (1) On a disassembly line, each EOL product is disassembled according to its disassembly decision shown in a solution. (2) A workstation is initially opened. Each executed disassembly task is sequentially assigned to the earliest available workstation based on the disassembly order of products that it belongs to. If the workload of a workstation exceeds the given \(CT\), another workstation is opened.

According to the above-mentioned methods, an initial population containing Q feasible solutions can be randomly produced.

Solution evaluation

To describe uncertainty mathematically, this work defines the disassembly time and set-up time of disassembly tasks as random numbers which obey their known random distributions. Consequently, the objective function value is expressed as an expected value, and thus, we cannot directly calculate it like addressing deterministic optimization problems. Monte Carlo simulation is regarded as an effective way to assess probability functions [53, 54]. It adopts the average objective function value of a vast number of samples to approximate the expected value. In theory, this approach can achieve the true objective function value if the number of samples reaches infinite. Nevertheless, we have to do such simulation with the limited computation resources in reality. Therefore, to make good use of the given computation resources for evaluating the performance and feasibility of obtained solutions in a search process, this work develops a stochastic simulation approach. Let \(\Phi \) be an evaluated solution and \(L\) be the number of samples. The method is described as follows:

  • Step 1: Initialize \(\overline{A }(\Phi )=0\), \({T}_{l}\left(\Phi \right)=0, {F}_{l}\left(\Phi \right)=0, {N}_{l}\left(\Phi \right)=1\), and \(L=30\).

  • Step 2: Generate \(L\) samples and each sample \({\gamma }_{l}\) includes the disassembly time and set-up time produced from their random distributions, \(l=\mathrm{1,2},\dots ,L\).

  • Step 3: Calculate disassembly time \({T}_{l}\left(\Phi \right)\) and the number of opened workstations \({N}_{l}\left(\Phi \right)\), \(l=\mathrm{1,2},\dots ,L\).

  • Step 4: Calculate the fixed cost of opened workstations \({F}_{l}\left(\Phi \right)\) and the disassembly profit \({A}_{l}\left(\Phi \right)\), \(l=\mathrm{1,2},\dots ,L\).

  • Step 5: Sort \({T}_{l}\left(\Phi \right), l=1, 2,\dots ,L\), in an ascending order and let \(l^{\prime} = \left\lfloor {\beta \cdot L} \right\rfloor\). If \({T}_{{l}^{^{\prime}}}\left(\Phi \right)\le T\), \(\Phi \) satisfies Eq. (7) and go to Step 6. Otherwise, \(\Phi \) is infeasible, and let \(\overline{A }\left(\Phi \right)\) equal to \(0.0001\).

  • Step 6: Compute the expected disassembly profit \(\overline{A }(\Phi )\) as:

    $$\overline{A }\left(\Phi \right)=\frac{{\sum }_{l=1}^{L}{A}_{l}\left(\Phi \right)}{L}.$$
    (9)
  • Step 7: Return \(\overline{A }(\Phi )\) as the approximate objective function value of \(\Phi \).

Teacher allocation phase

Basic GTOA adopts the top three individuals obtained in a search process to guide the population to search promising solutions. Let \({v}_{1}^{t}\), \({v}_{2}^{t}\) and \({v}_{3}^{t}\) be the first, second and third best students, respectively. \({T}^{t}\) denotes the teacher, and \(f\left(x\right)\) represents the objective function value of \(x\). The teacher allocation can be defined as:

$${T}^{t}=\left\{\begin{array}{l}{v}_{1}^{t},\quad f\left(\frac{{v}_{1}^{t}+{v}_{2}^{t}+{v}_{3}^{t}}{3}\right)<f\left({v}_{1}^{t}\right)\\ \frac{{v}_{1}^{t}+{v}_{2}^{t}+{v}_{3}^{t}}{3},\quad f\left(\frac{{v}_{1}^{t}+{v}_{2}^{t}+{v}_{3}^{t}}{3}\right)\ge f\left({v}_{1}^{t}\right) \end{array}.\right.$$
(11)

Ability grouping phase

Basic GTOA divides all students into two groups, i.e., an outstanding group and an average group, in terms of their ability of accepting knowledge. According to their fitness values, the top \(Q/2\) individuals in the class constitute the outstanding group \({v}_{g}^{t}\), while the rest of the class composes the average group \({v}_{b}^{t}\). Notice that the ability grouping is dynamic. It is performed again after a learning cycle. In addition, to accelerate the convergence of this algorithm, outstanding and average groups share the same teacher.

Teacher phase

Basic GTOA considers that students are able to gain knowledge from a teacher. The teacher makes different teaching plans for two groups according to their ability of accepting knowledge. For \({v}_{g}^{t}\), the teacher attempts to increase the mean knowledge of the whole class. It can be represented as:

$${v}_{ei}^{t+1}={v}_{i}^{t}+a\left({T}^{t}-F\left(b\cdot {M}^{t}+c\cdot {v}_{i}^{t}\right)\right),$$
(11)

where \({v}_{ei}^{t+1}\) is the knowledge that student \(i\) learns from the teacher at time \(t+1\), \({v}_{i}^{t}\) and \({M}^{t}\) are the knowledge of student \(i\) and the mean knowledge of the whole class at time \(t\), and \(t\) denotes the iteration times. \(a\), \(b\), and \(c\) are random numbers ranging from 0 to 1. The sum of \(b\) and \(c\) is equal to 1. \(F\) is the teaching factor denoting the teaching result of the teacher, and it is a constant equaling to 1 or 2.

Different from \({v}_{g}^{t}\), \({v}_{b}^{t}\) is poor in accepting knowledge. Generally, a teacher gives more attention to \({v}_{b}^{t}\) than \({v}_{g}^{t}\). As a result, the teacher tries to particularly enhance the knowledge of the students of \({v}_{b}^{t}\). It is presented as Eq.(12):

$${v}_{ei}^{t+1}={v}_{i}^{t}+2d\left({T}^{t}-{v}_{i}^{t}\right),$$
(12)

where \(d\) is a random number between 0 and 1.

It is worth noting that students may not be able to acquire knowledge in this phase, which can be formulated by Eq.(13):

$${v}_{ei}^{t+1}=\left\{\begin{array}{c}{v}_{ei}^{t+1},\quad f\left({v}_{i}^{t}\right)<f\left({v}_{ei}^{t+1}\right)\\ {v}_{i}^{t}, \quad f\left({v}_{i}^{t}\right)\ge f\left({v}_{ei}^{t+1}\right)\end{array}.\right.$$
(13)

Student phase

In addition to acquiring knowledge from the teacher, a student can gain knowledge with two methods, i.e., self-learning and interaction with other students. They can be expressed as Eq.(14):

$${v}_{si}^{t+1}=\left\{\begin{array}{c}{v}_{ei}^{t+1}+e\left({v}_{ei}^{t+1}-{v}_{ej}^{t+1}\right)+g\left({v}_{ei}^{t+1}-{v}_{i}^{t}\right), f\left({v}_{ej}^{t+1}\right)<f\left({v}_{ei}^{t+1}\right)\\ {v}_{ei}^{t+1}-e\left({v}_{ei}^{t+1}-{v}_{ej}^{t+1}\right)+g\left({v}_{ei}^{t+1}-{v}_{i}^{t}\right), f\left({v}_{ej}^{t+1}\right)\ge f\left({v}_{ei}^{t+1}\right)\end{array},\right.$$
(14)

where \(e\) and \(g\) are two random numbers between 0 and 1, \(j\) denotes a student which is randomly selected from the class, and \(i\) represents the current student. \({v}_{si}^{t+1}\) is the knowledge that student \(i\) learns at time \(t+1\), while \({v}_{ej}^{t+1}\) is the knowledge that student \(j\) learns in teacher phase at time \(t+1\). In Eq. (14), the self-learning and interaction from other students are defined as the second and third items on the right, respectively.

Notice that a student may not be able to gain knowledge in student phase. Equation (15) is employed to formulate this situation. It is given as:

$${v}_{i}^{t+1}=\left\{\begin{array}{c}{v}_{ei}^{t+1},\quad f\left({v}_{si}^{t+1}\right)<f\left({v}_{ei}^{t+1}\right)\\ {v}_{si}^{t+1},\quad f\left({v}_{si}^{t+1}\right)\ge f\left({v}_{ei}^{t+1}\right)\end{array},\right.$$
(15)

where \({v}_{i}^{t+1}\) is the knowledge of student \(i\) at time \(t+1\).

Local-best search method

To strengthen the exploitation ability of EGTOA, a local-best search approach is developed to refine the best solution obtained in the search process. It can be described as: (1) the best solution \({v}_{i}\) is chosen. Then, two positions are randomly selected in its first string \({v}_{i}^{^{\prime}}\) and values on them are swapped. If the value \(\rho \) on the corresponding position in the second string \({v}_{i}^{{{\prime}}{{\prime}}}\) is less than 0.5, then \(\rho \) equals to 0. Otherwise, \(\rho \) is equal to 1. Thus, a neighborhood solution can be produced. (2) The searched neighborhood solution in the continuous decision space is transformed into a corresponding solution in the discrete decision space to evaluate its performance and feasibility. (3) If the transformed neighborhood solution locating in the discrete decision space is infeasible, we adopt the method in the section “Solution representation and population initialization” to adjust it. Otherwise, the new solution and original solution are directly compared. The better one is chosen as the current best solution. (4) If a given termination condition is satisfied, i.e., reaching the maximum number of neighborhood solutions \(I\), we output the current best solution. Otherwise, we repeat (1–3). Nevertheless, the computation resources are usually limited. Consequently, to make efficient use of the limited computation resources, the local-best search method is applied with probability \(R\) at each iteration.

Procedure of EGTOA

Having discussed all the phases of EGTOA in detail, we summarize its framework as: (1) Algorithm parameters are set, and an initial population containing a set of randomly generated solutions is produced and evaluated. (2) The teacher is allocated. (3) All students are divided into outstanding and average groups. (4) The teacher and student phases are sequentially performed on these two groups. (5) The local-best search is conducted with probability \(R\). EGTOA repeats (2–5) until a given termination criterion is reached. At last, the best solution is output. The procedure of EGTOA is given in Algorithm 1.

figure a

Experimental study

To examine the performance of EGTOA in handing the concerned problem, this work carries out simulation experiments on a set of randomly produced test problems, and makes comparisons between EGTOA and five competitive approaches: PSO [39], IGSA [40], VNS [41], SA [42], and GTOA [38]. PSO and IGSA are two population-based meta-heuristics. PSO has been triumphantly extended to deal with varied complex optimization problems [8, 55, 56] including DLBPs [40]. IGSA has been employed to solve DLBPs [40]. This work extends them to deal with the considered problem as the design in the study [40] which also focuses on solving DLBPs. Because EGTOA uses a local-best search method, we also choose two single-solution-based meta-heuristics, i.e., VNS and SA, for comparisons. They have also been applied to tackle DLBPs in the existing studies and we extend them to solve the investigated problem as the literature [41, 42]. Furthermore, to test the effectiveness of the local-best search method, GTOA [38] is also taken as a compared method.

Test instance generation

This work adopts four products that are widely used for experiments in the existing literature, i.e., a radio set [57], a copy machine [57], a hammer drill [58] and a microwave oven [58], to randomly generate five test problems with \(n\) EOL products, \(n\in \{\mathrm{2, 4},\mathrm{6, 8}, 10\}\). Each test problem contains three instances. The data for each instance are randomly produced from the intervals in Table 3. Besides, we assume that the disassembly time and set-up time of disassembly tasks follow their corresponding truncated normal distribution. Their mean time is randomly generated from the intervals, as shown in Table 3. Their standard deviation is equal to a product of their mean time and 0.00001. Additionally, the cycle time and fixed cost of opening a workstation are constants, and they are equal to 100 and 10, respectively. Furthermore, to set the given disassembly time limitation of an instance, this work randomly produces a solution of the instance and calculates its disassembly time, then let this disassembly time multiplied by 3 be the given disassembly time limitation of this instance. The confidence levels \(\alpha \) and \(\beta \) are both equal to 0.90.

Table 3 Test instance generation method

Parameter settings

The parameter settings of PSO and IGSA are obtained from their original studies [39, 40]. PSO has been successfully used to cope with a DLBP in [40]. It can acquire relatively better results when its population size is set to 50 as the setting in [40]. Other parameter settings of PSO refer to its original study [39]. The parameter settings of IGSA, VNS and SA are derived from existing studies on addressing DLBPs in [40,41,42]. Table 4 gives the parameter settings of PSO, IGSA, VNS and SA. In addition, to make a fair comparison, the number of fitness evaluations is taken as a stopping criterion equaling to 30 \(P\cdot U\), where \(U\) represents the maximum number of disassembly tasks in the solved instance.

Table 4 Parameter settings of PSO, IGSA, VNS and SA

To obtain a relatively better parameter combination for EGTOA, an Orthogonal Experiment [59] is conducted. There are three parameters in EGTOA and each of them has four levels, i.e., \(Q\in \{\mathrm{20, 40, 60, 80}\}\), \(R\in \{\mathrm{0.05, 0.10, 0.15, 0.20}\}\) and \(I\in \{\mathrm{20, 40, 60, 80}\}\). Hence, an orthogonal array \({L}_{16}({4}^{3})\) can be constructed. It contains 16 different parameter combinations, as shown in Table 5. EGTOA with each parameter combination performs 20 times for solving an instance with two EOL products. The obtained best disassembly profit value of each time is used as the response variable and the average response variable (ARV) of 20 times is applied to analyze the experimental results.

Table 5 Orthogonal array and ARV

Table 5 gives the ARVs of EGTOA with all parameter combinations. It can be found that EGTOA can achieve a better result when \(Q=20\), \(R=0.20\) and \(I=80\). Thus, this parameter combination is employed in the following experiments. To make a fair and valid comparison between EGTOA and GTOA, the population size of GTOA is set to 20.

Comparison analysis and discussion

In this section, we conduct comparison experiments to test the performance of EGTOA and its peer approaches. EGTOA, GTOA, PSO, IGSA, VNS and SA are applied to solve each instance 20 times. The mean and standard deviation (std) values over 20 times are used to analyze their performance. Table 6 shows the experimental results of EGTOA and two population-based meta-heuristic methods, i.e., PSO and IGSA. From Table 6, by comparing EGTOA with PSO, it can be observed that EGTOA can obtain better mean disassembly profit values than PSO on 9 out of 15 instances. Via making a comparison between EGTOA with IGSA, we can see that the mean disassembly profit values of EGTOA are larger than those of IGSA on all the instances. Besides, the average values of mean disassembly profit values on all the instances of EGTOA, PSO and IGSA are 14,615.71, 14,559.37 and 14,224.00, respectively. We can find that the average value of mean disassembly profit values on all the instances of EGTOA is better than that of PSO and IGSA. Table 7 gives the experimental results of EGTOA and two single-solution-based meta-heuristic methods, i.e., VNS and SA. From Table 7, we can observe that EGTOA can obtain better mean disassembly profit values than VNS on all the instances, and EGTOA can gain better mean disassembly profit values than SA on 14 out of 15 instances. The standard deviation values of EGTOA are smaller than those of VNS and SA on all the instances. In addition, the average values of mean disassembly profit values on all the instances of EGTOA, VNS and SA are 14,615.71, 13,154.90 and 13,074.40, respectively. We can observe that EGTOA can achieve a better performance than VNS and SA in terms of the average value of mean disassembly profit values on all the instances. The above-mentioned comparison results demonstrate that EGTOA outperforms PSO, IGSA, VNS and SA in handling the considered problem.

Table 6 Comparison results of EGTOA, PSO and IGSA
Table 7 Comparison results of EGTOA, VNS and SA

Besides, to verify the effectiveness of the local-best search method in EGTOA, comparison experiments between EGTOA and GTOA are conducted. Table 8 gives their experimental results. It is obvious that the mean disassembly profit values of EGTOA are greater than those of GTOA on all the instances. In addition, it can be found that the average value of mean disassembly profit values on all the instances of EGTOA is larger than that of GTOA. The comparison results indicate that EGTOA performs better than GTOA in solving the concerned problem. Thus, it can be verified that the local-best search method is beneficial to enhancing the performance of EGTOA.

Table 8 Comparison results of EGTOA and GTOA

Additionally, to further validate the effectiveness of EGTOA, the \(t\)-test with 38 degrees of freedom at a 0.10 level of significance is adopted in this work. The symbol “ + ” means that EGTOA is significantly better than its peers, “–” indicates that EGTOA is significantly worse than its peers, and “∼” denotes that EGTOA is statistically equivalent to its peers. Table 9 exhibits the comparison results of \(t\)-test. It is obvious that EGTOA is significantly better than GTOA on 9 instances and statistically equivalent to GTOA on the rest. EGTOA is significantly better than PSO on 7 instances and statistically equivalent to PSO on the rest. EGTOA is significantly better than IGSA and VNS on all the instances. EGTOA is significantly better than SA on 14 out of 15 instances and statistically equivalent to SA on 1 instance. Via analyzing the comparison results of \(t\)-test, it can be demonstrated that EGTOA is superior to five competitive methods, i.e., GTOA, PSO, IGSA, VNS and SA, in handling the concerned problem.

Table 9 Comparison results of \(t\)-test

Moreover, to visually exhibit the experimental results of six algorithms, i.e., EGTOA, GTOA, PSO, IGSA, VNS and SA, this work draws their boxplot and convergence curve graphs on all the instances as given in Figs. 1 and 2, respectively. From Fig. 1, it can be observed that the obtained results of EGTOA are much better and more concentrated than its five peers on most instances. Furthermore, from Fig. 2, we can see that EGTOA can converge to a better disassembly profit value faster than GTOA, PSO, IGSA, VNS and SA on most instances. Thus, the convergence and optimality of EGTOA are much better than those of its five peers. Via analyzing the above-mentioned experimental results, it can be declared that EGTOA performs better than its five compared methods in tackling the concerned problem.

Fig. 1
figure 1

Boxplot of EGTOA, GTOA, PSO, IGSA, VNS and SA on all the instances

Fig. 2
figure 2

Convergence curve graphs of EGTOA, GTOA, PSO, IGSA, VNS and SA on all the instances

Conclusion

To efficiently operate and manage the remanufacturing systems and reach the global optimization in the disassembly process, this work addresses a stochastic profit-oriented multi-product disassembly line balancing problem with consideration of disassembly time limitation. Next, according to the characteristics of this problem, a chance-constrained programming model is formulated to describe it mathematically. To solve it, this work develops an intelligent optimization algorithm named as enhanced group teaching optimization algorithm. Finally, the effectiveness of the proposed method is validated by conducting simulation experiments on some real-life cases and comparing it with five competitive approaches in literature, i.e., group teaching optimization algorithm, particle swarm optimization algorithm, improved gravitational search algorithm, variable neighborhood search algorithm and simulated annealing algorithm. The results can help decision-makers make wise decisions in stochastic environments when there are multiple EOL products that need to be dismantled together on a disassembly line.

In the future research, we plan to explore the following topics: (1) to formulate and tackle a new stochastic multi-product disassembly line balancing problem considering energy-aware optimization; and (2) to develop more efficient approaches to handle the concerned problem with high complexity and stochastic features.