A novel tri-stage with reward-switching mechanism for constrained multiobjective optimization problems

Qu, Jiqing; Li, Xuefeng; Xiao, Hui

doi:10.1007/s40747-024-01379-2

A novel tri-stage with reward-switching mechanism for constrained multiobjective optimization problems

Original Article
Open access
Published: 30 March 2024

Volume 10, pages 4625–4655, (2024)
Cite this article

Download PDF

You have full access to this open access article

Complex & Intelligent Systems Aims and scope Submit manuscript

A novel tri-stage with reward-switching mechanism for constrained multiobjective optimization problems

Download PDF

Jiqing Qu¹,
Xuefeng Li^1,2 &
Hui Xiao¹

264 Accesses
Explore all metrics

Abstract

The effective exploitation of infeasible solutions plays a crucial role in addressing constrained multiobjective optimization problems (CMOPs). However, existing constrained multiobjective optimization evolutionary algorithms (CMOEAs) encounter challenges in effectively balancing objective optimization and constraint satisfaction, particularly when tackling problems with complex infeasible regions. Subsequent to the prior exploration, this paper proposes a novel tri-stage with reward-switching mechanism framework (TSRSM), including the push, pull, and repush stages. Each stage consists of two coevolutionary populations, namely ${\text {Pop}}_1$ and ${\text {Pop}}_2$. Throughout the three stages, ${\text {Pop}}_1$ is tasked with converging to the constrained Pareto front (CPF). However, ${\text {Pop}}_2$ is assigned with distinct tasks: (i) converging to the unconstrained Pareto front (UPF) in the push stage; (ii) utilizing constraint relaxation technique to discover the CPF in the pull stage; and (iii) revisiting the search for the UPF through knowledge transfer in the repush stage. Additionally, a novel reward-switching mechanism (RSM) is employed to transition between different stages, considering the extent of changes in the convergence and diversity of populations. Finally, the experimental results on three benchmark test sets and 30 real-world CMOPs demonstrate that TSRSM achieves competitive performance when compared with nine state-of-the-art CMOEAs. The source code is available at https://github.com/Qu-jq/TSRSM.

A dynamic resource allocation strategy for collaborative constrained multi-objective optimization algorithm

Article 16 August 2022

A constrained multi-objective optimization algorithm with two cooperative populations

Article 08 February 2022

Dynamic grid-based uniform search for solving constrained multiobjective optimization problems

Article 13 November 2021

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

Many real-world problems belong to constrained multiobjective optimization problems (CMOPs), which have conflicting objectives subject to various constraints [1,2,3,4]. The general CMOP can be expressed as follows:

$$\begin{aligned} \text {min}\; {\textbf{F}}({\textbf{x}})=(f_1({\textbf{x}}),f_2({\textbf{x}}),\dots ,f_M({\textbf{x}})) \end{aligned}$$

(1)

s.t.

$$\begin{aligned} {\left\{ \begin{array}{ll} &{}\textrm{g}_i({\textbf{x}})\le 0,i=1,\dots ,s \\ &{}h_i({\textbf{x}})=0,i=s+1,\dots ,t\\ &{}{\textbf{x}}\in \Omega , \end{array}\right. } \end{aligned}$$

(2)

where ${\textbf{F}}({\textbf{x}})$ is composed of M conflicting objective functions; ${\textbf{x}}=(x_1,x_2,\dots ,x_D)$ is a solution with D dimensions; $\Omega \in {\mathbb {R}}^n$ denotes the decision space; $\textrm{g}_i({\textbf{x}})$ and $h_i({\textbf{x}})$ indicate s inequality constraints and $t-s$ equality constraints, respectively; and t is the number of constraints.

To express the degree of the ith constraint violation (denoted as $CV_i({\textbf{x}})$) of ${\textbf{x}}$ at the ith constraint, the following formulation is used:

$$\begin{aligned} CV_i({\textbf{x}})={\left\{ \begin{array}{ll} &{} \text {max}(0,\textrm{g}_i({\textbf{x}})),\; i=1,\dots ,s \\ &{} \text {max}(0,\left| h_i({\textbf{x}}) \right| -\delta ),\; i=s+1,\dots ,t, \end{array}\right. } \end{aligned}$$

(3)

where $\delta $ is a very small positive constraint boundary relaxation parameter (e.g., 1e–4), which turns $h_i({\textbf{x}})$ into inequality constraints. The overall constraint violation value of ${\textbf{x}}$ (denoted as CV) is formulated as:

$$\begin{aligned} CV({\textbf{x}})=\sum _{i=1}^{t} CV_i({\textbf{x}}), \end{aligned}$$

(4)

when $CV({\textbf{x}})=0$ means that the decision variable ${\textbf{x}}$ is a feasible solution. Otherwise, it is an infeasible solution. Given ${\textbf{x}}_1$ and ${\textbf{x}}_2$ are all feasible, if ${\textbf{F}}({\textbf{x}}_1)$ is not worse than ${\textbf{F}}({\textbf{x}}_2)$ and it at least has one better objective, ${\textbf{x}}_1$ is said to dominate ${\textbf{x}}_2$ $({\textbf{x}}_1 \prec {\textbf{x}}_2)$. A solution is deemed Pareto-optimal if no other feasible solution dominates it. CMOPs aim to find a set of Pareto-optimal solutions that satisfy various constraints. In the decision space, the set of all feasible Pareto-optimal solutions is the Pareto-optimal set (PS). The mapping of the PS onto the objective space forms the constrained Pareto front (CPF). Similarly, when addressing unconstrained multiobjective optimization problems (MOPs), the unconstrained Pareto front (UPF) is ultimately desired [5].

In contrast to unconstrained MOPs, CMOPs pose a greater challenge in simultaneously managing conflicting objectives and constraints [6, 7]. Many constrained multiobjective evolutionary algorithms (CMOEAs) have been developed to address this issue by employing diverse constraint-handling techniques (CHTs). The current CHTs can be divided into five categories: (1) penalty function methods [8, 9]; (2) separation of constraints and objectives [10, 11]; (3) multiobjective methods [12, 13]; (4) hybrid methods [14, 15]; and (5) multi-stages and multi-populations (MSMP) [16, 17]. Although those methods employed in the state-of-the-art CMOEAs have demonstrated high performance on certain CMOPs, they still have limitations when it comes to solving problems with complex infeasible regions and small discrete feasible regions. Unfortunately, many real-world problems exhibit such characteristics, such as the problem of synchronous optimal pulse-width modulation of 3-level inverters [18], which pose challenges to the existing CMOEAs. More specifically, the penalty function methods and separation of constraints and objectives require careful tuning of related parameters. Designing an additional objective becomes challenging in multiobjective methods. Hybrid methods demand differentiability of the problem [19]. MSMP-based CMOEAs overcome the challenges of directly solving CMOPs by leveraging infeasible solutions to extract valuable information, which facilitates the collaboration between populations and stages [20]. However, they encounter difficulties in effectively leveraging infeasible solutions.

Inspired by the success of MSMP, a novel tri-stage with reward-switching mechanism framework (TSRSM) is proposed for CMOPs. The three stages of TSRSM employ distinct strategies to leverage infeasible solutions. The novel features of TSRSM are as follows:

1.
The proposed TSRSM framework is comprised of three stages: the push stage, the pull stage, and the repush stage. Each stage employs two cooperative populations, namely ${\text {Pop}}_1$ and ${\text {Pop}}_2$. The role of ${\text {Pop}}_2$ varies across different stages. In the push stage, ${\text {Pop}}_2$ aims to converge to the UPF and guide ${\text {Pop}}_1$ to pass through infeasible regions. Subsequently, ${\text {Pop}}_2$ employs the constraint relaxation technique to enhance feasibility in the pull stage. Finally, ${\text {Pop}}_2$ reconvenes with the UPF using knowledge transfer and shares its unique insights to inform and guide ${\text {Pop}}_1$ in the repush stage. The novel characteristic of this approach is that the ${\text {Pop}}_2$ alternates between the UPF and CPF, resulting in greater effectiveness compared to the single-direction movement of the auxiliary population in existing CMOEAs (e.g., CCMO), as evidenced by experimental results.
2.
A novel reward-switching mechanism (RSM) is devised to decide when to switch stages by evaluating the convergence and diversity levels exhibited by the population. One distinct characteristic of this approach is that RSM takes into account the convergence and diversity of the population simultaneously, making it a more accurate method to switch stages compared to other switching mechanisms.

To demonstrate the performance of TSRSM, 9 state-of-the-art CMOEAs were selected for comparison on three benchmark test sets and 30 real-world CMOPs [21]. The results reveal that the proposed method achieves superiority over other CMOEAs on both benchmark problems and real-world CMOPs. Additionally, TSRSM obtains the best performance on 10 real-world problems, including the synchronous optimal pulse-width modulation of 3-level inverters problem, the multi-product batch plant problem [22], the heat exchanger network design problem [23], and others. This achievement represents the highest number of best results compared to other CMOPs.

It should be noted that while many research works are based on multi-stage approaches [24, 25], they primarily focus on utilizing different tasks in each stage, rather than emphasizing the optimization problems themselves. In the TSRSM, the current stage can continue to evolve based on its performance in the problem at hand. This means that if the pull stage performs well in the problem, there may be no need for a repush stage.

The remainder of this paper is organized as follows. In “Related works and motivation”, we review the existing MSMP and explain our motivations. In “Proposed method”, the proposed TSRSM is introduced. “Experimental results” shows the experimental setting and results. Finally, “Conclusions and future work” presents the conclusions and future work.

Related works and motivation

This section provides an overview of the existing CMOEAs that are relevant to the field of MSMP, as this paper specifically emphasizes the multi-stage framework.

CMOEAs based on two stages and two populations

This part of the method tries to balance objectives and constraints by two stages and two populations.

As for two-population algorithms, one representative is CCMO [26], which is a coevolutionary framework featuring two weak cooperative populations. One population is exclusively dedicated to solving the original CMOPs with the specific objective of finding the CPF. In contrast, the other population focuses its efforts on discovering the UPF. Another two-population algorithm called cDPEA [27], in which one population is designed to preserve competitive infeasible solutions, and the other population adopts a feasibility-oriented approach to handle infeasible solutions. Furthermore, a novel adaptive fitness function was implemented to regulate the trade-off between convergence and diversity.

As for two-stage algorithms, one representative is PPS-MOEA/D [28], which introduced a push–pull searching strategy. The push stage is mainly focused on directing the population toward the UPF, while the pull stage is responsible for attracting the population toward the CPF. The switching mechanism employs the gradient of the maximum of the nadir and the minimum of ideal points. DD-CMOEA [5] employed this switching mechanism in the exploration and exploitation stage with two populations. The primary objective of the exploration stage is to search for informative infeasible solutions. In contrast, the exploitation stage leverages infeasible solutions to explore nearby feasible solutions. CMOEA-MS [29] is another two-stage algorithm that consists of a first stage for identifying feasible regions and a second stage for spreading along feasible boundaries. Moreover, CMOEA-MS used fitness evaluation strategies to adaptively balance objectives and constraints in two stages. Another strategy was developed by TSTI [30], which employed different emphases on the three indicators (namely convergence, diversity, and feasibility) in two stages. The first stage is to obtain solutions with good distribution and to prevent the population from falling into local optima. The second stage is to quickly converge to the CPF. DATEA [31] used weak coevolution of the dual population to consider constraints in the first stage. Then, a feasibility-oriented approach is employed to guide a single population in spreading across the feasible regions discovered in the first stage. URCMO [32] utilized the knowledge learned from the learning stage about the relationship between the UPF and the CPF to guide the evolving strategies in the evolving stage.

Inspired by the success of evolutionary multitask (EMT) in other fields, such as high-dimensional classification feature selection problems, some researchers have attempted to develop EMT to solve CMOPs. Qiao et al. [33] first introduced EMT [34] into CMOEAs (named EMCMO), which includes two tasks: the first task is designed to solve the original CMOP, and the other is for the unconstrained MOP. Furthermore, a transfer strategy was devised to determine whether to transfer parent or offspring sets into the environmental selection. A novel EMT, named MTCMO [35], was subsequently developed, which employs a dynamic auxiliary task and leverages an improved $\epsilon $-constraint method to effectively tackle complex CMOPs. Furthermore, a tri-task framework known as CMOEMT [36] was introduced. Three tasks are designed for the original CMOP, the unconstrained MOP, and the relaxed CMOP, respectively. The evolutionary process can be broken down into two distinct stages: the evolving stage and the transfer stage. During the evolving stage, three specific tasks evolve independently. Conversely, the transfer stage effectively transmits relevant information among the three tasks.

CMOEAs based on three stages and three populations

This part of the method attempts to balance objectives and constraints in a more granular way.

TriP [37] is a representative three-population algorithm. Two populations evolve using a weak coevolutionary framework to handle the original CMOP and the unconstrained MOP separately, while the third population independently addresses the relaxed CMOP. Three populations of TriP and CMOEMT have the same purpose, but the way they exchange information with each other is different. C3M [24] is one representative three-stage algorithm. In the early stage, setting aside the typical consideration of feasibility to enable a more thorough exploration of the objective space. At the medium stage, the algorithm focuses on individual constraints, selecting those of the highest priority to explore the objective space further. In the last stage of the algorithm, feasibility is fully accounted for to enhance the quality of solutions achieved in the previous two stages. Another three-stage algorithm, TSCSO [25], introduced a tri-stage competitive swarm optimizer. The first stage focuses on achieving global convergence to the UPF, the second stage aims to enhance the diversity of the population and explore more feasible regions, and the third stage is utilized to search for the feasible regions omitted in the previous stage.

Motivations

The aforementioned studies share a common objective of addressing CMOPs by utilizing distinct populations or stages to handle the CMOP, unconstrained MOP, and relaxed CMOP. This is because MSMP-based evolutionary algorithms can circumvent the challenges encountered in directly solving the original CMOPs by employing a well-designed staged approach [19]. First, the stage of solving the unconstrained MOP helps to find promising solutions by ignoring constraints. Second, the stage of solving the relaxed CMOP is beneficial to expanding feasible solutions. Third, the stage of solving the original CMOP allows the population to converge further to the CPF. However, the inadequate weight they possess in the evolutionary process can lead to challenges in solving specific problems. For example, if the algorithm neglects the utilization of the relaxed CMOP, it struggles to solve problems characterized by large infeasible regions and small discrete feasible regions. Hence, designing an effective framework and switching mechanism is crucial for achieving optimal results.

The existing frameworks still have some weaknesses. CCMO utilized a coevolutionary framework with two weak cooperative populations, which search for the CPF and UPF, respectively. However, searching for the UPF in the later stages results in significant resource wastage, which means that it may be ineffective to search for the CPF when the UPF and CPF are located far apart. TriP used the tri-population-based coevolutionary framework, which solves the CMOP, the unconstrained MOP, and the relaxed CMOP, respectively. The third population uses the $\epsilon $-constrained technique in PPS-MOEA/D, easily falling into the local optimum, such as MW1, MW2, and MW10 [38]. CMOEMT encounters a similar challenge to Trip as it also utilizes the $\epsilon $-constrained technique independently during the initial stage. EMCMO employed an EMT framework with knowledge transfer, which has been demonstrated to achieve high performance on MW [38] problems. MTCMO improved the EMT framework and knowledge transfer, which also has high performance on MW. However, both EMCMO and MTCMO have limitations in dealing with problems that have large infeasible regions, such as LIR-CMOP [39], because they primarily prioritize feasibility.

Switching mechanisms are designed to achieve a balance among the distinct stages that serve different tasks. However, the existing switching mechanisms still encounter difficulties due to their inaccurate judgment of CMOPs with diverse characteristics as illustrated in Table 1. Consequently, the efficiency and versatility of these existing switching mechanisms remain insufficient.

Table 1 Switching mechanism of existing algorithm

A novel tri-stage with reward-switching mechanism for constrained multiobjective optimization problems

Abstract

Similar content being viewed by others

A dynamic resource allocation strategy for collaborative constrained multi-objective optimization algorithm

A constrained multi-objective optimization algorithm with two cooperative populations

Dynamic grid-based uniform search for solving constrained multiobjective optimization problems

Introduction

Related works and motivation

CMOEAs based on two stages and two populations

CMOEAs based on three stages and three populations

Motivations

Proposed method

The procedure of TSRSM

Reward-switching mechanism

The push stage

The pull stage

The repush stage

Computational complexity

Experimental results

Experimental settings

Test functions

Compared algorithms

Performance indicators

Comparison with peer algorithms

Comparison on real-world CMOPs

Discussions about TSRSM

Investigation into the search behavior

Investigation into the main strategies

Parameter analysis of TSRSM

Conclusions and future work

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Appendices

Appendix A Table

Appendix B

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation