1. Introduction
Robot systems, which accomplish desired tasks through close collaboration between one or more robots and humans, have become indispensable to the development of agriculture, industry, and the service sector. Within this trend, robot path planning has emerged as one of the most crucial issues in robot systems. Multi-robot path planning is an NP-hard problem in which several robots sharing a working environment must move from their initial positions to their respective destinations at minimum cost while avoiding collisions with other robots and with obstacles along the way. The problem involves multiple constraints, particularly in dynamically changing environments. Previous work has proposed several solutions for collision avoidance and coordination among multiple robots [Reference Olfati-Saber1]. Classical methods for robot path planning include potential field techniques [Reference Tournassoud2], bounding box representations [Reference Simeon, Leroy and Lauumond3], breadth-first search and depth-first search [Reference Žunić, Djedović and Žunić4], the A* algorithm [Reference Fu, Chen, Zhou, Zheng, Wei, Dai and Pan5], and rapidly exploring random tree methods [Reference Zhang, Xiong, Li, Du and Zhao6]. However, these methods often fail to handle multi-robot path planning effectively as the number of robots or obstacles grows, so current research relies mainly on artificial neural networks and metaheuristic algorithms. Metaheuristic algorithms mimic biological behaviors or natural phenomena as heuristics and fall into four main categories. The first comprises swarm intelligence-based metaheuristics, exemplified by particle swarm optimization (PSO) [Reference Kennedy and Russell7], which simulates the foraging behavior of bird flocks, and ant colony optimization (ACO) [Reference Dorigo, Mauro and Thomas8], which mimics the way foraging ants deposit pheromones along their paths and then follow routes with high pheromone concentration; a similar algorithm is the artificial bee colony (ABC) [Reference Karaboga and Bahriye9]. The second comprises evolution-based metaheuristics, such as the genetic algorithm (GA) [Reference Holland10] and differential evolution (DE) [Reference Qin, Huang and Suganthan11], which emulate biological evolution by preserving the fittest individuals over generations. The third is grounded in chemical and physical phenomena, with representative algorithms including the gravitational search algorithm (GSA) [Reference Rashedi, Nezamabadi-Pour and Saryazdi12] and the Archimedes optimization algorithm (AOA) [Reference Hashim, Hussain, Houssein, Mabrouk and Al-Atabany13]. The fourth is inspired by human social behaviors and draws on various aspects of daily life; examples include lungs performance-based optimization (LPO) [Reference Ghasemi, Zare, Zahedi, Trojovský, Abualigah and Trojovská14], which simulates the function of the human lungs, and the football team training algorithm (FTTA) [Reference Tian and Mei15], which mimics the training process of a football team. However, because their update strategies are homogeneous, these algorithms tend to fall into local optima, and their relatively uniform parameter settings make them ill-suited to diverse problems.
This article studies the path planning problem using metaheuristic algorithms. According to the No Free Lunch theorem, no single algorithm can solve all problems optimally, so researchers often integrate multiple strategies and algorithms to improve the performance of path planning algorithms. Geng et al. [Reference Geng, Sun, Wang, Bu, Liu, Li and Zhao16] combined the sparrow search algorithm (SSA) with a reverse learning strategy to solve a robot path planning problem on a grid map. Parhi et al. [Reference Parhi17] merged the classical linear regression (LR) method with GSA to form RGSA, incorporating multiple chaos strategies to obtain optimal paths in multi-humanoid (Nao robot) movement problems. Li et al. [Reference Li, Zhao, Chen, Xiong and Liu18] proposed a hybrid algorithm based on an improved GA and the dynamic window approach for mobile robot path planning. Xu et al. [Reference Xu, Maoyong and Baoye19] combined a new fourth-order Bezier transition curve with an improved PSO algorithm, proposing a novel method for smooth path planning of mobile robots. Nazarahari et al. [Reference Nazarahari, Esmaeel and Samira20] proposed an innovative artificial potential field (APF) algorithm to find all feasible paths between start and end points in a discrete grid environment and developed an enhanced GA to refine the initial paths and find the optimal path in robot path planning problems. Zhang et al. [Reference Zhang, Xu, Zhan and Han21] proposed a hybrid of GA and the firefly algorithm to enhance the responsiveness and computational capability of mobile robots during movement. Dai et al. [Reference Dai, Long, Zhang and Gong22] introduced an improved ACO that exploits characteristics of the A* algorithm and the MAX-MIN ant system to achieve efficient search for mobile robot path planning in complex maps. Chen et al. [Reference Chen and Jie23] addressed path planning for mobile robots in known environments, proposing a grid-based hybrid APF and ACO path planning method. Zhang et al. [Reference Zhang, Ning, Li, l. Pan and Zhang24] proposed a turning point-based grey wolf optimizer (GWO) for the path planning of patrol robots; the algorithm uses roulette wheel selection and crossover mutation to broaden the search scope of the initial population, and its convergence factor function varies with the number of obstacles, further enhancing performance. Liu et al. [Reference Liu, Liu and Xiao25] integrated the A* algorithm with an improved GWO, proposing A*-IGWO for parking lot path planning; the algorithm constructs a minimum cost equation using the A* algorithm on top of the population updating mechanism of the IGWO, fully leveraging the advantages of both. Dai et al. [Reference Dai, Li, Chen, Nie, Rui and Zhang26] proposed an improved sparrow search algorithm (ISSA) for the path planning of cellular robots on large-scale three-dimensional trusses. Julius et al. [Reference Fusic and Sitharthan27] proposed an improved self-adaptive learning PSO (ISALPSO) algorithm for mobile robot path planning on 2D lidar maps; the algorithm fine-tunes the lidar information using a binary occupancy grid method during the robot's movement, allowing ISALPSO to adjust its parameters adaptively.
The coati optimization algorithm (COA) [Reference Dehghani, Montazeri, Trojovská and Trojovský28] is an efficient and novel metaheuristic algorithm proposed by Dehghani et al. in 2022 and has been widely applied to global optimization problems. However, owing to issues with its inherent mechanisms, it tends to suffer premature convergence on high-dimensional problems. Researchers have proposed several improvements to COA. Hashim et al. [Reference Hashim, Houssein, Mostafa, Hussien and Helmy29] introduced an adaptive mutation strategy for COA to solve feature extraction and global optimization problems. Baş et al. [Reference Baş and Gülnur30] proposed an enhanced COA for large-scale, high-dimensional big data optimization problems (BOP). Yildizdan et al. [Reference Baş and Gülnur31] used a transfer function to convert the continuous COA into a binary optimization algorithm (BinCOA) for the Knapsack Problem (KP) and the Uncapacitated Facility Location Problem (UFLP). Hasanien et al. [Reference Hasanien, Alsaleh, Alassaf and Alateeq32] introduced a new improved COA to find the optimal solution of the probabilistic optimal power flow (POPF) problem. Jia et al. [Reference Jia, Shi, Wu, Rao, Zhang and Abualigah33] proposed an improved COA based on a sound search envelope strategy to solve six engineering application problems. Although DE and COA have a wide range of applications and distinct advantages, both exhibit poor solution quality on high-dimensional and multimodal problems and are therefore susceptible to local optima. To overcome the limitations of COA, this paper combines cross-update strategies from DE algorithm variants with the original COA to propose a new hybrid algorithm, the self-adaptive differential evolution-based coati optimization algorithm (SDECOA). When faced with different problems, SDECOA adaptively adjusts the usage frequency of its update strategies, fully exploiting the update mechanisms of both algorithms to enhance global and local search capabilities.
The contributions of this paper are summarized as follows:
1. Combining update strategies from DE algorithm variants with COA, an SDECOA is proposed, enhancing COA's global and local search capabilities.
2. The crossover probability CR in the DE strategy adapts dynamically with the algorithm's iterations, increasing convergence speed.
3. The proposed algorithm successfully solves 48 real-world constrained optimization problems from various fields and achieves superior results compared to the top algorithms in the CEC2020 competition.
4. The multi-robot path planning problem is NP-hard, and the presence of multiple moving obstacles further complicates it. The proposed algorithm effectively addresses this problem.
The organization of this paper is as follows: Section 2 discusses the relevant definitions of DE, COA, real-world constrained optimization problems, and the online multi-robot path planning problem. Section 3 provides a detailed explanation of SDECOA. Section 4 presents the experimental results of this study. Section 5 summarizes the paper and outlines future research directions.
2. Preliminaries
In this section, the COA and the DE algorithm are introduced separately, along with the definitions of real-world constrained optimization problems and of the multi-robot path planning problem.
2.1. Coati optimization algorithm (COA)
This section describes the mathematical model of the original COA. The COA is a novel swarm intelligence algorithm proposed in 2022, which primarily simulates the behavior of coatis hunting iguanas. Initially, half of the population will climb trees to approach their food source, the iguana, and this behavior is defined by Eq. (1):
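The equation itself is not reproduced in this text; as a sketch of the standard COA formulation of this climbing phase (following the presentation of Dehghani et al. [Reference Dehghani, Montazeri, Trojovská and Trojovský28], with the range over half the population taken from the description above), Eq. (1) has the form

$$x_{i,j}^{new}=x_{i,j}+r_{1}\cdot \left(Ig_{j}-I\cdot x_{i,j}\right),\qquad i=1,\ldots ,\left\lfloor N/2\right\rfloor$$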
In the above equation, $x_{i,j}$ represents the position of the i-th individual in the j-th dimension of the solution space, $Ig$ denotes the position of the best solution in the current population, $N$ represents the population size, $r_1$ is a random number in [0, 1], and $I$ takes a random value of either 1 or 2.
Faced with the siege of coatis, the iguana will randomly drop to the ground. Eq. (2) expresses this behavior in the solution space. Meanwhile, the other half of the coatis will search for their prey, as represented by Eq. (3):
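Eqs. (2) and (3) are not reproduced here; a sketch consistent with the surrounding definitions and the standard COA formulation is

$$Ig_{j}^{G}=lb_{j}+r_{2}\cdot \left(ub_{j}-lb_{j}\right)$$

$$x_{i,j}^{new}=\begin{cases}x_{i,j}+r_{3}\cdot \left(Ig_{j}^{G}-I\cdot x_{i,j}\right), & F_{Ig}^{G}<F_{i}\\ x_{i,j}+r_{3}\cdot \left(x_{i,j}-Ig_{j}^{G}\right), & \text{otherwise}\end{cases}$$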
where $Ig_{j}^{G}$ represents the random dropping position of the iguana, with $lb$ and $ub$ denoting the lower and upper bounds of the solution space, respectively. $r_2$ and $r_3$ are both random numbers in [0, 1]. In Eq. (3), $F_{Ig}^{G}$ represents the fitness value of the dropped iguana position $Ig^{G}$ and $F_i$ denotes the current fitness value of the i-th individual. Subsequently, if the fitness value $Fnew_i$ of the newly generated individual is superior to that of the current individual, the new position replaces the current one; otherwise, the original position is retained, as expressed in Eq. (4):
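A sketch of this greedy selection rule, written directly from the description above, is

$$x_{i}=\begin{cases}x_{i}^{new}, & Fnew_{i}<F_{i}\\ x_{i}, & \text{otherwise}\end{cases}$$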
After locating the iguana, all the coatis will slowly surround their prey. This biological characteristic is represented by equations (5) and (6):
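Eqs. (5) and (6) are not reproduced here; a sketch of the commonly cited COA exploitation update, using the iteration-shrinking local bounds implied by the text (the exact form in the original should be consulted), is

$$lb_{j}^{t}=\frac{lb_{j}}{t},\qquad ub_{j}^{t}=\frac{ub_{j}}{t}$$

$$x_{i,j}^{new}=x_{i,j}+\left(1-2r_{4}\right)\cdot \left(lb_{j}^{t}+r_{5}\cdot \left(ub_{j}^{t}-lb_{j}^{t}\right)\right)$$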
where t denotes the current iteration number, T represents the maximum iteration number, and $r_4$ and $r_5$ are both random numbers in [0, 1]. Subsequently, the new position of the current individual is again accepted or rejected according to Eq. (4). The pseudocode of the basic framework of the COA is given in Algorithm 1.
2.2. Differential evolution (DE)
This section describes the mathematical model of the original DE. DE is a well-known classic metaheuristic algorithm consisting mainly of three evolutionary processes: mutation, crossover, and selection.
2.2.1. Mutation
In DE, the mutation process randomly selects two individuals and uses the difference of their position vectors as the step size for updating the target vector's position, as represented by Eq. (7):
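The equation is not reproduced here; written directly from this description, the mutation takes the form

$$y_{i,j}=x_{i,j}+F\cdot \left(x_{R1,j}-x_{R2,j}\right)$$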
In the above equation, $x_i$ represents the i-th individual in the population, $x_{R1}$ and $x_{R2}$ are two different individuals randomly selected from the population, j denotes the j-th dimension of the solution, and F is the scaling factor.
2.2.2. Crossover
The crossover process exchanges the j-th dimension component between the mutated individual $y_i$ and the target vector $x_i$, generating a trial individual $z_i$, as represented by Eq. (8):
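Written from this description (the original Eq. (8) is not reproduced here), the binomial crossover is

$$z_{i,j}=\begin{cases}y_{i,j}, & r\leq CR\ \text{or}\ j=j_{r}\\ x_{i,j}, & \text{otherwise}\end{cases}$$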
where CR denotes the crossover probability, $j_r$ represents a randomly selected integer between 1 and the maximum dimension, and r is a random number in [0, 1].
2.2.3. Selection
Similar to Eq. (4), the fitness values of the target vector and the trial individual are compared, and the better of the two is chosen as the position vector for the next generation, achieving the goal of evolution:
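A sketch of this selection rule, written from the description and assuming minimization, is

$$x_{i}^{t+1}=\begin{cases}z_{i}, & F_{z_{i}}\leq F_{i}\\ x_{i}, & \text{otherwise}\end{cases}$$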
where $F_{z_{i}}$ represents the fitness value of the trial individual and $F_i$ represents the fitness value of the target vector.
2.3. Real-world constrained optimization problems
This section introduces the basic definition of constrained optimization problems, which are typically minimization problems subject to several equality and inequality constraints. The real-world constrained optimization problems in this paper are all drawn from CEC2020 and cover fields such as industrial chemical processes, process synthesis and design, mechanical engineering, power systems, power electronics, and livestock feed ration optimization. The definitions of these problems are taken from the literature [Reference Kumar, Wu, Ali, Mallipeddi, Suganthan and Das34], and all conform to the following formulation:
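The original equation is not reproduced here; as a sketch of the standard form implied by this description:

$$\min\ f(X)\quad \text{subject to}\quad g_{i}(X)\leq 0,\ i=1,\ldots ,n;\qquad h_{j}(X)=0,\ j=n+1,\ldots ,m$$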
In the above equation, X denotes the solution vector of the problem, g represents inequality constraints, n denotes the number of g, h represents equality constraints, and m signifies the total number of all constraints.
2.4. Multi-robot online path planning problem
This article studies the multi-robot path planning problem with multiple static and dynamic obstacles; it is therefore necessary to consider the safe distances between robots, between robots and static obstacles, and between robots and dynamic obstacles. The mathematical model of this problem is given in the literature [Reference Akay and Mustafa35], where each robot has its own starting point and destination. During the robots' movement, there are static and dynamic obstacles of various sizes and shapes. All robots not only need to avoid these obstacles but also must not collide with each other. Each robot must reach its destination, step by step, at minimum cost.
Minimize,
where $F_1$ represents the shortest-distance objective, $F_2$ represents avoidance of static obstacles, $F_3$ represents avoidance of dynamic obstacles, and $F_4$ represents avoidance of other robots. $F_1$ can be expressed as Eq. (12):
where
where NR represents the number of robots and ($x_{i}^{g}, y_{i}^{g}$) represents the destination coordinates of the i-th robot. $F_2$ can be expressed as Eq. (15):
where $\varepsilon$ is a large penalty value, $d_s$ represents the safe distance between any two objects, and $d_{i}^{s}$ is the sum of distances from all static obstacles to the i-th robot, as derived from Eq. (16):
where NS is the number of static obstacles and ($x_{j}^{s}, y_{j}^{s}$) is the coordinate position of the j-th static obstacle. $F_3$ can be expressed as Eq. (17):
where $d_{i}^{D}$ is the sum of distances from all dynamic obstacles to the i-th robot, as obtained from Eq. (18):
where ND is the number of dynamic obstacles, ( $x_{j}^{D}, y_{j}^{D}$ ) is the coordinate position of the j-th dynamic obstacle, and the movement of dynamic obstacles at each step is determined by equations (19) and (20):
In the above equations, $v_{j}^{D}$ represents the movement speed of dynamic obstacle j and $\alpha_{j}$ represents its direction angle relative to its target position. $F_4$ can be expressed as Eq. (21):
where $d_{i}^{R}$ is the sum of distances from all other robots to the i-th robot and ($x_{j}^{R}, y_{j}^{R}$) is the coordinate position of the j-th robot, as obtained from Eq. (22):
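Since Eqs. (11)–(22) are not reproduced in this text, the following minimal Python sketch illustrates the penalty-based fitness structure described above for a single robot's candidate next position. The function name, the penalty value eps, and the safety distance d_s are illustrative assumptions; the exact formulas follow the model in [Reference Akay and Mustafa35].

```python
import numpy as np

def step_fitness(pos, goal, other_robots, static_obs, dynamic_obs, d_s=1.0, eps=1e6):
    """Evaluate a candidate next position `pos` (2D point) for one robot.
    F1 rewards progress toward the goal; F2-F4 add a large penalty `eps`
    whenever the safety distance `d_s` to an obstacle or robot is violated."""
    f1 = np.linalg.norm(pos - goal)                                         # distance to destination
    f2 = sum(eps for o in static_obs if np.linalg.norm(pos - o) < d_s)     # static obstacles
    f3 = sum(eps for o in dynamic_obs if np.linalg.norm(pos - o) < d_s)    # dynamic obstacles
    f4 = sum(eps for r in other_robots if np.linalg.norm(pos - r) < d_s)   # other robots
    return f1 + f2 + f3 + f4
```

In the online setting, this fitness is minimized at every time step for every robot to choose its next waypoint, with the positions of dynamic obstacles and the other robots updated between steps.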
3. The proposed SDECOA algorithm
This section elaborates the mathematical definition of SDECOA. The original COA exhibits an imbalance between global and local search capability. To overcome this deficiency, we propose SDECOA. For optimization problems with few extreme points, a mutation operator with stronger local search capability should be used; for optimization problems with many extreme points, a mutation operator with stronger combined global and local search capability should be employed. A single mutation operator can hardly balance global and local search. In SDECOA, update strategies from two variants of DE are introduced, and the strategies are invoked via a roulette wheel method. Initially, all strategies have the same probability of being used; during the iterations, strategies that yield better results are, through learning, given higher usage probabilities. Learning is score-based: a strategy earns a point whenever it produces an improved individual. As a result, satisfactory results can be achieved on different problems, overcoming the imbalance between local and global search in COA.
3.1. Crossover
In traditional DE algorithms, the crossover probability CR is a fixed value, with the literature [Reference Draa, Samira and Imene36] suggesting values of CR in [0, 1]. When CR approaches zero, individuals change little between adjacent generations, so the population contains many similar individuals and the algorithm may become stuck in local optima. Conversely, when CR is close to 1, individual positions fluctuate greatly between updates, hindering efficient convergence. To address these issues, this paper proposes an adaptive dynamic adjustment method, calculated as follows:
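The exact form of Eq. (23) is not reproduced here; one plausible linearly decreasing schedule consistent with the stated bounds (an assumption, not necessarily the authors' exact formula) is

$$CR(t)=CR_{\max }-\left(CR_{\max }-CR_{\min }\right)\cdot \frac{t}{T}$$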
where $CR_{\max }$ equals 0.95, $CR_{\min }$ equals 0.05, t represents the current iteration number, and T is the maximum number of iterations.
3.2. Variant update strategies of DE
The first introduced update strategy (applied when rand < CR) was proposed in the literature [Reference Das and Ponnuthurai37], while its other part is inspired by the GWO [Reference Mirjalili, Mirjalili and Lewis38]; it is represented by Eq. (24):
In the above equation, $x_{best}$, $x_{second}$, and $x_{third}$ represent the current population's top three individuals, FF is the scaling factor with a value of 0.8, and R1 and R2 are randomly selected individuals that are different from the i-th individual.
The second introduced update strategy was proposed in literature [Reference Wang, Zixing and Qingfu39] and is represented by Eq. (25):
where L is a random number between [0, 1], and R1, R2, and R3 are also three distinct random individuals.
3.3. Algorithm implementation
In the proposed algorithm, during the exploration phase, Eqs. (1), (24), and (25) are selected at random, with their probabilities determined by Eq. (26). During this process, a counter records which update strategy each individual used. If an individual achieves a better result in the current iteration, the score of the strategy it used is incremented. The probabilities for the next roulette wheel selection are then calculated from the total score obtained by each strategy in this iteration:
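The original Eq. (26) is not reproduced here; a normalization consistent with the description below is

$$p_{j}(t+1)=\frac{S_{j}(t)}{\sum _{i=1}^{SN}S_{i}(t)}$$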
where $p_{j}(t+1)$ represents the probability of strategy j being used in the next iteration, $S_{i}(t)$ represents the score of strategy i in the current iteration, and SN represents the number of strategies, which is set to 3 in this paper. The update strategy for each individual is determined by a roulette wheel selection. The pseudocode and flow chart of SDECOA are, respectively, shown in Algorithm 1 and Figure 1.
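A minimal Python sketch of this score-based roulette-wheel strategy selection follows; the function and variable names are illustrative, not taken from the paper.

```python
import numpy as np

def update_probabilities(scores):
    """Convert per-strategy scores from the current iteration into
    selection probabilities for the next iteration (cf. Eq. (26))."""
    scores = np.asarray(scores, dtype=float)
    total = scores.sum()
    if total == 0:                     # no strategy improved any individual: stay uniform
        return np.full(scores.size, 1.0 / scores.size)
    return scores / total

def roulette_select(probs, rng=None):
    """Pick a strategy index with probability proportional to probs."""
    if rng is None:
        rng = np.random.default_rng()
    return int(rng.choice(probs.size, p=probs))

# Example with SN = 3 strategies: strategy 0 produced 5 improvements, etc.
probs = update_probabilities([5, 2, 3])
chosen = roulette_select(probs)        # strategy index used for the next individual
```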
3.4. Time complexity analysis
The operation of SDECOA comprises population initialization, fitness calculation, population position update, individual probability update, and strategy selection, so the time complexity of SDECOA can be obtained by analyzing each process. Assuming the population size is N, the number of iterations is T, and the problem dimension is D, the time complexity of population initialization is O(ND). In each iteration, the time complexity of the population position update is O(ND), that of fitness evaluation and comparison is O(ND), and that of the individual probability update and strategy selection is O(ND). Therefore, the overall time complexity of SDECOA is O(ND + T × (ND + ND + ND)) ≈ O(TND).
4. Experiment results and analysis
4.1. Experimental configuration and design
The programming language used in this experiment is MATLAB, with MATLAB R2021a as the development environment. The computer configuration consists of a 64-bit Windows 10 operating system, 8.0 GB of RAM, and a processor with a base frequency of 2.10 GHz. To demonstrate the effectiveness of the improved algorithm, it was first compared with several classic and advanced algorithms on the 20-dimensional CEC2022 benchmark functions. To further validate its performance, SDECOA was employed to solve 48 real-world constrained optimization problems from CEC2020 and compared with several winning algorithms from the CEC2020 competition. Finally, the algorithm was applied to the multi-robot online path planning problem.
4.2. SDECOA for CEC2022 benchmark functions
Solving benchmark test functions is a common way to assess algorithm performance, and this section presents the results of SDECOA on this test set. The CEC2022 benchmark functions are a recent test suite, and their basic information is shown in Table 1. The suite covers three dimensions (dim = {2, 10, 20}), with function types including a unimodal function (F1), multimodal functions (F2–F5), hybrid functions (F6–F8), and composite functions (F9–F12). To validate the algorithm's effectiveness, we compare SDECOA with the original DE and COA. Furthermore, to evaluate its performance, we also test it against several well-known classical algorithms and recent advanced algorithms. The algorithms involved in the CEC2022 tests include simulated annealing (SA) [Reference Kirkpatrick, Gelatt and Vecchi40], PSO [Reference Kennedy and Russell7], GWO [Reference Mirjalili, Mirjalili and Lewis38], the sine-cosine algorithm (SCA) [Reference Mirjalili41], the whale optimization algorithm (WOA) [Reference Mirjalili and Andrew42], the Harris hawks optimizer (HHO) [Reference Heidari, Mirjalili, Faris, Aljarah, Mafarja and Chen43], SSA [Reference Xue and Bo44], the modified adaptive sparrow search algorithm (MASSA) [Reference Mirjalili41], the self-adaptive differential sine-cosine algorithm (sdSCA) [Reference Akay and Mustafa35], and the hybrid algorithm of differential evolution and flower pollination (HADEFP) [45], among others. In this set of experiments, all algorithms were run independently 25 times. Except for SA, the iteration count for all algorithms was set to 1000, with a population size of 50. The other parameters of each algorithm are listed in Table 2.
As shown in Table 3, the means and standard deviations of the different algorithms are recorded. Across all test functions, the proposed algorithm outperforms the original COA and DE by a significant margin, achieving the best results in particular on F1, F6, F7, F8, and F9. Among all compared algorithms, SDECOA proves to be the most competitive. Table 3 also presents the rankings of all algorithms based on the Friedman rank-sum test, and the data confirm that the proposed algorithm achieves the best performance on the CEC2022 test functions.
The convergence curves for different CEC2022 test functions are shown in Figure 2. From these curves, it is evident that SDECOA exhibits substantial improvements in both optimization capability and convergence speed compared to DE and COA. Compared with other algorithms, SDECOA shows the best convergence speed and results for most test functions. The boxplots for all test functions are shown in Figure 3, where it can be seen that SDECOA performs better in terms of median values and lower variance for most functions. Compared to other algorithms, SDECOA is more stable, thus demonstrating the superior robustness of the proposed algorithm.
4.3. SDECOA for CEC2020 problems
To further demonstrate the effectiveness of SDECOA, this section presents the experimental results and data analysis of SDECOA applied to real-world constrained optimization problems. The real-world constrained problems used in this study are based on the CEC2020 real-world constrained optimization problems. The mathematical models for all problems in this study are derived from reference [Reference Sharma and Jabeen46]. Table 4 displays the basic information of 48 problems, where D represents the dimension of the problem, g represents the number of inequality constraints, h represents the number of equality constraints, and f* represents the known optimal value. These problems are mainly designed in fields such as Industrial Chemical Processes, Process Synthesis and Design, Mechanical Engineering, Power Systems, Power Electronics, and Livestock Feed Ration Optimization, with dimensions ranging from 2 to 118 and varying numbers of constraints from 1 to 108.
In this set of experiments, SDECOA is compared with several advanced algorithms that won in the CEC2020 competition. These top-ranking algorithms are, in descending order, SASS [Reference Kumar, Das and Zelinka47], COLSHADE [Reference Gurrola-Ramos, Hernàndez-Aguirre and Dalmau-Cedeño48], sCMAgES [Reference Kumar, Das and Zelinka49], EnMODE [Reference Sallam, Elsayed, Chakrabortty and Ryan50], and BpMAgES [Reference Hellwig and Beyer51]. Each algorithm is run independently 25 times with 1000 iterations and a population size of 60; the other control parameters are consistent with the literature [Reference Xu and Lihong52], and a dynamic adaptive penalty function is used to handle constraints. Table 5 presents the optimal values, mean values, and standard deviations of these algorithms. It is worth noting that shaded cells in the table indicate that the proposed algorithm's best result is better than the known optimum f* for that problem. The Friedman rank-sum test is also conducted on the average values over these 48 optimization problems, and the resulting ranking of the proposed algorithm is superior to those of the top five algorithms that won the CEC2020 competition.
The literature [Reference Ezugwu, Adewumi and Frîncu53] assesses solution quality by calculating the percentage deviation of the mean with respect to the known optimal solution (PDmean). As can be seen from Eq. (27), the smaller the average value obtained over 25 runs, the smaller the PDmean value, indicating higher solution quality. Table 6 presents the PDmean values of the different algorithms on the different problems:
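The original Eq. (27) is not reproduced here; a form consistent with the description (placing $\varepsilon$ in the denominator to guard against division by zero is an assumption) is

$$PD_{mean}=\frac{mean-f^{*}}{\left|f^{*}\right|+\varepsilon }\times 100\%$$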
where f* represents the best known solution, mean represents the average value of the test results, and $\varepsilon$ represents a very small constant. An analysis of Table 6 shows that the proposed algorithm generally achieves superior solution quality compared to these advanced algorithms. Based on the experimental results, the proposed algorithm achieves ideal results in most cases, and for 26 problems its results surpass the known optimal solution. These results demonstrate that SDECOA not only performs well in general but also excels at real-world constrained optimization problems. Therefore, in comparison with these state-of-the-art algorithms, our algorithm is highly competitive.
4.4. SDECOA for multi-robot path planning problem
In this section, the experimental content applies SDECOA to the multi-robot path planning problem, the mathematical model code of which can be obtained from reference [Reference Akay and Mustafa35].
4.4.1. Design of the scenario for multi-robot path planning problem
This section introduces the simulation scenario layouts for the robot path planning problem. In the simulation model, all robots are circles of equal radius, and collisions between them are not allowed. Each robot has its own starting point and end point. The static obstacles each have a different size and shape. Each dynamic obstacle is also a circle of equal radius and has its own starting point and end point; it moves at a constant speed without deviating from its course. In this study, three different scenarios are created, with the numbers of robots, static obstacles, and dynamic obstacles given in Table 7. Except for Scenario 1, which is taken from the literature [Reference Akay and Mustafa35], the scenarios are arranged independently in this study. The layouts of the three scenarios are shown in Figures 4, 5, and 6. In these scenarios, the black paths represent the planned paths of the robots, the blue paths represent the paths of the dynamic obstacles, and '×' marks the respective end points.
In the experiments of the three scenarios mentioned above, in addition to the proposed algorithm, the comparison algorithms include DE, PSO, SCA, COA, and sdSCA, with a population size of 30, and the parameter settings are consistent with Section 4.2. It is worth noting that in this experiment, the iteration of all algorithms stops when all robots reach their end points. Each algorithm runs independently 20 times.
4.4.2. Simulation experiment in Scenario 1
In the simulation experiment in Scenario 1, Figure 7 illustrates the travel paths of each algorithm. In solving the multi-robot path planning problem, the stronger the algorithm’s performance, the smoother the robot’s path and the shorter the average travel distance. As seen in Figure 7, the path corresponding to SDECOA is notably the smoothest. Table 8 records the average travel distance and standard deviation for each robot from the starting point to the destination and also calculates the total average travel distance for the six robots in Scenario 1. For ease of comparison, a histogram of the average travel distances for different algorithms is shown in Figure 8. From this, it is evident that SDECOA demonstrates the best performance. Table 9 lists the number of steps taken by each robot, with fewer steps indicating a smoother path.
The experimental results from Scenario 1 show that, compared to DE and COA, the proposed algorithm reduced the average total travel distance by 33.2% and 22.4%, respectively, and the average number of steps taken by 26.9% and 15.8%.
4.4.3. Simulation experiment in Scenario 2
Figure 9 shows the travel paths of each algorithm in Scenario 2. It is clearly observed in Figure 9 that, as the number of robots increases, the differences in path smoothness between the algorithms become more pronounced. Compared to the other algorithms, the path corresponding to SDECOA is the smoothest. Table 10 records the average travel distance and corresponding standard deviation for the 12 robots from the starting point to the destination. Figure 10 presents a histogram of the average travel distances for different algorithms. From the analysis of Figure 10, it can be seen that, despite the increased complexity of the problem, SDECOA still demonstrates strong performance. Table 11 records the number of steps taken by each robot.
The experimental results from Scenario 2 show that, compared to DE and COA, the proposed algorithm reduced the average total travel distance by 46.1% and 50.2%, respectively, and the average total number of steps by 40.0% and 48.3%.
4.4.4. Simulation experiment in Scenario 3
In Scenario 3, the number of robots was increased to 20, and the quantities of both dynamic and static obstacles were significantly raised, making the path planning problem more complex. It should be noted that in this experiment COA caused robots to become lost and thus performed poorly; the results involving COA are therefore excluded from consideration in this section. Figure 11 illustrates the trajectories of each algorithm in Scenario 3. It is clearly observable that, even as the problem scale increased, the proposed algorithm still found effective paths compared to the other algorithms. Table 12 records the average travel distance and corresponding standard deviation for the 20 robots from start to destination, while Figure 12 shows the histogram of average travel distances for the different algorithms. The histogram clearly indicates that the average travel distance of SDECOA is significantly lower than that of the other algorithms. Table 13 documents the number of steps taken by each robot.
From the experimental results in Scenario 3, it can be concluded that, compared to DE, the proposed algorithm reduced the average total travel distance by 45.3% and the total average number of movement steps by 37.0%.
These experimental data show that, regardless of the number of robots, the proposed algorithm always achieves the best performance. Moreover, as the number of robots increases, the travel distances produced by the other algorithms grow very large, while SDECOA still copes well. The path diagrams of the three scenarios clearly show that the proposed algorithm yields shorter travel distances and smoother paths for the robots, with particularly significant advantages when the robots operate in more complex environments. It can therefore be concluded that our algorithm outperforms these algorithms in performance and competitiveness.
5. Conclusion and future work
Current research on robot path planning mainly considers environments with static obstacles or involves only a very small number of robots. In this study, by contrast, we consider a scenario with 20 robots and 7 dynamic obstacles, which significantly increases the complexity of the problem. To address multi-robot online path planning in environments with both static and dynamic obstacles, this paper proposed SDECOA, a self-adaptive differential evolution-based coati optimization algorithm. The proposed algorithm extends the original COA with two DE update strategies that are selected adaptively through a learning mechanism, eliminating the dependency on a single update strategy in COA. As a result, the algorithm achieves excellent performance across a variety of problem domains. Furthermore, the crossover probability, originally a constant parameter, is made a dynamically adaptive variable to further improve solution accuracy. This approach effectively mitigates the premature convergence caused by the imbalance between local and global search capabilities in COA, thereby improving the algorithm's optimization capability.
To validate the performance of the proposed algorithm, it was applied to the CEC2022 benchmark test suite and the CEC2020 real-world constrained optimization problems. Experimental results show that, compared to DE and COA, the proposed algorithm significantly improves performance, solving optimization problems even with over 100 constraints. When compared with the winning algorithms of the CEC2020 competition, SDECOA achieved the best results, as confirmed by the Friedman test. Finally, the algorithm was applied to multi-robot path planning problems in three complex scenarios. In Scenario 1, compared to the original DE and COA, the proposed algorithm reduced the average total travel distance of all robots by 33.2% and 22.4%, respectively, and reduced the average total movement steps by 31.1% and 21.6%. In Scenario 2, the average total travel distance was reduced by 46.1% and 50.2%, respectively, while the average total movement steps were reduced by 40.0% and 48.3%. In Scenario 3, compared to the original DE, the average total travel distance was reduced by 45.3%, and the average total movement steps were reduced by 32.7%. These experimental results demonstrate the strong competitiveness of the proposed algorithm.
This paper presents a novel solution approach for the multi-robot path planning problem and holds promise for addressing more complex path planning challenges in future environments, such as multi-robot path planning in three-dimensional spaces with multiple dynamic obstacles. According to the No Free Lunch theorem, no single algorithm can solve all problems, but continuous improvement can enable an algorithm to solve a broader range of problems. Of course, the number of position update strategies is not always a determining factor for success. In future research on intelligent algorithms, better position update strategies that align with the COA are expected to emerge, along with more challenging real-world optimization problems for SDECOA to solve. One limitation of the proposed algorithm is that it has a relatively large number of control parameters, and the update strategy that is effective in the early iterations may not remain suitable in later ones. Future research could focus on applying the algorithm to more realistic path planning problems. Moreover, this paper only applies SDECOA to single-objective constrained optimization problems; future work could extend its application to high-dimensional multi-objective optimization problems.
Author contributions
Lun Zhu: Methodology and Writing – original draft. Guo Zhou: Writing – original draft. Yongquan Zhou: Supervision and Writing – review and editing. Qifang Luo: Supervision and Experimental results analysis. Huajuan Huang: Algorithm. Xiuxi Wei: Software and Results analysis.
Data availability
No data were used for the research described in the article.
Financial support
This work was supported by the National Natural Science Foundation of China under Grants U21A20464 and 62066005.
Competing interests
All the work outlined in this paper is our own except where otherwise acknowledged and referenced. The work contained in the manuscript has not been previously published, in whole or part, and is not under consideration by any other journal. All authors are aware of and accept responsibility for the Manuscript. The authors declare that they have no conflicts of interest.
Ethical approval
This article does not contain any studies with human participants or animals performed by any of the authors.