1 Introduction
There are several ancient old records that reference to artificial beings in the literature. For example, in Greek mythology, there are records related to the Greek god Hephaestus, who created mechanical creatures such as the mythical bronze giant called Talos. It was a giant who protects Crete from any invaders (Gera, Reference Gera2003). Talos myth was the inspiration for the prototype Tactical Assault Light Operator Suit (TALOS), developed by the Massachusetts Institute of Technology (Oldham, Reference Oldham2015).
Before the 20th century, intelligent machines had been built. Currently, these machines could be considered prototypes of the current robots. However, the main concepts and methods developed in modern Robotics emerged in the early decades of the 20th century in the context of technological and science transformation. During the 50s, 60s, and 70s, researchers were interested in assigning greater autonomy to the robots to perform actions like humans. In the 21st century, it is possible to include Robotics as an essential element in the economy and development of society through its efficient and autonomous systems.
Robotics has changed over the decades, and it was focused on the industrial sector since the introduction of robot manipulators to production methods. However, today this area is more diversified. Currently, there are many applications in the Robotics field, which range from the use in the industry to entertainment.
Robotics is a highly interdisciplinary field and involves studies in Mechanical Engineering, Electrical Engineering, Computer Science, Artificial Intelligence (AI), among others. The integration of them allows Robotics to stimulate the development of new approaches to problem-solving. These approaches are based on the integration in many concepts, terminologies, methodologies, and procedures. In this context of interdisciplinary, the Distributed Artificial Intelligence (DAI) area, specifically its subarea called Multi-Agent System (MAS), has been an auxiliary discipline, especially for the Multi-Robot System (MRS) area.
DAI has been established as a research and development field since the 80s, analyzing social and emerging issues of computer systems, allowing them to solve problems using computational processes called agents. In a MAS, the main target is to develop an agent architecture to work autonomously and socially, as well as in the development of the communication and coordination systems. In this case, the solution to the problem arises through local interactions between agents.
A MRS is composed of multiple robots that act and interact with each other so that they carry out tasks under the metaphor of social intelligence. The robots in a MRS are considered robotic agents with the ability to perceive the environment through sensors and act on this environment using motors/actuators. Usually, the robots have some degrees of autonomy and therefore they do not require external commands to perform a specific action (Rockel, Reference Rockel2011). IEEE-RAS Working Group, named Ontologies for Robotics and Automation (Prestes et al., Reference Prestes, Carbonera, Fiorini, Jorge, Abel, Madhavan, Locoro, Goncalves, Barreto, Habib, Chibani, Gérard, Amirat and Schlenoff2013), defined that robots are agentive devices in a broad sense, purposed to act in the physical world to accomplish one or more tasks by working on the environment or themselves. Also, the actions of a robot might be subordinated to the effects of other agents, such as software agents (bots) or humans. Thus, from an ontological perspective, it seems natural to do consider robots as agents. For example, Schmuck and Chli (Reference Schmuck and Chli2017) proposed a centralized architecture for collaborative monocular simultaneous localization and mapping (SLAM) for multiple small unmanned aerial vehicles (UAVs) to act as agents.
In recent decades, the number of works using robots has been expanded, and thereby new demands related to systems modeled and implemented with more than one robot were emerging. At the end of the 80s, the use of MRS has occurred on the assembly line of some factories, although there was little or no autonomy in the manufacturing process. At this time, making use of MAS concepts, the scientific community tried to develop and improve the MRS area (Botelho & Alami, Reference Botelho and Alami2000; Murphy, Reference Murphy2000a). With the full range of acquired knowledge, the MRS field has gained ground, allowing complex activities are carried out in a distributed manner and with more intelligence in their processes. However, according to Arai et al. (Reference Arai, Pagello and Parker2002), this field is still young enough that no topic in this area can be considered mature.
MAS’s frame of reference points out pathways for a methodological integration with MRS area. However, analyzing the scientific literature, it is possible to notice that there are few studies address the integration between MAS and MRS (see Arai et al., Reference Arai, Pagello and Parker2002; Ota Reference Ota2006; Rockel et al. Reference Rockel, Klimentjew and Zhang2012). Therefore, to help to fill this gap, this paper presents a proposal for this interdisciplinary integration. At first, the main theoretical points in both areas are presented and discussed. Based on this theoretical framework, common issues between MAS and MRS have been identified. Among these issues, it is possible to mention that both systems are complex, self-organized, and open. They present emergent behaviors, the agents are autonomous, and the coordination and communication mechanisms are used for interaction between agents, among others. According to Steele and Thomas (Reference Steele and Thomas2007), emergent behavior in MRS occurs when the individual actions combine to form a larger, emergent pattern. Also, it automatically adapts to changes in the environment. The hypothesis of this paper is that such common issues can build up a frame of reference for the interdisciplinary integration between the MAS and MRS areas. After the theoretical studies and aiming to demonstrate the feasibility of the proposed integration, a case study of a decentralized MRS for tracking and surrounding a stationary target is presented (Botelho et al., Reference Botelho, Marietto, Mendes, Ferreira and Silva2017). Also, it was implemented in the robotic simulation called Virtual Robot Experimentation Platform (V-REP 2018), and the results are shown in this paper.
This paper is conducted on two fronts, with respect to related work. The first is investigated in the integration between MAS and MRS. In such integration, in order to develop and control a MRS, Arai et al. (Reference Arai, Pagello and Parker2002) investigated a reactivity and social deliberation for both simulated and real MAS. Ota (Reference Ota2006) described that the MRSs were considered a MAS with some special characteristics. For example, the limitations of the robots should be examined because they have sensors, processors, actuators, and communication devices. Rockel et al. (Reference Rockel, Klimentjew and Zhang2012) created a three-layer architecture connecting MRS and MAS using a new middle layer called Robot System Abstraction Layer. The main idea considered in this paper is to combine a high-level MAS with a MRS. The development of decentralized MRS is explained in the second front of this paper. Currently, there are few studies related to such MRS since most works are centralized. Pereira et al. (Reference Pereira, Campos and Kumar2004) proposed decentralized control policies for a MRS to make the robots move toward a goal position while maintaining the object closure condition. Lawton et al. (Reference Lawton, Beard and Young2003) presented a decentralized behavior-based approach for formation maneuvers for MRS. Moreover, a framework that permits quick decentralized and parallel decision-making was developed in Azarm and Schmidt (Reference Azarm and Schmidt1997). Its main idea was to introduce a decentralized approach for the conflict-free motion of a MRS. Another decentralized MRS, self-organized solution for dense UAV traffic simulation in open 2D and 3D space, under realistic conditions was proposed in Virágh et al. (Reference Virágh, Nagy, Gershenson and Vásárhelyi2016). In the proposed MRS, every UAV calculates its own desired outputs locally, based on the local information, without central processing of any dynamic global information.
This paper is organized as follows. In Section 2, the main theoretical points of MAS and MRS are presented. From this theoretical framework, a proposal for interdisciplinary integration between MAS and MRS is explained in Section 3. Also, the case study to model and implement a decentralized MRS using the interdisciplinary approach is proposed. In this study, the robots in the MRS are programmed to track and surround a stationary target. The robot characteristics, the scenario, the tracking, surrounding, obstacle avoidance, and the simulation results are also highlighted in this section. Finally, the conclusions and future work are considered in Section 4.
2 Integration between MASs and MRSs: theoretical background
The evolution in the Robotics field allowed the emergence of new demands that require the integration of several robots. The development of a system with more than one robot enables the execution of tasks faster and more efficient, as compared to a system in which only one robot is used. By way of illustration, in Feddema et al. (Reference Feddema, Lewis and Schoenwald2002), a cooperative team of Roving All-Terrain Lunar Explorer Rover (RATLER) vehicle is used to surround and monitor an enemy facility. In this coordinated exploration, a team of approximately 18 robots was designed for exploring a building that has not been explored. SLAM algorithm is implemented, and multi-robot mutual observations are used to solve difficult correspondence problems. SLAM is concerned with the problem of building a map of an unknown environment by a mobile robot while at the same time navigating the environment using the map (Riisgaard & Blas, Reference Riisgaard and Blas2003).
In the light of the above, the following hypothesis can be set up: the DAI area, more specifically the MAS, can be used as the theoretical and technical basis for structuring the MRS area. In order to support this hypothesis, the DAI and MRS areas are presented in the following subsections.
2.1 Distributed artificial intelligence and multi-agent systems
Since its beginning in 1955 (McCarthy et al., Reference McCarthy, Minsky, Rochester and Shannon1955), the AI area has defined research lines aiming to find computational solutions to some of the human intelligence features (McCarthy et al., Reference McCarthy, Minsky, Rochester and Shannon1955). In the early seventies, AI already offered robust techniques for the development of intelligent systems. These techniques were based on a system composed of only one control center and a knowledge base (KB). This approach, inherited from the von Neumann model and the conceptions of traditional Psychology, was characterized by centralized modeling (de M. Batista et al., Reference de, Batista, das, Marietto, Botelho, Kobayashi, dos, Alves, de Castro, Ruas, Alkhateeb, Maghayreh and Doush2011).
The centralized paradigm of traditional AI makes some applications inefficient or too slow for problem-solving. For example, performing a full medical diagnosis requires a team of several specialist doctors. In automating this process, it is not indicated the use of a single computational program acting as a centralizer of decisions and actions. But it is stated the use of a system where specialized programs in different medical areas perform specific tasks, defining the patient’s diagnosis in a collectively way (Rezende, Reference Rezende2005).
At the end of the 70s, the AI metaphor was questioned. At this time, the modeling and implementation of computational systems, under the metaphor of social intelligence, started to be investigated. This study was based on the local interaction between agents and the environment. In DAI, an agent is a computer program located in an environment with its internal states, goals, and behaviors. Such a structure allows an agent to interact autonomously with other agents and the environment.
In this new social metaphor, the focus for solving problems is the implementation of autonomous agents with specific abilities. Also, they can solve problems through coordination, communication, and information distribution between them. In this context, the DAI creates and presents an adequate theoretical-technical grounding for the modeling and implementation of social aspects of computational systems. It also offers new and more comprehensive forms for problem-solving, knowledge representation, inference mechanisms, among others (de M. Batista et al., Reference de, Batista, das, Marietto, Botelho, Kobayashi, dos, Alves, de Castro, Ruas, Alkhateeb, Maghayreh and Doush2011).
DAI is based on social intelligence and in phenomena of a sociological and/or ethological nature. The life of an ant colony is an example of an ethological organization based on social intelligence. Although an ant alone cannot be considered an intelligent entity, the colony as a whole presents an intelligent behavior. In a colony, there is a task distribution among the soldier, worker, and queen ants, besides searching for food and subsequent storage.
According to Bond and Gasser (Reference Bond and Gasser1988) and Ruas et al. (Reference Ruas, das, Marietto, de Moraes Batista, dos, França, Heideker, Noronha, da Silva, Alkhateeb, Maghayreh and Doush2011), the main characteristics of the DAI area can be described in the following points:
In the modeling of self-organized systems, the operation of a system is not guided or managed by an external entity. Conversely, the organization is achieved by the interaction and communication between its agents;
As a result of generating open systems and allowing the input and output of agents in the MAS, the DAI systems have great adaptability to the environment by incorporating new elements, information, and configurations;
The development of a distributed system can be done in parts so that specialists in such a domain develop each one;
To improve the system efficiency, parallel activities can help to reduce the processing and reasoning time;
There are application domains that are inherently distributed in space, and therefore it is well modeled and also resolved in a distributed way, for example, air traffic control systems and electric power distribution.
MAS is a research area in DAI, in which the metaphor of social intelligence guides the modeling and implementation of computational systems. A detailed description of the MAS area is considered in the next subsection.
MASs are computational systems composed of a group of agents that interact with each other to solve a problem. The intelligent community is the metaphor used in which social behavior is the basis for the intelligence of the system. This approach starts from a collective building of the solution, where behaviors emerge as a result of the interactions of its elements, following local rules (de M. Batista et al., Reference de, Batista, das, Marietto, Botelho, Kobayashi, dos, Alves, de Castro, Ruas, Alkhateeb, Maghayreh and Doush2011).
The theoretical bases for modeling MAS are presented in this section.
2.1.1 Complex and unpredictable systems
MASs are complex systems and also can be unpredictable systems. Complex systems are composed of two or more interconnected components that create an object network that interacts with each other and exhibits dynamic and aggregate behavior. Also, the action of an element can affect the activities of different elements in the network. It is essential to emphasize that in this paradigm ‘the whole is greater than the sum of its parts’.
Unpredictable systems are complex systems with a high degree of instability and unpredictability in the decision-making process and the results of the agents’ actions. It is important to point out that these aspects need to be addressed dynamically. According to Lempert (Reference Lempert2002), agent-based models are often useful under conditions of considerable uncertainty, where a reliable prediction of the future is not possible.
2.1.2 Self-organizing and open systems
MAS is self-organizing and open systems. Self-organization is a process where the organization of a system is not guided or managed by an external entity but by its entities that make up the system. In self-organized MAS, the agents can structure themselves spatially and temporally in an environment, allowing agents to adapt to their environment through an autonomous reorganization dynamically. With the capacity for self-organization, it is possible to develop MASs with complex dynamics and adaptable to environmental disturbances, without complete knowledge of future conditions.
MASs are considered open systems because the agents are able to enter and leave society, and even with the insertion/withdrawal of some of its members, the system maintains its regular operation. The migration of agents to other MAS is intended to help in problem-solving. Because they are open systems, MASs have high adaptability to the environment, so they get to establish the relationships and communication between agents dynamically.
2.1.3 Emergent behaviors
MASs present emergent behaviors. According to Axelrod (Reference Axelrod1998), these behaviors can be described as large-scale effects caused by local agent interactions. Such results are noted as stable and non-obvious macroscopic patterns emerging from local interaction rules between individual agents.
It is worth pointing out that the emergent behaviors are not explicitly programmed in the agents because they are socially constructed behaviors. In some models, the emergent properties can be formally deduced, but they can also be unpredictable and unexpected since even the consequences of simple local interactions can present considerable difficulties in being anticipated (Ruas et al., Reference Ruas, das, Marietto, de Moraes Batista, dos, França, Heideker, Noronha, da Silva, Alkhateeb, Maghayreh and Doush2011).
2.1.4 Reactive, cognitive, and hybrid agents
In MASs, the agents are usually classified in the following groups: reactive, cognitive, and hybrid. A reactive agent acts reactively to its environment changes. Therefore, its functional model is based on stimulus response. In general, these agents do not have any memory, their future actions are not planned, and direct communication with other agents does not happen. The indirect communication happens by observing the changes occurred in the environment. The reactive agents are implemented with low computational complexity, based on biological or ethological organization models, such as ant and termite societies (Dorigo et al., Reference Dorigo, Birattari and Stützle2006).
Cognitive agents have a specified computational complexity and an explicit representation of the environment and community members. They have memory and can think about their actions in the past and plan their future. These agents communicate with each other directly. In this case, their perception systems (that allows to examine the environment) and the communication (that allows the message exchanges between agents) are different. The human groups are the metaphor used for structuring the agents’ community. In these groups, teams of specialists solve problems in a coordinated way, and the complexity exceeds the individual capacities of each of its members.
Hybrid agents combine the functionalities of reactive and cognitive agents. The objective of this combination is to overcome the limitations presented in the reactive and cognitive architectures. The limitation of cognitive architecture refers to its difficulty in performing fast and efficient reactions to an unforeseen situation. On the other hand, reactive architectures have difficulties in defining alternatives for the agent behaviors and change their actions plans when the condition of the environment diverges from their pre-established objectives.
2.1.5 Autonomous agents
Agents are considered autonomous when they operate without the direct intervention of other external systems (humans or computers), presenting some control over their actions and internal states.
In Rezende (Reference Rezende2005), an agent is considered autonomous when it can analyze a specific situation to be solved, to create alternatives, and to make choices that best suit the specific goals. There are cases in which the agent is not able to recognize the environment. However, to find a solution to the problem, the agent chooses a previous experience.
2.1.6 Coordination: cooperation, collaboration, and competition
In order to make the community act coherently, Nick Jennings (Reference Jennings1995) defines coordination as a process by which an agent rationalizes its local actions and anticipates other agents’ actions. MASs assume coordination among their agents because: (i) no individual has sufficient competence, resources, or information to solve the problem in an isolated manner; (ii) there are interdependencies between agents’ actions; (iii) the system presents global constrains, which must be respected; and (iv) information, resources, and expertise must be distributed.
The coordination process must be able to deal with systems formed by agents with antagonistic or common objectives. In the case of antagonistic objectives, coordination involves issues related to competition. The collaboration is established when the agent has the autonomy to perform a task and can perform it alone. However, for example, if this agent wants to optimize the execution time of this task, it can accept or request collaboration with other agents. Cooperation occurs when an agent is not able to perform a particular task. Therefore, it is needed to ask for the help of other agents.
In this search for a solution in a distributed intelligent system, the coordination process must manage the agents’ behavior since they have local and incomplete information on the problem. This management aims to avoid conflicts, redundant efforts, deadlocks, among others. Therefore, the more the incidence of such situations, the less coordinated the system is.
2.1.7 Communication
In a MAS, the agents often need to exchange information in a coordinated way, such as knowledge and action plans. The communication between agents can occur in the following ways (Ruas et al., Reference Ruas, das, Marietto, de Moraes Batista, dos, França, Heideker, Noronha, da Silva, Alkhateeb, Maghayreh and Doush2011; Yan et al., Reference Yan, Jouandeau and Cherif2013):
Direct (Explicit): in this type of communication, the agents exchange information directly between them. Therefore, the MAS must provide mechanisms for the agents to identify themselves and know the ‘addresses’ of the other agents in the system. The message exchange mechanism has been adopted in direct communication, applying specific languages to the agents’ communication;
Indirect (Implicit): this type of communication occurs in the environment where the agents are inserted. For example, using signals flags or shared structures, such as blackboards;
Without communication: this interaction occurs when an Agent A can directly change the KB of an Agent B by inserting in Agent B’s KB beliefs, goals, among others.
Blackboards. A blackboard is a type of shared memory where agents can exchange information. MAS that uses this strategy as a communication method consists of the following components:
An independent set of agents;
A shared data structure called blackboard, through which agents communicate with one another;
A control system is responsible for defining the order in which agents operate on the blackboard.
Agents write their information items on the blackboard. In this case, there is no direct communication between agents, and the communication and coordination processes are mediated by the state of the solution stored in the blackboard.
Message exchange. In the message exchange, instead of the blackboard systems, the messages are exchanged directly between agents. These messages can be used to establish communication and coordination mechanisms through predefined protocols and to acquire new facts and beliefs. With this type of explicit communication, each agent must know more about the other agents involved in the system (Fallah-Seghrouchini et al., Reference Fallah-Seghrouchini, Haddad and Mazouzi2001).
Speech Acts is one of the linguistic approaches that have been presented adequately to model the syntactic and semantics of a natural language. This theory was first developed by John Austin and later improved by John Searle (Reference Searle1969). Austin noted that certain expressions were like physical actions that changed the state of the world. For example, when a jury declares a defendant guilty, an action is taken. In this case, the situation and social status of the defendant are changed (Huhns & Stephens, Reference Huhns and Stephens1999).
Speech Acts Theory has served as inspiration for the creation of Agent Communication Languages (ACLs). An ACL is a language used specifically for the communication between agents. This language offers a theoretical-technical structure where an agent can identify the responses that another agent emits about its message. In some cases, it can predict the possible receiver’s reactions. Thus, to achieve their goals, agents can plan their speech acts to affect the recipients’ beliefs, goals, and other mental states. Knowledge query and manipulation language (KQML) (Chalupsky et al., Reference Chalupsky, Finin, Fritzson, McKay, Shapiro and Wiederhold1992) and foundation for intelligent physical agents (FIPA) ACL (ACL, 2002) languages are examples of ACLs.
2.2 Multi-robot systems
In the late 1980s, a group of scientists started to investigate the MRS (Yan et al., Reference Yan, Jouandeau and Cherif2013). The main objective in this area is to use multiple robots to solve a problem under the metaphor of social intelligence. In order to accomplish a given task, the robots move and interact with other robots. In general, a MRS maintains synchronous or asynchronous connections between robots that may or may not physically present in the same environment.
MRS area has broadened the interest of scientists and engineers due to the wide variety of applications and tasks that a robot society can perform (Rooker & Birk, Reference Rooker and Birk2007). Also, the development of a MRS allows the tasks to be performed faster and more efficiently than using only one robot (Romero et al., Reference Romero, Preste, Osório and Wolf2014). For example, more robots cover more areas in planetary exploration.
The robotic agents may have distinct perceptual and decision-making abilities, in particular for teams of heterogeneous robots. Therefore, each agent has a different level of influence on the environment and other agents (Pessin et al., Reference Pessin, Osório, Musse, Nonnenmacher and Ferreira2007; Lemvigh & Moller, Reference Lemvigh and Moller2008). A MRS can be developed for the performance of robots in dynamic environments, where unforeseen changes may occur due to the presence of other robots, or even changes in the environment (Farinelli et al., Reference Farinelli, Iocchi and Nardi2004). As an example, Mas et al. (Reference Mas, Li, Acain and Kitts2009) developed a MRS to escort a particular moving target. It is important to point out that the movements of the goal are measured in real time using sensors. As the target moves, the uniform formation of the robots is maintained so that the target is located in the center.
The use of MRS allows, if necessary, a more flexible reconfiguration since each robot is responsible for a task. For example, if there is a problem, only one robot must be reconfigured or changed, not the entire system (Cao et al., Reference Cao, Fukunaga, Kahng and Meng1995). Using a team of robots can be easy to program, cheaper to build than using only a single robot (Yan et al., Reference Yan, Jouandeau and Cherif2013). These features provide higher resource profitability, more fault-tolerance, faster problem resolution due to parallel processing, and increased flexibility through the interconnection of multiple systems.
However, as robots have a strong interaction with the physical world, the following characteristics are considered in the modeling and implementation of MRS (Ota, Reference Ota2006): (i) some hardware limitations related to calculation power, sensing errors, errors in actuating, communications devices, among others; (ii) the robots move in a real environment. Consequently, physical interaction should be considered, such as obstacle avoidance and the manipulation of physical objects.
In order to develop the MRS, the types of robots described in the following section should be considered. It aims to differentiate the composition of a MRS in homogeneous, heterogeneous, and hybrids.
2.2.1 Homogeneous, heterogeneous, and hybrids
Homogeneity refers to the degree of similarity that exists between the robots of the same MRS. The homogeneous robots are built based on the same architecture, which is reproduced in each robot. For this reason, in this type of robot, the robotic agents have high similarity in the design of their internal modules, software, and hardware components and, consequently, in their operation.
In homogeneous systems, each robot performs the same actions with similar results. Thus, if any of these robots fail, the others can compensate for it. The idea is to reduce the susceptibility of fault system and increase the reliability and robustness of the solution. However, the homogeneity of the robots reduces the possibility of change in the functioning of the MRS. Consequently, these systems become less adaptable.
Murphy (Reference Murphy2000a) describes an example of a homogeneous MRS presented in a competition called ‘Pick Up the Trash’, held in 1994 and organized by the Association for the Advancement of Artificial Intelligence (AAAI). In this competition, the robots Io, Ganymede, and Callisto, developed by Georgia Tech, obtained the first place. They quickly and efficiently recognize trash, such as aluminum cans, and throw them in a specific deposit. For this, reactive behaviors were used to collect trash, and cooperative behaviors provide for cooperation between robots. The robots were also equipped with an infrared sensor in the gripper hands used to alert the object in range for grasping. The gripper captures and holds the soda cans, and a color camera was responsible for identifying the cans, wastebaskets, and other robots (Balch et al., Reference Balch, Boone, Collins, Forbes, Mackenzie and Santamaria1995).
On the other hand, heterogeneous robots are constructed from different architectures, differing by their internal modules, control mechanisms, and methods of interpreting information (Murphy, Reference Murphy2000a; Iocchi et al., Reference Iocchi, Nardi and Salerno2001). The heterogeneous MRSs allow a greater adaptation to a new situation and, in some cases, in dynamic environments. If there are robots in the environment with different capabilities interacting for a common purpose, then they cover a greater number of problems. However, there is a greater challenge and difficulty in implementing such a system since it is necessary to integrate each robot’s functions so that a coordinated interaction is possible (Iocchi et al., Reference Iocchi, Nardi and Salerno2001). The motion planning and stabilization approach are proposed in Spurny et al. (Reference Spurny, Baca and Saska2016). It provides an effective technique that solves heterogeneous unmanned ground vehicles (UGV) and micro aerial vehicles (MAVs) formation to target region problem. This technique uses the model predictive control method based on the virtual leader-follower approach as the formation control strategy.
The marsupial robots are an example of a heterogeneous MRS. The main objective of these robots is to develop micro-rovers to be used in areas of difficult access, such as in rescue missions. A marsupial team consists of a large robot that carries one or smaller robots to the task site, much like a kangaroo mother carries a joey in her pouch (Murphy, Reference Murphy2000a). A marsupial and shape-shifting robots for urban search and rescue (USAR) are proposed in Murphy (Reference Murphy2000b) and called Silver Bullet and Bujold robots, respectively. The USAR is focused on locating and extracting people trapped in collapsed or damaged structures. The Bujold robots navigate through voids. The robots are asked to change their configuration to be able to explore different types of environments. Therefore, the authors focused on solving the USAR by considering Bujold robots and a Silver Bullet’s Power Wheels chassis as members of the cooperative and heterogeneous team of robots. The heterogeneous MRS described in Roldán et al. (Reference Roldán, Garcia-Aunon, Garzón, de León, del Cerro and Barrientos2016) includes both ground and aerial vehicles were used to monitor the environmental variable of greenhouses, such as temperature, humidity, luminosity, among others.
Silver Bullet robot is responsible for carrying the Bujold robot, much like a kangaroo carrying its young. In order to accomplish this task, the Silver Bullet has wheels, an onboard computer, radio Ethernet link, GPS, six sonar sensors, inclinometers, two cameras, and a thermal sensor for detecting survivors. On the other hand, the Bujold is a micro-rover physically connected to Silver Bullet. It has a camera, a microphone for hearing survivors, and two headlights. This robot also changes three configurations: sitting up and facing forward, sitting up and facing backward, and laying flat (Murphy, Reference Murphy2000b). Silver Bullet and Bujold robots are developed to work together. For example, the Silver Bullet controls the Bujold, the Bujold uses the Silver Bullet’s computer to run its dedicated software, or the Silver Bullet’s radio Ethernet link is responsible for controlling the Bujold through teleoperation.
The hybrids MRSs evolve groups of homogeneous robots, and at least one heterogeneous robot. In Simmons et al. (Reference Simmons, Apfelbaum, Burgard, Fox, Moors, Thrun and Younes2000), a team of two Pioneer AT robots from Real World Interface Inc. (RWI) and an Urbie robot from IS Robotics is used for exploration and mapping of an unknown indoor environment. These robots are equipped with SICK laser scanners that have a 180° field of view, and the communication is realized via Breezecom radio links. The proposed MRS has two homogeneous robots and one heterogeneous robot, and therefore it is considered as a hybrid MRS.
2.2.2 Coordination
MAS area uses several mechanisms for the management of coordination processes. In this section, some of them are presented, with examples of MRSs that also use the same mechanisms, such as synchronization, organizational structures, and planning.
Synchronization The synchronization of actions in a MAS aims to coordinate the parallel tasks in time, chaining the actions carried out by its agents. Synchronization processes typically use distributed systems techniques.
In non-distributed systems, running on a single central processing unit, semaphores and monitors are generally used to solve synchronization issues. However, synchronization algorithms in distributed systems consider that each machine has its time mechanisms, which moves the software clock forward. The great challenge of these algorithms is to synchronize the clocks (software) of all machines for a similar time pattern. In Lamport (Reference Lamport1978), Lamport showed that the synchronization between different clocks is not needed to be absolute, and an algorithm to synchronize actions in a distributed system is presented.
As an example of MRS that uses synchronization techniques to coordinate the actions of its robots, in Bouteraa et al. (Reference Bouteraa, Mansour, Ghommam and Poisson2011), a synchronized control of manipulator robots is proposed. The strategy is to synchronize each robot in position and velocity concerning the commonly desired trajectory. The control combines trajectory tracking and coordination algorithms. In order to follow a common trajectory, the authors applied the control to synchronize three identical manipulator robots interconnected under a cooperative scheme.
Organizational structure An organizational structure is the formalization of the interaction types between human beings, offering a framework of constraints and expectations on the agent behaviors. This framework guides the interactions between agents through expected behaviors, roles to be assumed by agents, authority relations, among others. In the literature, it is possible to find some types of organizational structures, for example, hierarchical organization, centralized organization, and community of specialists.
In a hierarchical organization, the agents are separated into groups, and a supervisor coordinates each group. The organization is composed of interrelated subsystems, where the power of decision and control is assigned to the supervisor of each group, and a supervised agent belonging to a group can be a supervisor in another group. This type of organization structure allows the distribution of coordination flows between different levels of supervision.
The mapping, navigation, and planning software is organized hierarchically in the MRS described in Butzke et al. (Reference Butzke, Daniilidis and Kushleyev2012). In this work, the main objective was to attend the Multi Autonomous Ground-robotic International Challenge (MAGIC). In the MAGIC 2010 competition, the challenge was to develop a MRS that could explore and map a large dynamic urban environment. Robots should locate, classify, and respond to threats with minimal human intervention. Also, the solution was to use a hierarchical architecture decomposition of perceptual, planning, and control algorithms.
The hierarchical software architecture integrates high-level human operator commands with autonomous and semiautonomous operations. The main objective is to have an efficient high-level interaction from the human operators while simultaneously, the robots operate autonomously. In the sensing and mapping tasks, a bottom-up hierarchical architecture is proposed. However, in the planning and strategic control, a top-down hierarchical architecture is introduced. It allows human operators to efficiently issue task commands to get propagated down a chain of planners and controllers. For example, human operators can order the robots to move to some locations and execute clear-cut actions (Butzke et al., Reference Butzke, Daniilidis and Kushleyev2012).
In a centralized organization, an agent has authority over the other agents in the MAS. Therefore, a subordinate agent should accept the commands from the centralized agent. For instance, in Barrett et al. (Reference Barrett, Rabideau, Estlin and Chien2001), a population of three rovers is remotely commanded by central planning in a leader/follower approach. A lander generates plans that are executed by the rovers. In order to facilitate projection revision, the rovers use sensors to observe local conditions and send updates back to the lander.
A community of specialists is organized based on the following items: (i) it is flat because all agents are on the same level, that is, there are no hierarchical differences between them; (ii) each agent is specialist in a particular domain; (iii) the agents interact with each other, aiming a mutual adjustment of their actions so that they can collectively reach the solution of the problem. Swarmanoid (Dorigo & et al., Reference Dorigo2013) is an example of a specialized agent community. In this work, three different robot types have been used, and each robot has a specialty to solve a particular task. The small foot-bots are specialized in transporting objects or other robots. Also, they move on even and uneven terrains. The hand-bots are capable of climbing some vertical surfaces and manipulating small objects, for example, a book. Finally, the eye-bots are flying robots specialized in exploring the environment and searching for the target. In this work, the objective was to find the shelf and retrieve a book.
Swarmanoid robots are assumed to start from a single deployment area. During the exploration, until finding the target location, the eye-bots build a wireless network that spans the environment by sequentially connecting to the ceiling. This network is used to guide the new eye-bot that joins in the exploration task. The foot-bots form another wireless network on the ground that follows the eye-bot network topology. These networks can pass range and bearing messages between each other. When the eye-bot finds a shelf containing objects, the hand-bot is asked to move at the shelf location to retrieve a book. This request is sent to the deployment area through the network of eye-bots and foot-bots. In this area, the foot-bots assemble to hand-bots and start collectively transporting them to the shelf. Now, these robots are called foot-hand-bot. When they arrive at a shelf location, the eye-bot tells them to climb and to what height. After that, the foot-bots disassemble from the hand-bot. Latter starts to climb the structure, grasps the book, and descends from the shelf. Then, the foot-hand-bot reassembles and moves back to the deployment. Finally, the foot-hand-bot disassembles and waits for further swarmanoid tasks (Dorigo & et al., Reference Dorigo2013).
Planning The coordination work can be aided by an explicit division of tasks, making the agents’ behavior to be directed toward the overall objectives of the system. It can be done through planning theory. In general, multi-agent planning is either centralized or decentralized. In centralized planning, an agent generates a plan for the whole society. This plan is split into subplans that are distributed among the agents in society. In the decentralized planning, each agent generates its partial plane, and during the operation of the MAS, the agents exchange information among themselves to identify and resolve conflicts that arise.
Hierarchical task network (HTN) is proposed to coordinate a team of agents in dynamic domains, such as in the RoboCup Soccer Simulation League 3D. The proposed HTN planner is decentralized because each agent has its planner to coordinate their actions. The robotic soccer domain is a dynamic and non-deterministic environment. In order to achieve coordinated team behavior, the HTN planners are used in each of the agents for high-level coordination and description of team strategies. The main idea in this work is to use the HTN to perform complex or primitive tasks, and also all planning should be done in a distributed fashion. However, for complex tasks, the planner expands them into subtasks until the tasks are primitive (Obst & Boedecker, Reference Obst, Boedecker, Bredenfeld, Jacoff, Noda and Takahashi2006).
A centralized motion planning framework within dynamic robot networks is presented in Clark et al. (Reference Clark, Rock and Latombe2003). It enables multiple mobile robots to maneuver and achieve goals safely in dynamic environments, for example, in the planetary rover and free-floating space. However, robots have limited ranges of sensing and also communication. In order to overcome these weaknesses, the approach is based on dynamic robot networks. This approach allows forming dynamically whenever communication and sensing capabilities permit, sharing the world models and robot targets within each network, and also constructing coordinated trajectories for all robots in each network using a fast-centralized motion planner. The framework is validated in planning the robots within a bounded workspace. The robot was able to re-plan and react in real time to changes in the environment with other robots, stationary, and moving obstacles.
The problem of concurrent assignment and planning trajectories for MRS was considered in Turpin et al. (Reference Turpin, Michael and Kumar2014). A centralized solution to the problem of assigning goals and planning trajectories was applied to a team of eight KMel Robotics NanoQuad quadrotors, which successfully arrive at their goal locations. However, to reduce the maximum distance traveled between a pair of robots, a decentralized algorithm was also proposed. It required local reassignment and re-planning within the communication range.
2.2.3 Communication
As in MAS, communication between robots is necessary for effective coordination in MRSs. Each robot must perceive the environment, act as needed, and also analyze what happens to other robots through communication processes. In this process, robots receive and send information for intelligent decision-making. The main objective of communication is to share information and also to coordinate activities among robots. A team of three homogeneous planar robots was deployed in Hsieh et al. (Reference Hsieh, Cowley, Kumar and Taylor2008) to obtain a radio connectivity map in an urban environment. The radio signal strength data were important for operations, such as surveillance, reconnaissance, among others.
As seen in Section 2.1, the communication is explicit when the messages, via unicast or broadcast, are exchanged directly between the robots. In implicit communication, the robots get information about other robots through the environment or communicate with each other trough blackboard systems. In this case, it is needed to embed different kinds of sensors in the robot (Yan et al., Reference Yan, Jouandeau and Cherif2013).
In Robotics, there are several communication types available, and the robotic agent decides which is the most efficient. For example, communication by wiring cables, infrared, microwave, and wireless. It is worth pointing out that, according to Reis et al. (Reference Reis, Lima, Garcia, Behnke, Veloso, Visser and Xiong2014), wireless communications are one of the technical problems that must be considered by a group of robots. For example, some robots take more time to transmit information, or the network is consistently overloaded. In this case, the cooperative behaviors tasks are ineffective. Ducatelle et al. (Reference Ducatelle, Caro, Pinciroli, Mondada and Gambardella2011) tested in simulation and on real robots a communication-based navigation algorithm for robotic swarms through the wireless network. It used a wireless communication device that provides only line-of-sight communication so that communication links can be related to navigable paths. Therefore, to make the robots to follow paths detected through communication, an Infrared Range-and-Bearing communication system was used to link received messages to relative position information about their sender.
In order to establish reliable wireless communication with a team of client agents who are performing an independent task, Gil et al. (Reference Gil, Kumar, Katabi and Rus2015) deployed an auxiliary team of robot routers. The idea is to find the best positions of the routers that support variable demanded rates to the client’s robots. A novel method is proposed to capture the spatial variation of wireless signals and also to provide the infrastructure to support the requested quality of communication. In the paper, the method is explained in detail, and the results show its efficiency.
A decentralized flocking model for a large group of autonomous flying robots was proposed in Vásárhelyi et al. (Reference Vásárhelyi, Virágh, Somorjai, Nepusz, Eiben and Vicsek2018). It worked in a noisy environment with short-range communication devices, and in the presence of substantial communication delay and with possible local communication outages. It is important to highlight that the communication outages and delays are common in developing decentralized control algorithms for swarms of flying robots.
2.2.4 Centralized and decentralized
One of the great challenges of the Robotics field is related to the decision made by the robot and also in the next action to be performed after a specific task is finished (Pirjanian, Reference Pirjanian1997). In MRS, these decisions can be made in a centralized and decentralized way.
In a centralized system, there is a central control agent, sometimes called a leader, who is in charge of coordinating and organizing the work of other robotic agents. In this case, the leader is involved in the whole system operation, while the other members in the MRS act according to the leader’s guidelines. Also, the leader has all environmental information, including information about the robots.
According to Yan et al. (Reference Yan, Jouandeau and Cherif2013), the advantage of centralized architectures is that the globally optimal plans can be produced because the leader has a global view of the environment. However, the robots are not able to adapt in dynamic environments, once if the leader fails and another robot is not available, the entire MRS is disabled. Due to this, it is not suited MRS operating in unknown and changing environments, for instance, Stanek (Reference Stanek2012).
A centralized MRS architecture to be used in industrial automation is designed in Dahale et al. (Reference Dahale, Chaudhari and Rane2014). It consists of three pick and place robots, three mobile robots, a microcontroller as a controller for coordinating the robots, and a network of infrared sensors to detect objects. The central controller uses the network of sensors to get all the information about the environment, for example, if the robot finished or not a certain task. This information is obtained through a centralized approach, and also optimal plans can be produced from global information.
In this MRS, the work area is divided into zones, and each robot is responsible for executing a task in its zone. After the robot performs the task, it is ready for the next task to be executed or remains idle. Also, the robot assists the other robot only if an available task is detected in its zone (Dahale et al., Reference Dahale, Chaudhari and Rane2014).
On the other hand, a decentralized system is composed of autonomous agents in their decision-making processes, without any leader to control the system. Robots make their own decisions, but in a coordinated way, considering the other robots in the MRS. A decentralized MRS is more scalable, autonomous, independent, and able to achieve its best performance in the execution of tasks. For example, a self-organized swarm of 30 drones without central control exhibiting flocking with collective collision and object avoidance was proposed in Vásárhelyi et al. (Reference Vásárhelyi, Virágh, Somorjai, Nepusz, Eiben and Vicsek2018).
Matsumoto et al. (Reference Matsumoto, Asama, Ishida, Ozaki and Endo1990) presented an autonomous and decentralized robot system called ACTor-based Robots and Equipments Synthetic System (ACTRESS). It is composed of multiple robotic elements, such as mobile robots, manipulator robots, intelligent sensors, intelligent actuators, special equipment, among others. In order to achieve a task, the robots communicate with each other by exchanging messages. In the cooperation problem, the robots negotiate, offer, inquiry, or announce the information.
A decentralized air traffic control solution using drones was proposed in Balázs and Vásárhelyi (Reference Balázs and Vásárhelyi2018). The multiple-path conflicts and flow bottlenecks scenarios were considered in the simulation and real experiments. In these scenarios, drones were placed in a crosswalk situation and had the center of the square-formed common destination. It is important to point out that the simulation framework provides solutions for worst-case scenarios regarding noise and delay to make sure the uncertainty in the reality gap will not decrease safety. A 2D traffic experiment at 20 m altitude was done in the real experiments. They were performed with a tailor-made quadcopter fleet, based on a PixHawk low-level autopilot, running a custom modified version of the ArduCopter code. The authors used 30-drone fleet that took place on a windless, sunny day around Paty, Hungary, moving in a 150 × 150 m arena with 6 m/s according to the more difficult crosswalk target scenario.
3 MASs and MRSs: toward an interdisciplinary integration
MRS area deals with subjects of high complexity. Therefore, it is possible to conclude that a single area of knowledge cannot fulfill this study. In this context, an interdisciplinary integration between both MAS and MRS areas allows the overcoming of limitations inherent in each area. In the literature, when a system has two or more robots that work together, it is often referred to as a team, a society of multiple robots, MAS, or MRS. This relation between MAS and MRS is not a mere coincidence, because in the modeling and implementation of MRSs, it is possible to notice that several concepts are defined based on the MAS’s frame of reference.
The central difference between a MRS and a MAS is that in the first case, the agent is a robot, and in the second, it is a software. Indeed, the evidence presented in the research work studied in this paper (see Section 3.1) indicates that most of the issues in organizing teams of robots apply to software agents as well, and vice versa (Murphy Reference Murphy2000a). The integration between the MAS and MRS areas is shown in Figure 1.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20200812102421832-0497:S0269888920000375:S0269888920000375_fig1.png?pub-status=live)
Figure 1 Integration between MAS and MRS areas
In order to highlight this integration, the MRS proposed in Amato et al. (Reference Amato, Konidaris, Cruz, Maynor, How and Kaelbling2015) is an example to identify and explain the frame of references of MAS and MRS described in Section 2 and shown in Figure 1:
Complex system. It is a complex system because the MRS has three robots, the action of a single robot can affect the action of other robots, and the intelligent behavior emerges from the interaction of various simpler behaviors;
Self-organizing system. It is a self-organizing system because a team of robotic agents tries to organize their actions collectively and also cooperates to optimize the tasks;
Open systems. In theory, every MAS can be modeled to be an open system. However, in the article, the authors have not defined whether robots can be inserted into society, or under what conditions they can leave society without interrupting the functioning of the system;
Emergent behaviors. The main task of the robots is to find and retrieve large and small boxes. First, the robots navigate in the environment to retrieve boxes and bring them back to a designated area. In the experiments, it is possible to notice the cooperative behaviors can emerge from a team of robots that use task allocation, direct communication, and signaling;
Reactive, cognitive, and hybrids agents. Robots of the MRS are hybrids because they have reactive and cognitive behaviors. In order to choose an action, each robot is programmed to verify the locally observable information. They are developed to observe its location, detect other robots if they were within one meter, see and observe the size of the nearest box. These behaviors are reactive because the robots act upon any changes in the environment. In cognitive behavior, the location of the depots is known, and the number and type of boxes are unknown. The planner is responsible for generating and evaluating solutions. The results include the location of each robot and boxes;
Autonomous agents. The authors considered three robots in a warehouse to find and retrieve large and small boxes. These robots are asked to navigate to known depot locations to retrieve boxes and bring them back to a designated drop-off area. However, one robot is not able to move a large box. Thus, two robots are asked to perform this task. Robotic agents defined in the MRS are autonomous because they must cooperate to optimize some objectives in the presence of uncertainty. They are also programmed to make their own choices and act in the environment without any influence from a leader or global plan;
Communication and coordination. In order to validate the framework, three scenarios are proposed in the simulation. In the first, the robots could not communicate with each other. Thus, the cooperation occurs by the controllers that are generated by the planner or observation of other robots. Second, the communication between the robots is available when they are within 1 m of each other. In the experiment, the communication is realized when a robot needed help carrying a large box. The robots used signaling communication in the third scenario, rather than direct communication. In this scenario, when the robots move to the depots, there is a switch in each robot responsible for turning on the blue or red light. In the experiment, when the robot senses the large box in the depots, the red color was turned on. It means that the robot needed assistance. When the assistance is performed, the red light is turned off. If the robot senses the small box, the light was not turned on. Thus, the assistance was not needed. Unfortunately, the authors did not give an example when the robots use the blue light;
Homogeneous and heterogeneous systems. The proposed MRS uses the robotic platform known as iRobot Create (IRobot, 2016), so the hardware is the same for all the robots. However, it is possible to notice in the experiments that only one robot helps other robots that need assistance to carry a large box. Thus, it seems that two different algorithms are implemented in the robots. The first algorithm is responsible for finding and carrying the boxes in the environment, and the second makes a robot able to assist another robot. Therefore, the MRS is heterogeneous because the hardware is the same, but the software, the way to program the robot, is different;
Centralized and decentralized systems. A Decentralized, Partially Observable Markov Decision Processes (Dec-POMDPs) framework is presented in Amato et al. (Reference Amato, Konidaris, Cruz, Maynor, How and Kaelbling2015). It is a general framework for representing multi-agent coordination problems. In order to find and retrieve large and small boxes and bring them back to a designated drop-off area, the authors tested the Dec-POMDPs in a warehousing scenario using three iRobot Create robots. Robots are programmed to navigate to known depot locations. However, the number and type of boxes are unknown. The Dec-POMDPs generate an offline controller for each of the robots while they are executed online in a decentralized manner. Each robot observed its location, see other robots if they were within approximately 1 m, observe the nearest box when in a depot, and also the size of the box if it was holding one.
3.1 Proposed case study
This subsection presents a case study of modeling and implementation of a MRS, based on the frame of reference of MAS and MRS areas described in Sections 2.1 and 2.2, respectively. Such an interdisciplinary approach aims to integrate concepts, terminology, methodology, and procedures. The main idea is to find theoretical-technical solutions for complex problems arising from the constant evolution of society and science and also to show empirically the coherence and viability in the integration between these areas.
The proposed MRS aims to track and surround a stationary target. This system is composed of homogeneous robotic agents observe the environment until finding the stationary target, identified by a blue object. In this environment, robots and walls are considered as obstacles and should be avoided. In the tracking task, each robot is programmed to walk straight, and its speed is constant until it reaches the target. In the surround task, the first robot that finds the target must send a message to all robotic agents, informing the target position. In this task, a mathematical model with direct communication between robots was proposed in Botelho et al. (Reference Botelho, Marietto, Mendes, Ferreira and Silva2017) and used in this paper. This model considers that the surround is realized in the form of a regular polygon, where each robot stays equidistant therefrom and from each other through the message exchange.
Based on the explanation described so far, the MRS proposed in this paper is modeled with the following principles presented in Section 2.1 and shown in Figure 1:
Complex system. It is a complex system, since (i) it is formed by more than one robot; (ii) the action of one robot affects the subsequent actions of the others; and (iii) the interaction between robots generates dynamic and emergent behaviors;
Self-organizing system. It is a self-organizing system because robots organize themselves collectively, based on interactions with other robotic agents and the environment;
Open system. It is an open system because robots enter or leave the MRS during its running;
Emerging behaviors. It presents emerging behaviors. The track and surround tasks are socially constructed among the robots in the MRS. Therefore, the problem’s solution occurs in an emergent way, through the interactions between the robots;
Reactive, cognitive, and hybrid Agents. Robotic agents are hybrids with reactive and cognitive characteristics. The reactivity can be observed during the tracking of the target. In this task, the obstacle must be avoided until the target is found. Robots behave in a reactive way because they act on the environment using the data that are captured by sensors in a stimulus-response paradigm. The reactive behavior provides a rapid real-time response for a robot (Arkin & Balch, Reference Arkin and Balch1998). During the surround, the robots have cognitive characteristics because they think and choose the best action at a given moment. This choice is made through the perception of the environment and in the direct message exchange;
Autonomous agents. Robotic agents of the surround task are autonomous because external interventions are not needed in the decisions and also to act in the environment. Therefore, the robots control their actions and internal states;
Coordination. The surround task of the stationary target is the main goal of the robots in the proposed MRS. These robots are not able to solve the problems alone. Therefore, coordination is necessary and involves cooperation issues. The organizational structure, more specifically a community of homogeneous specialists, was the mechanism used to establish cooperation among robots.
Communication: Robotic agents are considered social because they interact with each other in the MRS. The communication between the robots, via message exchange, occurs in two steps. In the first step, when the robots move in the environment, the messages exchange happen to avoid collisions with each other. In the second step, the message exchange occurs when the first robot finds the target and informs the (x, y) coordinate of the target to other robots;
Homogeneous and heterogeneous systems. MRS is homogeneous, that is, all robots have the same architecture in both software and hardware. With regard to software, the same algorithm is implemented in robots. In the hardware, all robots of the MRS have the same amount of sensors, two wheels are used for driving, and their mechanical structure is the same;
Centralized and decentralized system. MRS is decentralized because there is no central control or a leader robot that (i) concentrates all information on the environment and (ii) decides the actions that each robot must perform.
The proposed MRS was implemented and validated on the V-REP platform (V-REP, 2018). This platform has several demonstration models and the ability to adjust physical aspects of the simulation, such as mass, acceleration of gravity, and friction. Other features include the simulation of particle motion, such as air or water displacement, automatic collision detection, among others. The ultrasound, laser, infrared, and cameras are some examples of available sensors.
The V-REP platform is also useful in modeling and validation of MRSs. Wagdy and Khamis (Reference Wagdy and Khamis2013) presented a hybrid approach to the group formation problem. This approach used cellular automata, visual grid, and the dynamic leadership technique to tackle the problem of group formation. Also, three scenarios were defined in the V-REP to validate the authors’ approach. In the first and second scenarios, the robots were asked to build up line and rectangle formations, respectively. In the third one, the adaptability of the algorithm was tested because the obstacles were defined in the environment, and the robots should shift the formation to pass the obstacle ahead. In this system, it was assumed that the leader must initiate the formation and had to evaluate the collaboration goal. The evaluation is responsible for satisfying the task of the collaboration that agents have to achieve as a group and the individual needs of the agent.
The robot-escorting task to protect a target, capture, and manage leader-follower formations was performed using V-REP (Batista et al., Reference Batista, Pinto and Romero2015). An escorting scenario with three Pioneer robots, a collidable, measurable, and detectable sphere was defined in the simulation. The Sample Lloyd-based Area Coverage System (SLACS) and repulsive forces as coverage and obstacle avoidance strategies were applied in the simulation.
Considering the characteristics of the proposed MRS, and also the functionalities of the V-REP platform, the following sections highlight the information related to the tracking and surrounding task. The environment and scenario developed in the V-REP are detailed in Section 3.1.1. Section 3.1.2 presents the hardware characteristics, and some software features of the robotic agent explained in Botelho et al. (Reference Botelho, Marietto, Mendes, Ferreira and Silva2017).
3.1.1 Environment
The environment in the V-REP platform is discretized into a square grid with a size 10 × 10. Therefore, there is a specific coordinate for each cell in the scenario. With these data, each robot is able to identify its Cartesian coordinates (x, y) and the Euler angles. These angles specify the robot orientation about the inertial system, describing the orientation of a rigid body and representing the rotations about the axes of a coordinate system (Bonev et al., Reference Bonev, Zlatanov and Gosselin2002).
V-REP platform provides a CAD-3D environment with several features available for the development of a MRS. This environment, shown in Figure 2, was used to implement and simulate the tracking and surrounding the stationary target. In the surround task, the robots move around the environment until the sensors detect an object, which can be a target or an obstacle.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20200812102421832-0497:S0269888920000375:S0269888920000375_fig2.png?pub-status=live)
Figure 2 Scenario developed in the V-REP platform with n robots, a target, and walls
The object is considered as a target if the blue color is recognized. Otherwise, it is an obstacle that can be a wall or a robot and should be avoided. It is important to point out that the sensors monitor the environment during the entire execution of the simulation.
The scenario illustrated in Figure 2 is composed of a square grid with size 10 × 10 with n robotic agents ri(i = 1, …, n), walls, and a target T(x, y). Initially, the target’s position is not known a priori. Also, the robots and the target T are randomly initialized in the scenario.
After the initialization process, if the target T is not detected, the robots start to track. The first robot that detects it must inform its (x, y) coordinates to other robots by message exchange. Finally, the surround task is performed. The important information about the robotic agents and the tracking and surrounding tasks are described in the next sections.
3.1.2 The robot
The robot called BubbleRob (2017), shown in Figure 3, is available in V-REP platform, and it was used in this work. It has a vision sensor (Sensor, Reference Sensor2017b), an ultrasonic sensor (Sensor Reference Sensor2017a), two wheels of diameter 10 and 20 cm shaft, and finally a ball caster in (a), (b), (c), and (d), respectively. The caster is a great third contact point for the robots. It avoids the possibility of having a part of the robot scrape the ground.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20200812102421832-0497:S0269888920000375:S0269888920000375_fig3.png?pub-status=live)
Figure 3 The robot created on the V-REP platform
The vision sensor in Figure 3(a) has a range of 56 cm and is able to identify shapes and colors. In turn, the ultrasonic sensor in (b) detects an object in the environment closer than 50 cm without observing details of shape and color, for instance. Therefore, the detection of any object occurs first by the vision sensor, and afterward by the ultrasonic sensor. In this scenario, the vision sensor is responsible for identifying the color of any object in front of it.
The surround task can be performed if the vision sensor identifies the blue color, and the ultrasonic sensor detects the target T. However, if any other color is identified, the ultrasonic sensor should detect the obstacle that must be avoided by the robot.
In the next subsections, the behaviors of tracking, surrounding, and detection of the target T are presented. These behaviors are carried out in parallel with n robots. In order to make the MRS operation more efficient and fault tolerance, the tracking and surrounding tasks are realized in a distributed way.
Tracking, target detection, and obstacle avoidance In the tracking task, the robots in the proposed MRS are programmed to move in a straight line, with a constant speed until the sensors detect the target or an obstacle. Almost the entire environment is covered through this blind search method, which is used until a robot finds the target.
It is worth mentioning that other robots and the walls are detected by the sensors and should be considered obstacles. When a robot detects an obstacle, it rotates approximately 120°. This movement consists of changing the speed of the wheels in the opposite direction and counterclockwise during a time interval, generating an arc trajectory. After that, the blind search continues until the target T is found (Botelho et al., Reference Botelho, Marietto, Mendes, Ferreira and Silva2017).
Figure 4 illustrates the scenario in which the robot should avoid the wall (obstacle). In (a), the robot moves toward the wall. The detection occurs in (b), and a rotation around 120° on its axis occurs in (c) and (d). Also, the distance from the robot to the wall is indicated in (a)–(c).
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20200812102421832-0497:S0269888920000375:S0269888920000375_fig4.png?pub-status=live)
Figure 4 Tracking and detection (wall) can be observed in (a) and (b), respectively. Wall avoidance happens in (c) and (d)
Target detection After identifying the blue object (target) using the vision sensor, the robot must maintain the straight line until the ultrasonic sensor also detects the target T. The environment in the V-REP is discretized into a grid of rectangular cells, and the coordinates of each cell are made available by the platform. By such representation, in the target detection task, the first robot that detects the target T informs its coordinates to other robots. Also, to maintain the same distance between each robot, the messages exchanged between the robots occur in the surround task, and this communication is described in Section 3.1.4.
The description of the required angles for the robot ri(i = 1, …, n) to be directed to the target T can be found in Botelho et al. (Reference Botelho, Marietto, Mendes, Ferreira and Silva2017).
Surrounding the target The surround task is realized in the form of a regular n-sided polygon figure, with each robot occupying a vertex of that polygon, and the center of the target T is the center of the polygon. The following properties can justify the choice of the regular polygon: (i) its sides and angles are equal and (ii) it can be inscribed in a circumference.
The regular polygon properties allow the surround formation using well-known mathematical relations with asymmetrical form and efficient functionality. The polygon radius is determined by the line segment that joins the center of the target T to the robot’s center of mass. More information about the mathematical model can be found in Botelho et al. (Reference Botelho, Marietto, Mendes, Ferreira and Silva2017).
Surround speed control In order to obtain the ideal linear speed of the circular motion of any of the n robots during the execution of the surround task, the ideal proportion of the linear speed of their right wheel in relation to the left wheel should be considered, and it was described in Botelho et al. (Reference Botelho, Marietto, Mendes, Ferreira and Silva2017).
After describing the proposed model related to the tracking, target detection, and surround tasks, the next task, described in the next section, is to validate, by simulation, the scenario shown in Figure 2.
3.1.3 Scenario on the V-REP platform
MRS implementation of the tracking and surround task of a stationary target, proposed in Botelho et al. (Reference Botelho, Marietto, Mendes, Ferreira and Silva2017), was performed on the V-REP platform, version 3.1.3 Educational 32-bit. This platform was installed on an ASUS notebook, with an Intel Inside Core Processor i5, with 6GB of RAM and Windows 8. Movies available in Mendes et al. (Reference Mendes, Botelho and Marietto2018b) and Mendes et al. (Reference Mendes, Botelho and Marietto2018a) demonstrate the MRS simulation results with three and four robots, respectively.
The scenario implemented in the V-REP platform, shown in Figure 5, is composed of three homogeneous robots (r 1, r 2, and r 3), a stationary target T, and walls. Additionally, robots are considered as obstacles. In this scenario, a MRS with three robots was modeled because this amount of robots is enough to explain the proposed idea without overloading the environment. However, it is possible to add more robots in the scenario illustrated in Figure 2. For this, it is necessary to add pieces of code responsible for identifying the new robot added in the scenario.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20200812102421832-0497:S0269888920000375:S0269888920000375_fig5.png?pub-status=live)
Figure 5 Simulation results with three robotic agents (Mendes et al. Reference Mendes, Botelho and Marietto2018b)
3.1.4 Simulation results
The initial position of r 1, r 2, r 3, and T are randomly initialized in the scenario, as shown in Figure 5(a). In (b), r 1, r 2, and r 3 always move in a straight line, until the sensors detect the target T or an obstacle. If an obstacle is found, the robot starts the obstacle avoidance process, as shown in Figure 4. After that, the new rectilinear path directed to another location. At this moment of the simulation, the robots act reactively, because their actions are taken as reactions to the perceptions of the environment.
The robot r 1 detects the target T in Figure 5(b). After that, the coordinates (x, y) of the target T are sent through a broadcast communication to other robots (r 2 and r 3) of the MRS. This communication allows the target T to be found faster by all robots. In this case, the first robot that finds the target T must send its position to the other robots in the environment. In this task, the communication is explicit because the message is exchanged directly between the robots.
The robot r 3 is directed to the target T, as illustrated in Figure 5(c). In this movement, this robot performs the procedure described in Botelho et al. (Reference Botelho, Marietto, Mendes, Ferreira and Silva2017) until it reaches the target T. Meanwhile, r 2 is still performing the procedure for directing to the T in (c)–(d) of Figure 5. Thus, it is possible to note that, based on the message sent, the robots move in a coordinated way until they find T.
After T is detected by r 1, r 2, and r 3 in Figure 5(e), the surround task, and also the adjustment begins to be performed. The adjustment, described in Botelho et al. (Reference Botelho, Marietto, Mendes, Ferreira and Silva2017), aims to guarantee the distance between two robots. The radius R adopted in the surround task was 0,7 m, defined by the difference between the speeds of the right wheels and the left wheels.
The surround task with r 1, r 2, and r 3 starts to become more stable in (f) and (g) of Figure 5. In this scenario, the robots must remain equidistant at 120° from each other. Therefore, according to Botelho et al. (Reference Botelho, Marietto, Mendes, Ferreira and Silva2017), the calculation of the ideal distance that each center of the robot should have, with the center of its neighbor, was approximately 1,1 m. In this step, the algorithm explained in Botelho et al. (Reference Botelho, Marietto, Mendes, Ferreira and Silva2017) was used with the message exchange defined in Figure 6, so that the distances between r 1, r 2, and r 3 are symmetrical.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20200812102421832-0497:S0269888920000375:S0269888920000375_fig6.png?pub-status=live)
Figure 6 In order to maintain the equidistance in the surround task, the robotic agents send the information of their respective positions
In Figure 6, the robots send their coordinates to ensure the equidistance between them in the surround task. However, Figure 7 shows that only r 1, which was the first robot to identify T, sends the T coordinates to ensure that all robots move toward the T. Finally, Figure 5(h) shows that r 1, r 2, and r 3 are surround of T. It is possible to notice that the autonomy and the emergent behavior performed through coordinated and synchronized communication allowed the surround task to be fulfilled efficiently.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20200812102421832-0497:S0269888920000375:S0269888920000375_fig7.png?pub-status=live)
Figure 7 The r 1 finds the target and sends its coordinates to the other robots that move toward the target and start the surround task
4 Conclusions and future works
This paper presented a proposal for interdisciplinary integration between MAS and MRS. This integration was based on the presentation of the main theoretical points of both areas, and a later indication of how these points could be gathered in a single theoretical framework. Among these points in the MAS, it was possible to mention the complex systems, unpredictable, self-organizing, open, emerging behaviors, reactive agents, cognitive, hybrids, communication, coordination, among others. The homogeneous, heterogeneous, hybrids, centralized, and decentralized were considered in the MRS field. Also, the synchronization mechanisms, organizational structures, and planning were described, using the concept of process management coordination of the MAS. The communication was also investigated, explaining works related to MRSs that also use MASs concepts.
After the theoretical study about MAS and MRS, it was possible to notice that the integration among them was important to overcome limitations that could be found in each of these areas. Therefore, an interdisciplinary integration between them was considered in this paper. In order to demonstrate the feasibility of this proposal, a case study of a decentralized MRS to track and surround a stationary target was modeled in Botelho et al. (Reference Botelho, Marietto, Mendes, Ferreira and Silva2017) and implemented in the V-REP. Each robot in the developed MRS was autonomous, and it was not necessary to implement external systems responsible for managing their tasks. Thus, the processing capacity of each robot was lower when compared to centralized systems.
In the proposed case study, three homogeneous robots and a stationary target were defined in the environment, and the results found in the tracking, target detection, and surround tasks were demonstrated. Based on the MAS principles, the proposed MRS was a complex system, since it was formed by more than one robot, and its action affects the behavior of other robotic agents. The self-organization and emergent behavior were noticed because the tracking and the target surround were performed in an organized way and built through the interactions between the robots. The open system happened because of the possibility of increasing or decreasing the number of robots. The reactivity was observed during the tracking task, that is, each robot was programmed to detect and avoid the obstacles. Otherwise, the cognition was noted in the surround task. Therefore, the robotic agents in the proposed MRS could be considered hybrids. Also, they were autonomous because the external interventions in decision-making were not needed. Finally, the coordination and communication between the robots were defined to perform the surround task.
The future works aim to optimize and extend the application range of the proposed MRS. This system was used as a case study in the integration between the MAS and MRS. Therefore, some improvements are necessary for the tracking task. In this task, the robots have reactive characteristics that can be optimized, such as the possibility to perform random movements until finding the target. However, the robots should have more cognitive behavior. For instance, to find the target quickly and efficiently, some efficient algorithms for mapping and navigation of mobile robots can be used. The robot cognition in the surround task should be improved, implementing an algorithm that allows the circular formation with a moving target. Finally, it is possible to investigate the best communication policy when the coordinates of the target are sent to other robots and ensure the equidistance between the robots in the surround task.
Acknowledgments
This work was supported by the São Paulo Research Foundation (FAPESP) [grant numbers 2013/17929-2 and 2015/02301-3].
Supplementary material
To view supplementary material for this article, please visit https://doi.org/10.1017/S0269888920000375