1. A hard problem: What is a group?
What is a group? More than a century into the study of groups and group identities, this question continues to vex. Human intuition suggests that groups exist (i.e., as far as we know, all typically developing humans on earth hold the mental concept of a social “group” as part of their perception of the world – it is a universal emic concept; Brown, Reference Brown1991). Research furthermore shows that people hold reliable and consistent intuitions about what is and is not a group (e.g., Hamilton, Sherman, & Castelli, Reference Hamilton, Sherman and Castelli2002). Thousands of scientific papers, hundreds of books (e.g., Brown, Reference Brown2000; Forsyth, Reference Forsyth2014/2019; Sedikides, Schopler, & Insko, Reference Sedikides, Schopler and Insko1998), and dozens of scientific journals (e.g., Advances in Group Processes, Group Dynamics, Group Processes & Intergroup Relations, etc.) are predicated on the existence of this construct. Yet what exactly are we talking about?
Almost all definitions agree that “group” means something more than just an aggregate and more than just a dimension of similarity out in the world (Campbell, Reference Campbell1958; Forsyth, Reference Forsyth2014/2019; Ip, Chiu, & Wan, Reference Ip, Chiu and Wan2006; Wilder & Simon, Reference Wilder, Simon, Sedikides, Schopler and Insko1998). A pile of leaves, for example, is not a group. A group is something more psychological, more agentic. Agency on its own, however, does not seem to be sufficient; people waiting for a bus are classically not a group, for example (Lickel et al., Reference Lickel, Hamilton, Wieczorkowska, Lewis, Sherman and Uhles2000). It seems then, that it is not enough that the entities are agents. There must also be coordination and some psychological representation among the individuals that they are so coordinated (Deutsch, Reference Deutsch and Jones1962; Sherif, Harvey, White, Hood, & Sherif, Reference Sherif, Harvey, White, Hood and Sherif1961; Tajfel, Reference Tajfel1982).
Thus, in more sophisticated definitions, we find a “relationship” (Lewin, Reference Lewin1948), “mutual awareness and interaction” (McGrath, Reference McGrath1984), “obligations,” “influence” (Shaw, Reference Shaw1981), “interdependence” (Lewin, Reference Lewin1948), and “social cohesion” existing between individuals – based on “a common identity” that people “define themselves as members of,” that “is recognized by at least one other” (Brown, Reference Brown2000) and that people “care about” based on shared “values,” “experiences,” “interests,” “kinship,” and so on (see Forsyth, Reference Forsyth2014/2019, for a review).
Undoubtedly, these definitions bring us closer to an adequate definition. The concept “group” of course refers to a very broad set of things. So a satisfactory definition – if it is to be inclusive – must be correspondingly broad and short on details. As Allport famously put it, “It is difficult to define an in-group precisely. Perhaps the best that can be done is to say that members of an in-group all use the term we with the same essential significance” (Allport, Reference Allport1954/1958, p. 30).
Nevertheless, we should not yet be fully satisfied with these current definitions. The problem – correct as they are – is that by relying on one or more additional psychological constructs, these definitions are still not yet grounded in concrete, operationalizable terms. For instance, imagine that an artificial intelligence (AI) engineer approaches us and wants to build a machine intelligence capable of “groups.” What exactly would we tell them? Groups involve the formation of “expectations,” “obligations,” and a meaningful “we.” But expectations and obligations about what, exactly? And what exactly is the essential significance of we? We might answer with obligations to “support one another,” to “come to one another's aid,” and so on. But this kind of explication still appeals to psychological concepts – all of which simply shift the goal posts. Already having an intuition or folk-psychological conception of groups is necessary to understand (and implement) what is meant by “support,” “aid,” “interdependence,” “we,” and so on. We are left then with an infinite regress of appeals to intuition. We know a group when we see it, in other words, but we couldn't say exactly what it is. Or, in the more precise language of the type/token distinction (Peirce, Reference Peirce, Weiss, Hartshorne and Burks1931–1935): We can recognize tokens (particular instances) of groups, but we don't have an explicit notion of what constitutes the type (the class itself).
In fact, there have been roughly two different ways of conceptualizing and studying groups in the behavioral sciences. The first seeks to capture how people intuitively represent and reason about groups, largely borne out of the ground-breaking theorizing of mid-twentieth century social psychology. The problem with this approach – as we have just seen – is that a necessary and sufficient definition of a group has not been forthcoming, and the definitions that do exist tend to remain rather psychological, if not verging on tautological.
A second approach has been to not rely on vague, psychological terms at all, but rather to operationalize the concept of a group as something concrete enough that it can be modeled or studied – which typically involves describing objective behaviors among multiple agents. This formalized study of n-person dynamics has been underway for several decades now – originating from a diverse set of fields including AI (Sandholm, Larson, Andersson, Sherry, & Tohmé, Reference Sandholm, Larson, Andersson, Sherry and Tohmé1999), evolutionary biology (Koykka & Wild, Reference Koykka and Wild2017; Rusch & Gavrilets, Reference Rusch and Gavrilets2020), the evolutionary social sciences (Böhm, Rusch, & Baron, Reference Böhm, Rusch and Baron2018; Glowacki, Wilson, & Wrangham, Reference Glowacki, Wilson and Wrangham2017), primatology (Harcourt & de Waal, Reference Harcourt and deWall1992), international relations (Schelling, Reference Schelling1966), and economics and decision-making (Schelling, Reference Schelling1956). These n-person dynamics have been studied in the lab (e.g., Bornstein, Reference Bornstein2003; Gonzalez, Ben-Asher, Martin, & Dutt, Reference Gonzalez, Ben-Asher, Martin and Dutt2015; Yamagishi, Jin, & Kiyonari, Reference Yamagishi, Jin and Kiyonari1999) and in the field (Bissonnette et al., Reference Bissonnette, Perry, Barrett, Mitani, Flinn, Gavrilets and de Wall2015; De Dreu & Van Vianen, Reference De Dreu and Van Vianen2001), and have been informed by a decades-long enterprise of game theoretic and optimality analyses (e.g., Burani & Zwicker, Reference Burani and Zwicker2003; Gamson, Reference Gamson1961; Mesterton-Gibbons et al., Reference Mesterton-Gibbons, Gavrilets, Gravner and Akcay2011; Shehory & Kraus, Reference Shehory and Kraus1998).
The problem with this second approach, however, is that these concrete implementations have not informed the issue raised by the first approach: namely, what – literally – is being represented in the mind when people are perceiving and reasoning about groups? For the purposes of defining a group, researchers in these areas are typically less concerned with directly assessing people's intuitions of what constitutes a group (as in the first approach), and so simply side-step this issue and declare by fiat that a particular set of behaviors, decisions, or processes constitutes a “group” for their particular purposes (i.e., they posit – either implicitly or explicitly – an operational definition). This declare-by-fiat strategy has afforded great progress, as it allows researchers to use mechanistic techniques (e.g., using agent-based or analytic models; e.g., Goldstone, Roberts, & Gureckis, Reference Goldstone, Roberts and Gureckis2008; Gross & De Dreu, Reference Gross and De Dreu2019).Footnote 1
Nevertheless, the problem of defining a group sits awkwardly with respect to this second approach, as well. Namely, if the problem of defining a group were simply a matter of rendering the notion of a “group” into something concrete, then this second approach leaves us with an embarrassment of riches: There are dozens, if not hundreds, of concrete operationalizations of the concept “group” within this literature – including behaviors in the lab, the field, and the computerized world of agent-based models. Yet none of these seem to feed back onto the intuitive concept that they instantiate, such that we could point to any one concrete implementation or model and say “this is a group.” Or, more accurately, we would have to say that every single one is an instance of a “group” (at least to the intuition of the researcher and the reviewers who allowed the paper to be published as a study on “groups”) – all of which suggests that this second approach relies just as much on the human intuition of what constitutes a group as the first (albeit in a different way). In particular, intuition is being used to denote particular behaviors or dynamics as being about groups.
Consequently, as a scientific community, we have created a vast universe of group tokens – either described psychologically or operationalized concretely – without having a clear idea of what constitutes the type. Moreover, we now have the additional problem of not having a clear diagnosis of the original problem: that the problem of defining a group cannot simply be a matter of rendering the concept into something concrete. If it were, the second, concrete-operationalization approach would have already solved the problem of “What is a group?” long ago.
2. A solution: A computational theory
What is missing from current approaches to the study of groups – and what will finally resolve the issue of what is a group – is to have what the vision scientist David Marr (Reference Marr1982) referred to as computational theories: explicit information-processing theories of what the mechanisms that make up the mind are representing and operating upon when, pre-theoretically, one is reasoning about some phenomenon (which in this case would be “groups”).Footnote 2 Computational theories resolve the long-standing tension between the two approaches described above by (1) explicitly acknowledging the importance of psychological representation – as in the first approach (i.e., they do not naively place the notion of a “group” out in the world, but rather, as a representation within the mind), while at the same time (2) grounding that psychological representation in something concrete, objective, and non-tautological – as in the second approach.
Computational theories highlight the issue of computational adequacy: that if what one has stipulated as going on within some information-processing entity is adequate for creating the phenomenon of interest, without intervention from a god-like agent or intelligence (Chomsky, Reference Chomsky1980; Marr, Reference Marr1982; Minsky, Reference Minsky1961, Reference Minsky1974; Pietraszewski & Wertz, Reference Pietraszewski and Wertz2021; Tooby & Cosmides, Reference Tooby, Cosmides, Barkow, Cosmides and Tooby1992). This issue of computational adequacy differentiates the present approach from what we have described as the second, concrete-operationalization approach to the study of groups. Nearly all of that work takes as its initial starting point the imposition of some sort of predefined structure onto the world – typically a game structure, such as the prisoner's dilemma game (or kindred snow-drift game; see Axelrod, Reference Axelrod1984; Oliver, Reference Oliver1993; Perc & Szolnoki, Reference Perc and Szolnoki2010; Vainstein, Silva, & Arenzon, Reference Vainstein, Silva and Arenzon2007; Worden & Levin, Reference Worden and Levin2007, for reviews). These and other predefined structures render the enterprise tractable, allowing researchers to not have to worry about certain pesky details.
In contrast, on our computational-theoretic approach, we are not allowed to think that the world can come preformed into games or structure. Instead, the representational system itself must impose some sort of framing onto the blooming, buzzing confusion of the real world. That is, the representational system has the job of imposing its own “game” structure onto the real world, and then “playing” an internally represented game, by using cues in the real world to run its simulations. As scientists, we then face the same problem as that faced by the mind: determining what – out of all the things that humans are doing out in the real world – constitute “group” things. Or, in the parlance of AI and philosophy, we are forced to confront the relevance problem (e.g., Dennett, Reference Dennett and Pylyshyn1988).
Colloquially, then, our computational-theoretic approach can be thought of in the following way: We are engineers, and our job is to build a robot that can interface with the real world. The question, “What is a group?”, then becomes a question about what could in principle be a group representation be in such a robot. That is, how does our robot “see” groups? What is the definition of a group within its software?Footnote 3
The proposed computational theory is as follows: The folk-psychological construct “group” corresponds to changes within a framework of event types (what computer scientists and AI researchers would refer to as an event grammar or event calculus; e.g., Mueller, Reference Mueller2015). Within this framework, a group representation corresponds to the probabilistic assignment of agents to particular roles within those event types.
Here, we will consider event types featuring cost impositions. We are thereby addressing what a “group” representation is in the context of conflict. Doing so is not meant to suggest that there are only conflict-based group representations within the mind. Rather, we simply have a plausible account for conflict – which is no trivial matter, as our account must describe what is universally true of all instances of group membership in all instances of real or imagined conflict, all the while describing that representation in its most compressed, reduced form. At the end of the paper, we will speculate as to how to think about other, non-conflict-based event types, and thereby make progress toward a complete computational theory of social groups.
The remainder of the paper will explain the proposed conflict-based event-type computational theory of a group representation – all in a conceptual, non-technical manner – and will try to convey a sense of what kinds of research questions it opens up, all while attempting to head-off a number of likely misunderstandings.
3. Starting with the reduction problem
Any computational theory must first deal with what is called the reduction problem in AI: how to take an apparently unbounded, infinite phenomenon and reduce it to a finite, solvable set (Russell & Norvig, Reference Russell and Norvig1995). For example, vision researchers must postulate a finite set of cognitive primitives that allow a viewer to identify and recognize any and all objects (e.g., Marr & Nishihara, Reference Marr and Nishihara1978). Analogously for groups, we must postulate a finite number of cognitive primitives that allow all tokens of group conflict – from playground fights to international conflict, and everything else in between – to be represented and thereby operated upon within the mind.
A good conceptual starting point for solving the reduction problem for group representation is to start with a broader reduction problem: that of multi-agent (or n-person) conflict, which refers to when more than two agents come into conflict with one another. For decades, the complexity and near-infinite variety of n-person conflict has posed a problem: How can a finite set of mental representations and decision-rules handle the complex dynamics that arise when more than two agents come into conflict with one another (e.g., Byrne & Whiten, Reference Byrne and Whiten1988; Harcourt & deWall, Reference Harcourt and deWall1992; Sherratt & Mesterton-Gibbons, Reference Sherratt, Mesterton-Gibbons, Hardy and Briffa2013)? How, in other words, can the mind “see” all instances of multiperson conflict – of which group-related phenomenon are but a subset?
A solution to this problem was offered recently (Pietraszewski, Reference Pietraszewski2016), and while a solution to this broader reduction problem of n-person conflict does not solve the specific problem of what a group representation is, it does narrow down the scope of possible solutions – in the same way that knowing the answer to the question “Is it bigger than a bread-box?” narrows down the scope of possible solutions in the game of 20 questions. This n-person solution was discovered by asking the question, “What would always be true of n-person conflict over evolutionary time?”Footnote 4 From the multigenerational perspective of evolutionary time, the particular details of conflict-related events will always be different. Who is involved, why, and the nature of the conflict itself will not be stable across multiple generations. Therefore, psychological processes for handling conflict cannot be too-tightly tied to any of these particular details, but rather must operate over a class of abstracted invariances (Jackendoff, Reference Jackendoff1992). Once we strip these ever-changing details away, what remains are the following principles:
• Conflict can be understood as the contingent delivery of costs. An inclusive definition conceptualizes conflict as the contingent delivery of costs between agents; an expectation or realization of the contingency if you do X, I will do Y (Archer, Reference Archer1988; Campbell, Reference Campbell and Buss2015; Daly, Reference Daly and Buss2015; Hardy & Briffa, Reference Hardy and Briffa2013). This conceptualization accommodates exchanging costs in kind – exchanging an eye for an eye and a tooth for a tooth – and accommodates an exchange of different costs. For example, when a child pushes another for stealing a toy, they are exchanging one cost for another, repaying lost enjoyment with physical harm.
• N-person conflict can be decomposed into triadic – but not dyadic – interactions. What sets multi-agent or group conflict apart from dyadic conflict is the involvement of a third party. For example, if A attacks B, and then in response B attacks A, and so on and so forth, this may become a protracted conflict, but it will only ever become a multi-agent conflict if at least one third party, C, becomes involved (Caplow, Reference Caplow1959; Grammer, Reference Grammer, Harcourt and de Waal1992; Harcourt, Reference Harcourt, Byrne and Whiten1988; Harcourt & deWall, Reference Harcourt and deWall1992; Heider, Reference Heider1958; Liska, Reference Liska1962; Patton, Reference Patton1996, Reference Patton, Cronk, Chagnon and Irons2000; Pietraszewski, Reference Pietraszewski2012, Reference Pietraszewski2016; Strayer & Noel, Reference Strayer, Noel, Zahn-Waxler, Cummings and Iannotti1986; Von Neumann & Morgenstern, Reference Von Neumann and Morgenstern1944).
• There are only four ways that a third party can become drawn into a conflict within these triads. If A attacks B, and a third party, C, becomes involved, there are only four possible ways in which this might happen (Chase, Reference Chase1985; Strayer & Noel, Reference Strayer, Noel, Zahn-Waxler, Cummings and Iannotti1986), depicted in Figure 1:
1. Generalization: A attacks B, then A attacks C
2. Alliance: A attacks B, then C attacks B
3. Displacement: A attacks B, then B attacks C
4. Defense: A attacks B, then C attacks A
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20220707061738195-0819:S0140525X21000583:S0140525X21000583_fig1.png?pub-status=live)
Figure 1. The four ways in which a third party (C) can become involved in an otherwise dyadic conflict between A and B. From left to right, A's attack on B is followed by: A also attacking C; C attacking B; B attacking C; C attacking A. These are Generalization, Alliance, Displacement, and Defense, respectively.
These triadic interactions characterize what is always true of n-person conflict. They are the smallest set of “building block” units that can be strung together over time to describe an infinite number of conflict dynamics – similar to the way a finite number of letters can generate an infinite number of linguistic meanings. Past work had used these four triadic interactions as a coding scheme for documenting real-world multi-agent conflict (Chase, Reference Chase1985; Strayer & Noel, Reference Strayer, Noel, Zahn-Waxler, Cummings and Iannotti1986). However, these triadic interaction types – because they describe all of the ways in which a third party may become involved in a conflict – may also be the computational building blocks out of which all n-person conflicts are cognitively represented (Byrne & Whiten, Reference Byrne and Whiten1988; Harcourt & deWall, Reference Harcourt and deWall1992; Sherratt & Mesterton-Gibbons, Reference Sherratt, Mesterton-Gibbons, Hardy and Briffa2013). We will refer to these as triadic primitives.
Crucial to this proposal is that (1) any one agent can occupy each of the 12 different roles over time (i.e., A, B, and C are representational slots into which particular agents are inserted; they are not agents themselves) and (2) that these triadic interactions will string together or concatenate over time as interactions unfold. Consequently, the human mind need not represent an infinite number of conflict dynamics: it need only reason about each of these four interaction types, and then string them together over time as real or hypothetical events unfold. Such an architecture – described at this high level of abstraction of strings of concatenated triadic primitives – solves the reduction problem with respect to n-person conflict, allowing a finite cognitive architecture to represent an infinite variety of conflict dynamics (see Pietraszewski, Reference Pietraszewski2016, for details).
4. The proposed solution: The mind's definition of a “group” in the context of conflict
With the broader n-person reduction problem tentatively addressed, we can turn back to the original problem of characterizing the mind's definition of group in the context of conflict: If the above four triadic primitives describe all multi-agent conflict, then it would seem likely that the concept of “group” is somehow present in these triadic primitives. But how? Strings of triadic primitives do not yet map onto or produce the intuitive concept “group.” How do we go from these triadic primitives to the concept of “group”?
The proposed solution is as follows: a group (in the context of conflict) is a set of specific roles within these four triadic primitives, where role simply refers to one of the three interactant-slots within each triadic primitive (e.g., being A in Generalization). These roles are depicted in Figure 2:
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20220707061738195-0819:S0140525X21000583:S0140525X21000583_fig2.png?pub-status=live)
Figure 2. The roles that define the mental concept “group” in the context of conflict (depicted in red). These we will call group-constitutive roles.
What it means that individuals are members of a group, then, is that:
(i) in generalization, they are in roles B and C
(ii) in alliance, they are in roles A and C
(iii) in defense, they are in roles B and C
(iv) in displacement, they are in roles A and C
These group-constitutive roles (again, with roles referring to the agent and/or patient roles taken within the triadic primitives) ground “choosing one agent over another” and “taking sides” in a conflict into a finite set of mental representations. The precise, casual claims are as follows:
• Cost/benefit calculations within each individual agent produce the above triadic interactions.
• Certain classes of cost/benefit calculations will cause certain agents to occupy the above roles with respect to one another.
• This class of cost/benefit calculations “hang together” over evolutionary time. That is, the relationships between agents that cause them to occupy the above roles with respect to one another constitute an enduring entity over evolutionary time.
• This enduring entity allows for selection to create a summary representation in the mind. This is a mental entry for agents who are in the kind of relationship which causes them to occupy these particular roles with respect to one another.
• That summary representation – precisely, the group-constitutive roles within the four triadic primitives – is our mental concept “group” in the context of conflict.
The claim, then, is that on-the-ground relationships will cause agents to occupy these roles with respect to each other. That there are such relationships creates a conceptual entry or slot in our minds over evolutionary time. What defines this entry – and what is common to all of these relationships – is that agents will occupy these roles with respect to one another.
The order of causation is on-the-ground relationships and cost/benefit calculations first, summary representation second. Individuals are making cost/benefit decisions over evolutionary time, based on a number of factors, such as self-interest, caring about the welfare of another, a formal social contract, and so on. These first-person decisions then create an invariance in the world – a class of events (in this case it is the occupation of group-constitutive roles) – that can in turn select for a conceptual entry that summarizes this class (Shepard, Reference Shepard1994).
This summary representation allows agents to communicate about and refer to this class, both within themselves, and also with others. For instance, an agent can observe their own behavior and motivations, and create a summary representation of who they are in a “group” with, and who they are not. Or they can observe other's behaviors and do the same. Of course, by “agent” we mean a set of information-processing functions (as opposed to a conscious agency within the mind; see Pietraszewski & Wertz, Reference Pietraszewski and Wertz2021). As such, there is no entailment that each (token) summary representation becomes conscious and communicable to others, although undoubtedly a subset does.Footnote 5 These communicable summary representations (which would in principle be highly compressed signifiers) should, when communicated, induce an expectation of the class of events to which they refer. That is, if one is told that X and Y are in a “group” within a conflict, then this should lead to the expectation that X and Y will be more likely to occupy the above roles with respect to one another, all else equal.
5. An example: Understanding group-constitutive roles from the perspective of a particular agent
What does it mean that group-constitutive roles are group membership to the human mind? In the examples that follow, we will consider how group-constitutive roles play out from the perspective of a particular agent. These observations, which are entailments of our computational theory, describe additional elements of the overarching information-processing problem of representing, reasoning about, and carrying out the group-constitutive roles. For instance, we will see that occupying group-constitutive roles requires our focal agent to sometimes be motivated to act, to anticipate the actions of others, and to make certain cost/benefit calculations.
We will begin with Figure 3. In the top panel of Figure 3, a third agent, denoted C, observes A attack B. If this agent is to become involved in the conflict, then some constellation of the four possible triadic interaction types will occur. However, as outside observers who do not know if they share a group membership with either A or B, we have no basis for narrowing down which will occur. In contrast, once we know that the onlooking agent is in a group with A and not with B (Fig. 3, bottom left panel), only two of the four interaction types become likely, and the other two become less likely. In contrast, if the onlooking agent is in a group with B and not with A (Fig. 3, bottom right panel), then the other two triadic interaction types become likely. Thus, the representation of the background relationship – the “group” membership – between the direct interactants (A and B) and the onlooker (C) modifies which behaviors are most likely to follow from A attacking B.Footnote 6
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20220707061738195-0819:S0140525X21000583:S0140525X21000583_fig3.png?pub-status=live)
Figure 3. A single agent (depicted as the stylized figure) observes a cost imposition occurring between A and B (top panel). Without knowing the group memberships of A, B, and the agent, we have no basis for anticipating which, if any, of the four possible triadic interactions are likely to happen as a consequence of A attacking B. However, once the group membership of the three agents is marked (group membership is denoted in red in the bottom two panels), certain triadic interactions become more likely to occur (those that are highlighted), whereas others become less likely to occur (those that are not highlighted).
Notice that for these triadic interactions to occur, the initially uninvolved onlooker must at times behave. For example, if the uninvolved agent C shares a group membership with A, then they will join with A in attacking B (the first highlighted interaction type in the bottom left panel of Fig. 3). This means that the psychological implementation of these group-constitutive roles requires motivations to act: In this case, group membership is a relationship between A and C that is causing C to be more likely to act against B. Notice, too, that C must sometimes also generate expectations of other's actions. For example, C in this example is also more likely to be attacked by B, given that A and C are group members (the second highlighted interaction type in the bottom left panel of Fig. 3). This means that the psychological implementation of these group-constitutive roles also requires producing expectations about others' behaviors.
The interplay between one's own behavior and the expectations of other's behavior can also be seen in Figure 4, which again depicts group membership from the perspective of a single individual, who is now in the role of A, the attacker. What happens after A attacks B will depend on the group membership of the uninvolved onlooker C. If that onlooker is in a group with the attacker, then their presence is a benefit from the attacker's perspective (bottom left panel of Fig. 4); they may also join in attacking the victim, helping diffuse the costs and risks of the conflict. Or the victim may retaliate against the attacker's fellow group member rather than the attacker themselves, thereby diffusing the risk of counterattack. In contrast, if the onlooker is in a group with the victim, the likely next steps are costly to the aggressor (bottom right panel of Fig. 4): The aggressor will either be motivated to also attack the onlooker, requiring additional cost and risk. Or, in response to the initial attack on their fellow group member, the onlooker will in turn attack the aggressor. Neither event benefits the aggressor. Consequently, if aggressor A can anticipate these outcomes before attacking B, then A will be more likely to attack when A's group member is present (right panel), but will be less likely to attack when B's group member is present (left panel). This is the causal logic for why the presence or absence of group members changes decision thresholds for engaging in or avoiding conflict. This example demonstrates the strategic benefits of representing the group membership of uninvolved third parties, and how one's own decision-making is aided by being able to anticipate how others will behave.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20220707061738195-0819:S0140525X21000583:S0140525X21000583_fig4.png?pub-status=live)
Figure 4. The single agent is now in role A – the cost imposer. Again, without knowing group membership we have no basis for anticipating which, if any, of the four possible triadic interactions are likely to happen (top panel). However, once group membership is marked (in red; bottom panels), certain next steps become more likely than others (those highlighted in yellow).
Finally, Figure 5 depicts the perspective of the initial victim. Here, the initially uninvolved onlooker can either be a group member with B (left panel) or with A (right panel). Again, there is a cost/benefit asymmetry between these two scenarios: It will be less costly for the victim to be attacked in the presence of a fellow group member (left panel) than in the presence of a group member of the attacker (right panel). Furthermore, our victim B will sometimes be motivated to act (e.g., attacking C when A and C are group members) and will sometimes anticipate other's actions (e.g., expecting C to attack A on his or her behalf when B and C are group members). For detailed cost/benefit analysis of why these particular events may unfold, see Pietraszewski (Reference Pietraszewski2016, SOM).
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20220707061738195-0819:S0140525X21000583:S0140525X21000583_fig5.png?pub-status=live)
Figure 5. The single agent is now in role B – the victim of the initial cost imposition. Again, with knowing group membership, we have no basis for anticipating which, if any, of the four possible triadic interactions are likely to happen (top panel). However, once group membership is marked (in red; bottom panels), certain next steps become more likely than others (those highlighted in yellow).
6. Clarifying the definition
We can now refine and clarify the definition: Roughly, somewhere in the mind are floating around the representations [group] and [conflict]. When these co-occur, what is bound to this composite [group-in-the-domain-of-conflict] representation are these group-constitutive roles within the triadic primitives. That is, these roles are the mind's operational or functional definition of a group in the context of conflict, and these triads are the skeletal primitives out of which each instance (token) of a group is perceived and built (and we will see in a moment how all of this scales up). In other words – and tautologically, given our definition – to the degree agents belong to the same group, they will be more likely to act according to these roles, and will also anticipate that other group members will do the same.
This definition entails a minimal bound for what the mind considers a group (conflict) dynamic: Minimally, two agents must be within the group, and a single agent must lie outside of it (where “group” is defined non-tautologically by the occupation of group-constitutive roles). For example, if we imagine a conflictual world with only three agents, in which two are allied and a third is not, then we would see over time the two allied individuals occupying each of the roles that define a group when engaging in conflict with the third lone individual. The two in conflict with the lone individual would constitute a group, according to the definition.
Furthermore, the group-constitutive roles define “group” from a third-party perspective. Agents who are not involved in a particular conflict can nevertheless infer these roles by observing the actions of others, and who occupies these roles will define for these third parties who is in a group with whom. Of course, the roles are mental representations and so cannot just be seen (in the same way that an object cannot just be seen; Marr, Reference Marr1982). Therefore, a substantial amount of input analysis is required to go from the detection of on-the-ground behaviors (such as seeing A shooting at B, hearing about X gossiping about Y, and so on) to the inference that these behaviors are an instance of one of these roles; a problem that is far from trivial. Moreover, contingent cost-infliction is often a drawn-out process, with many gaps and lulls between interactions. For example, if you steal my cattle today, it may be a day, a month, or even a year before I retaliate. Therefore, delays between cost-inflictions will have to be allowed for, as will some opacity of who has done what to whom. It follows then that identifying who is in what group requires understanding who has imposed which costs on whom, and in response to what past behaviors. Inferring group membership is then in part an attribution problem.
Crucially, behaviors will not be the only cues used to infer the occupation of the group-constitutive roles. In particular, because the relationships that cause agents to occupy group-constitutive roles with one another constitute an evolutionary invariance (at a particular level of abstraction), natural selection can “see” what cues correlate with these relationships, and can build learning systems that spawn probabilistic group-constitutive role representations in response to them. Consequently, even very sparse cues that do not have much to do with conflict when looked at superficially (e.g., Dunham, Reference Dunham2018; Pietraszewski, Reference Pietraszewski, Banaji and Gelman2013, Reference Pietraszewski, Van Lange, Higgins and Kruglanski2020) can nevertheless probabilistically elicit these group-constitutive role representations. We will refer to these cues as ancillary attributes of groups (ancillary because they are not behaviors corresponding to the group-constitutive roles themselves, but rather are cues that predict the occupation of these roles, either over evolutionary or developmental time).
What attention to ancillary attributes means, is that even in a world in which the only reason to attend to “groups” is to predict conflict-related outcomes, group tokens that superficially have little or nothing to do with conflict will nevertheless be attended to: a point that has interesting consequences for our understanding of the folk concept “group” – namely, it raises the issue of whether or not the concept “group” can ever be divorced from conflict-related representations no matter how apparently peaceful the group token is. In practice, both ancillary attributes and behaviors corresponding to group-constitutive roles are likely to be present in the world. An additional information-processing problem is to then link up these diverse representations to the correct group token.
In sum, the information-processing within the mind that makes it possible to consider, plan, and generate the above group-constitutive roles is what constitutes the psychology of groups-in-conflict. Furthermore, this psychology is recursive in the sense that I not only expect group members to occupy these roles, I expect that others will also expect this, and so on. Finally, we have so far been treating behavior as all-or-none – that if someone is a group member with someone else, they will produce behaviors X, Y, and Z. Of course, this is an oversimplification, so more fleshed out computational theories can incorporate something like a continuously scaled threshold to act, as part of a larger “motivational” goal-tree/difference-reduction architecture.
7. Scaling up
The architecture for representing each of the triadic primitives is based upon interactions between individual agents. However, at times, multiple agents will all behave the same way within a particular triadic primitive (e.g., a set of agents will all occupy role C in Defense). The architecture can therefore scale up the representation by inserting (or substituting) that entire set of agents into the single-agent slot (the group-constitutive role). This one-to-many substitution allows the same (or a slightly modified) architecture to handle much larger numbers of agents simultaneously. Moreover, to the degree that that particular set of agents are in a group – per the definition – they are permitted to be inserted into all of the agent slots within all of the triadic primitives depicted in Figure 2, which enables an infinitely hierarchical scaling up of nested group representations. This kind of architecture allows us to represent, for example, that Japan and Germany occupied group-constitutive roles with respect to one another during World War II, as did Britain and the United States, without having to calculate the actions of the millions of individuals involved (see also Fiske & Neuberg, Reference Fiske and Neuberg1990; Insko & Schopler, Reference Insko, Schopler and Hendrick1987).
Of course, any one of the roles within these triadic primitives may also be produced for reasons other than group membership. For example, an agent, C, who witnesses A attacking B, may retaliate against A, not because she is a group member with B, but because A's attack was morally reprehensible, and should be punished. Indeed, even if multiple agents all decided to attack B, thus filling the role C in Defense, this would still not constitute a group. Groups, instead, are a special class of relationships in which all of the above roles either will occur, or are expected to occur. That is, group membership applies to all of the triadic primitives. If only a subset are shown to hold, this should hurt the representation that such a relationship is a group.
For example, if we were to demonstrate that all of the agents filling role C in the above example are not willing to place themselves in any of the other group-constitutive roles with respect to A, aside from this one case, then they would not be seen as constituting a group with A. That is, they are not group members precisely because these agents do not occupy all of the above roles with respect to one another. Indeed, different collections of agents can occupy some, all, or none of the above group-constitutive roles with respect to one another. This variation must be represented by the mind, and this represented variation may explain why there is a perceived continuum of “groupishness” or entitativity across different collections of people (Campbell, Reference Campbell1958; Hamilton et al., Reference Hamilton, Sherman and Castelli2002; Ip et al., Reference Ip, Chiu and Wan2006; Pietraszewski, Reference Pietraszewski2016) – an idea that is imminently testable.
8. The utility of the definition
So what does having a plausible theory of a group representation in the context of conflict buy us?
8.1. Ancillary attributes
First, the present account suggests that, for a better part of a century, psychologists have been studying groups largely through the lens of ancillary attributes that groups tend to have, rather than by directly studying the fundamental behaviors (or more precisely, the intergenerationally recurrent dynamics) that constitute group membership itself. Ancillary attributes, to review, are those things that help diagnose the occupation of group-constitutive roles: they are cues that predict or correlate with these roles (either over evolutionary or developmental time), but are not the on-the-ground cost-imposing behaviors that correspond to the group-constitutive roles themselves.
Allport, for example, defined a group as “two or more persons who are assembled to perform some task, to deliberate upon some proposal or topic of interest, or to share some affective experience of common appeal” (Reference Allport1924, p. 260). The problem with this definition is that groups can exist without assembly, deliberation, or a shared affective experience. Now of course it may be true that these things do tend to co-occur with the formation and maintenance of groups (precisely: the classes of relationships that we are referring to as groups), and our psychology should take advantage of this fact – but they are not necessary. In fact, it is trivial to think of counter-examples to such ancillary-attribute-based definitions: A collection of students who are all assembled to take a test are less of a “group” than are a set of students all assembled together to burn the books of the locally scapegoated outgroup, for example. The latter behaviors diagnose a disposition toward behaving toward that outgroup in a way consistent with our definition, whereas the former does not. Other examples of ancillary attributes include spatial proximity, overall similarity along some dimension, and so on.Footnote 7
Historically, psychological theorizing on what defines or constitutes group membership has been largely focused on these ancillary attributes (e.g., Campbell, Reference Campbell1958) – undoubtedly because they are visible. Much of the same can be said for empirical research (e.g., Pietraszewski, Curry, Peterson, Cosmides, & Tooby, Reference Pietraszewski, Curry, Peterson, Cosmides and Tooby2015; Powell & Spelke, Reference Powell and Spelke2013). This is not to say that these ancillary attributes aren't important. They are. Instead, they are simply not always necessary nor sufficient, meaning that in a deep sense that they diagnose group membership, but they are not what group membership is fundamentally about. Notably, if one were to observe a collection of people who share the same arbitrary marker or “affective experience,” for example, but who do not occupy group-constitutive roles within a conflict, then that marker and that experience will not be seen as constituting group membership. In contrast, the present definition is sufficient on its own: Even if a collection of people have no ancillary attributes – for example, are not physically proximate and have no other similarity or shared emotional experience – they will nevertheless be viewed as a group, so long as they are shown to occupy each of the roles in our definition.
This focus on ancillary attributes is no accident. Rather, it is a consequence of a scientific history of relying on intuition: Intuition highlights what is variable about groups (who belongs to what group, and what individual tokens of groups exist) while blinding us to what is universal about group memberships (what constitutes a group, and what is done within cognitive architecture once a group is detected). This is why group psychology has been largely treated as a categorization process – as if the assignment of agents to groups were the main issue – whereas in fact conceptualizing what exactly agents are being assigned to is the far deeper and more fundamental issue.
8.2. Computational adequacy – moving beyond containment metaphors
Second, having an explicit computational theory forces us to address issues of computational adequacy – something missing in the group literature until now. (Computational adequacy, to review, is concerned with asking if the information-processing proposed is sufficient to account for the observed characteristics of a particular phenomenon.)
One consequence of this fact is that nearly all theorizing about groups has relied (either explicitly or implicitly) on a containment or subsumption metaphor – that groups are containers into which individual members are placed (for the reasons just described above). In contrast, the present account suggests that group membership is a relational property (specifically, the relative strength of pairwise comparisons among agents; DeDeo, Krakauer, & Flack, Reference DeDeo, Krakauer and Flack2010; Perry, Barrett, & Manson, Reference Perry, Barrett and Manson2004). On this account, who “belongs” to what group is borne out of a calculation of the relative relationships among the agents involved (which is why – to put it pre-theoretically – who one feels closest to depends on who else is around).
Moving beyond the containment metaphor is important because the metaphor fails to account for even the most basic characteristics of social groupings – such as that when multiple, nested groups exist, switching between group identities can and frequently does occur. For example, two populations may be on opposing sides during a civil war (i.e., each side will occupy non-group-constitutive roles with respect to one another). But if both populations are then attacked by a larger common enemy, the two populations may subsequently occupy group-constitutive roles with respect to each other against that common enemy – meaning that the two populations that had not been in the same group now are.
Although an obvious example, containment metaphors cannot account for why group membership is a function of who is interacting: When taken literally as information-processing accounts, they simply produce assignments of each agent to each category. There is no mechanism to switch between the containers, nor to inhibit one in a favor of another (see also Liska, Reference Liska1962; Von Neumann & Morgenstern, Reference Von Neumann and Morgenstern1944). In contrast, the triadic primitive architecture here performs relational pairwise comparisons among triads of agents, calculating which two are more similar among the agents represented within the triad (which is the crucial step that allows agents on opposing sides in the civil war to be represented as members of the same group for the purposes of interacting with the larger common enemy – a calculation strictly speaking not possible with the containment metaphorFootnote 8).
In contrast, the present definition anticipates these shifts and extracts meaning from them: Which group membership (or identity) is “relevant” or “active” is that which is currently aligned with the group-constitutive roles being currently expressed. Each group representation necessarily carries with it a set of behavioral expectations directed toward specific others, and each individual can be a member of any number of different groups. Therefore, which group membership is relevant at any one time is a function of with whom the agent is interacting.
Furthermore, agents may not be able to resolve all of their group memberships – in the sense that some group representations can be activated simultaneously, producing incompatibilities about which group membership should be the basis of one's current behavior. The metaphorical language of feeling “torn” about opposing group loyalties occurs in such cases because each group membership implies a specific class of behaviors directed at particular others. For example, an agent in role C, observing A attack B, may be a member of one group that would require attacking B, and another that would require attacking A. “Feeling torn” then corresponds to when an agent belongs to at least two groups that would, according to our definition, require acting in different ways toward those others. Group-constitutive roles specify what representations agents who have conflicting group loyalties are wrestling with, and must make a decision about. The definition thereby allows us to conceptualize such felt conflicts as a class of information-processing outcomes.
8.3 Integrating group and individual decision-making, and accounting for what social identities are and why they matter
Third, by defining group membership as changes to internally represented thresholds for producing and/or expecting cost impositions, our definition allows us to integrate the concept of “group membership” with an individual's other cost/benefit calculations and decision-making. This has not been possible before. For example, theorizing that relies on the metaphorical language of individuals becoming “bonded” or “fused” to a group evokes the right idea of an agent acting on behalf of a particular identity, but cannot be quite literally correct, because the agent and the group never become exactly the same entity.Footnote 9 Even young children understand that allies are not voodoo dolls, in that they do not simultaneously experience all of the same internal states simply because they are bonded or associated together (Pietraszewski & German, Reference Pietraszewski and German2013). Adults, furthermore, understand that one's social alliances can be in conflict with one's own independent cost/benefit evaluation of a particular situation (Pietraszewski & German, Reference Pietraszewski and German2013; Shaw, DeScioli, Barakzai, & Kurzban, Reference Shaw, DeScioli, Barakzai and Kurzban2017). Certain theories argue that morality itself exists specifically for this category of eventualities – freeing agents from their own alliance obligations and allowing them to impartially evaluate events from the perspective of disinterested third parties (DeScioli & Kurzban, Reference DeScioli and Kurzban2009, Reference DeScioli and Kurzban2013).
Until we can conceptualize group membership as a set of continuous decisions on the part of individuals, it is not possible to study how conflicting interests within an individual, or between individuals within a group, are integrated and resolved within the mind (a rather serious deficiency – as this is one of the central elements of what we want to understand when we study group conflict). Past accounts that treat groups with a containment metaphor do not have a literal model of how individual decisions and groups relate to one another (nor how different group memberships are resolved – e.g., how one chooses between tribal and religious loyalties). On these past accounts, it is of course possible to note that conflicts of interest and disloyalty exist, but such phenomena do not fall out of the information-processing logic of what a group consists of in the first place, and as such fail to be accounted for. In contrast, if we understand “group” to emerge out of individual decisions within sets of triadic interactions, these conflicts of interest and a number of other phenomena fall out naturally (see Pietraszewski, Reference Pietraszewski2016).
A similar issue arises with respect to the phenomenon of social identity, which is arguably one of the most important contributions to come out of social psychological research in the twentieth century. Briefly, early theorizing about intergroup conflict assumed that conflict would be based, roughly, around conflicts of interest. This was called realistic conflict theory (e.g., Levine & Campbell, Reference Levine and Campbell1972). What social identity theorists discovered was that objective reasons for conflict were not necessary to spawn “groupishness,” even in the lab (e.g., Sherif et al., Reference Sherif, Harvey, White, Hood and Sherif1961; Tajfel, Reference Tajfel1970, Reference Tajfel1982; Tajfel & Turner, Reference Tajfel, Turner, Austin and Worchel1979, Reference Tajfel, Turner, Worchel and Austin1986; Turner, Hogg, Oakes, Reicher, & Wetherell, Reference Turner, Hogg, Oakes, Reicher and Wetherell1987). Instead, people seemed to intrinsically value social relationships, and were willing to pay a cost to “have” them (i.e., they would coordinate their actions according to abstract identities, rather than to objective aspects of the pay-out structure of the world; for analysis and reviews of this family of theories and findings, see Bar-Tal, Reference Bar-Tal2001; Brewer, Reference Brewer and Bar-Tal2001; Dawes, Van De Kragt, & Orbell, Reference Dawes, Van De Kragt and Orbell1988; Dunham, Reference Dunham2018; Park & Van Leeuwen, Reference Park, Van Leeuwen, Zeigler-Hill, Welling and Shackelford2015; Pietraszewski, Reference Pietraszewski, Van Lange, Higgins and Kruglanski2020; Tajfel, Reference Tajfel1982).
An adequate account of the psychology of groups must account for social identities (meaning that agents can have them, and can represent them in others). Yet it has been rather difficult to integrate social identities in literal, mechanistic models of on-the-ground social interaction (for valiant attempts, see Smaldino, Pickett, Sherman, & Schank, Reference Smaldino, Pickett, Sherman and Schank2012). For the most part, models have either assumed away social identities or have added them ad hoc, such that their importance does not emerge from the structure of the individual interactions within the model, but is declared by fiat.
In contrast, the present proposal articulates what causal role social identities play within the mind's information-processing: they are placeholders that cause agents to treat each other as substitutable with respect to one another within the triadic primitives. That is, a social identity is a mental representation (real or imagined) that leads to the scaling up from a single agent to a set of agents within one or more of the agent slots within the triadic primitives. For instance, if a person believes that an attack was caused by a single individual, the contingent response to that attack will only be directed at that individual. However, if a person believes that an attack was caused by a social identity, then that person may retaliate against others also holding that same social identity, even if they were not involved in the initial attack (because from their perspective, the two agents holding the same social identity are substitutable with one another).
This framework allows us to understand why social identities are of value, and why people are willing to work so hard to maintain and change other's representations of them (e.g., Haslam, Reicher, & Platow, Reference Haslam, Reicher and Platow2011): Representations of who and who does not belong to the same social identity become powerful determinants of how particular conflicts spread, and who retaliates against whom. It is this down-stream effect of extending a conflict out onto many more previously uninvolved agents that explains why a conflict involving social identities activates additional attribution processes (Brewer, Reference Brewer and Bar-Tal2001; Dawes et al., Reference Dawes, Van De Kragt and Orbell1988) and produces more fraught and protracted conflict dynamics than those that do not (Abrams & Rutland, Reference Abrams, Rutland, Levy and Killen2008; Bar-Tal, Reference Bar-Tal2001; Puurtinen, Heap, & Mappes, Reference Puurtinen, Heap and Mappes2015; Walter, Reference Walter2004, Reference Walter2009).
Moreover, individuals can hold different perceptions of what social identities exist, even when directly interacting with one another. For example, history is replete with bloodied colonial powers assuming away the heterogeneity of local, native social identities and treating everyone as if they are the enemy (such as retaliating against villagers for raids by rebel groups in the forest), which eventually becomes a self-fulfilling prophecy. The current framework allows us to understand what it means, precisely, that different agents hold these diverging representations, and what work, in principle, these representations are doing in each of their minds.
8.4 Accounting for group emergence
Fourth, a literal notion of what it means for each agent to represent themselves and others as belonging to the same group within a conflict offers a principled way to think about group emergence. For example, the current proposal suggests that groups (or more precisely: increases in the degree to which agents are probabilistically assigned to group-constitutive roles) can emerge out of individual actions, such that their existence may be unplanned until some event transpires.
For instance, suppose that we are one of a collection of people waiting around for a bus. This collection is not yet, on any meaningful definition, a “group.” But suddenly someone drives by and throws a stone at one of us. We can either (i) throw a stone back at the car (now intuitively “more” of a group, as we are occupying role C in Defense), (ii) hit the victim (a different group, as we are occupying role C in Alliance), or (iii) do nothing (non-group).
Before we act, each one of these possibilities is equally possible and realizable. Yet it is only when we act (or, perhaps, make a decision to act) that certain groupings now become more likely going forward. Moreover, we must also coordinate – a single decision (i.e., a single triadic interaction type) does not make a group, particularly if no one else agrees with our assessment. For example, we may throw a stone at the car, believing that we are now in a group with the victim. Whereas the victim turns and hits us, contradicting our assessment. Certain behaviors can therefore be seen as bids for establishing group memberships, precisely because group memberships are in a deep sense a set of behaviors.Footnote 10
While this stone-throwing example is oversimplified, something roughly analogous to this de-centralized dynamic often plays out in the genesis of real-world groupings. The current framework, therefore, suggests a number of research programs with the goal of viewing bids at group formation – such as political rhetoric and narratives of historical grievances (e.g., Lopez, Reference Lopez2020; Moncrieff & Lienard, Reference Moncrieff and Lienard2019) – through the lens of the triadic primitives and group-constitutive roles.
9. Next steps
9.1 An engineering approach to the study of groups, and a more in-depth task analysis
We can now return to our hypothetical from the beginning of the paper, in which an AI engineer approaches us and wants to build a machine intelligence capable of “groups.” What exactly would we tell them? Our computational theory of a group in the context of conflict now provides them with a clearer, more literal set of information-processing functions to be implemented in mechanisms. These include
• Machinery for stringing together long chains of concatenated triadic primitives, in which the same agents are able to be assigned to different agent slots within each triadic primitive (Bob may be in role C in one, A in another). These chains can be thought of as a path through a much larger triadic state space (e.g., see Pietraszewski, Reference Pietraszewski2016). These chains describe the most likely interactions that will occur among agents as a conflict unfolds, and are potentiated by (and thus are tagged to) agent-based frames and event-based frames (the former being concerned with assessing what will happen if particular agents interact, whereas the latter is concerned with assessing what will happen if a particular event occurs.)
• Machinery for storing and making use of “group” representations, which (as shown in the examples of Figs. 3–5) collapse large portions of this triadic state space, modifying which paths are most likely. That is, group representations are not containers, they are representations (namely, modifiers) that modify what would otherwise be the defaults of this triadic state space, potentiating certain areas of this space, and curtailing others.
• Machinery for updating group representations and most-likely triadic-state-space representations, based on ongoing events in the world (e.g., the actions that I take and the actions that others take).
• Machinery for generating agent-based and event-based counterfactuals (e.g., what would happen if I took action X, what would happen if Bob took action X, what would happen if I run into Bob, what would happen if Bob runs into Bill, and so on), the outputs of which should be stored in a manner to allow for easy/random access, and would in principle feed into motivational and planning systems.
The above are just a handful of the minimal information-processing requirements for something like a set of mechanisms capable of using group representations as defined in the current proposal. There must also be mechanisms for scaling up and scaling down based on social identities, for quarantining counterfactuals from actual events, for translating cues in the environment into probabilistic group-constitutive role representations (and for translating these representations into concrete behavioral predictions), for identifying disloyalty and distinguishing between disloyalty toward a group and the non-existence of a group, and so on. This is exactly what we want: The point of having a computational theory is that it reveals information-processing problems to be solved.
The information-processing requirements (or problems) outlined above represent a core element of humans' everyday commonsense knowledge – an issue of central importance within current AI (e.g., Etzioni, Reference Etzioni2018; Mueller, Reference Mueller2015). For example, it is obvious why two complete strangers who have never met – someone wearing an axis uniform who was born in Marburg and someone wearing an allied uniform who was born in Iowa city – are shooting at each other when we look at grainy footage from World War II. Just as it is blindly obvious that someone is taking their safety in their hands by cheering for Manchester United at Liverpool or for the Yankees at Fenway. Yet no robot or AI currently holds such intuitions. This is because these intuitions are the outputs of the above-listed information-processing steps, which can be thought of as a series of social inference engines (akin to a physics engine; Ullman, Spelke, Battaglia, & Tenenbaum, Reference Ullman, Spelke, Battaglia and Tenenbaum2017), without which an appreciable swath of commonsense knowledge about the social world would not be possible.
The above-listed information-processing steps are also an important part of the solution to the frame problem within the social domain (i.e., for representing what will change in the world when action X is performed; Kamermans & Schmits, Reference Kamermans and Schmits2004; for an initial empirical exploration, see Pietraszewski & German, Reference Pietraszewski and German2013) – and something like these inference engines will need to be implemented in some kind of event calculus (Mueller, Reference Mueller2015) or any other kindred architecture that permits deduction (i.e., prediction and explanation) while avoiding monotonicity (i.e., “combinatorial explosion”: Minsky, Reference Minsky1974). The event types presented here (a verbal and graphical description involving contingent cost imposition) are but a sliver of a fully computationally adequate account, and so our description of the event calculus itself is thus far highly cursory. Nevertheless, the goal here is to provoke the fleshing out everything that would be needed (i.e., a task analysis).
One of the primary goals of this paper, then, is to invite the broadest possible community of researchers – particularly those who work in AI, software engineering, computer and cognitive science, and so on – into the study of social groups, and allow them to ask (in collaboration with researchers already studying social groups): What are all the things that one would need to put into an information-processing entity, such that it could go out into the real world, observe and interact with that world, and then generate and use the particular representational system presented in this current proposal? There are likely hundreds, if not thousands of information-processing functions and subfunctions required to make all of this happen. Enumerating what these functions are will provide us with a computational ontology – a set of concepts that describe information-processing entities and their relationships with one another (e.g., Mueller, Reference Mueller2015; Russell & Norvig, Reference Russell and Norvig1995), and will define what we should be looking for within the mind (or implementing in automata). A complete understanding of the psychology of social groups will require proposing, testing, and then either confirming, refining, or correcting every single element of this ontology (along with many others) – an explicitly computational-theoretic approach that will constitute a mutually informative Cognitive Psychology and Cognitive Science of social groups. Despite the incredible social importance of understanding the psychology of group-based conflict, it is not clear that any such research program is currently underway.
9.2 Empirical predictions, methods, and outstanding questions
On the experimental side, our computational theory makes a number of empirical predictions that await testing. For example, the behaviors that correspond to what we have called group-constitutive roles should be inputs that lead to the perception of groups-in-conflict. That is, if one were to show instances of cost-infliction behaviors among triads of agents, then participants should perceive those agents occupying group-constitutive roles as being members of the same group. Likewise, group-constitutive roles should be also outputs, in that they should be expected patterns of behavior once a summary representation is communicated. For example, if we tell participants that Harry, Ron, and Hermione are all in a group, and that they are in conflict with Malfoy, Crabbe, and Goyle, who are all also in a group, then participants should be able to predict which roles these agents will occupy with respect to one another. In other words, the logic of the definition should be intuitive to human participants, just as Figures 3–5 should have been intuitive to the reader. Furthermore, the use of these group-constitutive roles as both inputs and outputs should be universal and hold for all humans on earth.
The relationship between more direct cues (i.e., behaviors that clearly denote group-constitutive roles, or verbal labels referring to a known summary representation) and what we are calling ancillary attributes of group membership will also be important to establish. That is, ancillary attributes or cues (i.e., things that are concomitants of the occupation of the group-constitutive roles, such as similarity, proximity, etc.) should be used to probabilistically infer group membership in the absence of more direct cues. A number of as-yet unanswered questions then arise. For example, how do direct cues fare against ancillary cues? In principle, ancillary cues should be used when no other direct cues are present, but should be secondary to direct cues.
Another question is what happens when ancillary cues are pitted against direct cues. The current proposal predicts that direct cues should corrode the validity of ancillary cues. How, in turn, are ancillary cues learned, and do some occur independently of learned associations? For instance, some attributes such as proximity and shared opinions may be intrinsically linked to guesses about group membership (e.g., Gershman & Cikara, Reference Gershman and Cikara2020; Gershman, Pouncy, & Gweon, Reference Gershman, Pouncy and Gweon2017; Lau, Gershman, & Cikara, Reference Lau, Gershman and Cikara2020; Pietraszewski, Reference Pietraszewski, Banaji and Gelman2013), whereas other attributes – such as how one ties one's shoe laces, which is sometimes used by gang members – may depend much more on social context. The proposal here would predict that arbitrary similarities should become interesting to the mind and be seen as “group” markers insofar as they track or correspond to group-constitutive roles (see also sect. 6, “Clarifying the definition,” above). Indeed, there is already some experimental evidence for some of these predictions floating around in the past literature (e.g., Bar-Tal, Reference Bar-Tal2001; Cikara, Bruneau, Van Bavel, & Saxe, Reference Cikara, Bruneau, Van Bavel and Saxe2014; Cikara, Van Bavel, Ingbretsen, & Lau, Reference Cikara, Van Bavel, Ingbretsen and Lau2017; Ip et al., Reference Ip, Chiu and Wan2006; Lau, Pouncy, Gershman, & Cikara, Reference Lau, Pouncy, Gershman and Cikara2018; Patton, Reference Patton1996, Reference Patton, Cronk, Chagnon and Irons2000; Pietraszewski, Reference Pietraszewski, Banaji and Gelman2013; Pietraszewski et al., Reference Pietraszewski, Curry, Peterson, Cosmides and Tooby2015; Rhodes & Chalik, Reference Rhodes and Chalik2013), but the full weight and entirety of these predictions remain to be tested.
Finally, because the present proposal is explicit about what the end-state representation in the mind needs to be (that a “group” token out in the world become yoked to either direct or ancillary cues, which are in turn yoked to group-constitutive roles), a principled learnability analysis of the social environment can be conducted. That is, one can understand when and why particular tokens need to be apprehended via ancillary cues (e.g., because the direct cues are infrequent, dangerous to observe, and so on).
There are also a number of unresolved empirical issues brought up by the present proposal. For instance, a cost/benefit analysis of each of the group-constitutive roles within the triadic primitives suggests that some roles will be more costly than others. For example, defending a victim against an aggressor is less costly than helping an aggressor attack a victim, all else equal (for the explanation, see Pietraszewski, Reference Pietraszewski2016). This means that within the group-constitutive roles, some may be more likely than others. Therefore, are some more anticipated than others? And are those that are more costly the least expected? Asymmetries between offense and defense, for example, have been found (Böhm, Rusch, & Gürerk, Reference Böhm, Rusch and Gürerk2016; De Dreu et al., Reference De Dreu, Gross, Méder, Giffin, Prochazkova, Krikeb and Columbus2016; De Dreu & Gross, Reference De Dreu and Gross2019; Lopez, Reference Lopez2017; Rusch, Reference Rusch2013, Reference Rusch2014a, Reference Rusch2014b), and the present proposal suggests a number of new comparisons to test in future studies. Likewise, the relationship between the different group-constitutive roles should also be probabilistic – meaning that seeing two agents occupying group-constitutive roles in one triadic primitive may increase the expected probability that they will also occupy group-constitutive roles within another triadic primitive (see also sect. 8.4, “Accounting for group emergence”). The mind, in other words, is expected to guess about group membership based on partial information – an idea that is imminently testable.
10. Toward a complete computational theory of social groups – including non-conflict-based representations
Finally, because agents can also coordinate and cooperate with one another – and there is nothing in our conflict-based event types to allow for the representation of such events – the current theory of what constitutes a group representation is not yet complete. Therefore, even if correct, the present account will need to be supplemented with (at minimum) additional event types – and there are a number of existing theories and taxonomies that can be repurposed as plausible hints at what these types might be (e.g., Alexander, Reference Alexander1987; Balliet, Tyber, & Van Lange, Reference Balliet, Tybur and Van Lange2017Footnote 11; Heider, Reference Heider1958; Tatone, Geraci, & Csibra, Reference Tatone, Geraci and Csibra2015).
Although the current proposal is silent on what these might be, we can at least speculate that a plausible (if not obvious) set of candidate event types for something like polyadic cooperation might be something akin to sets of direct reciprocity (A gives to B, B repays in kind over time) and indirect reciprocity (A gives to B, as a consequence, B gives to C; see Alexander, Reference Alexander1987). However, this is just speculation, and there may need to be additional elements to make these computationally adequate, or something more specific may be necessary – issues that await future work. What we can say with some certainty is that neither direct nor indirect reciprocity can account for the current conflict-based event types (i.e., these are not isomorphic with respect to one another)Footnote 12 – which means that a complete computational theory of groups will likely be comprised of multiple event-type frameworks.
In the search for a complete computational theory of group representation, care will have to be taken to conceptually-distinguish between two different enterprises: The first is to provide a computationally adequate account of every group token. In this respect, each token will likely be a composite of a number of event types (in the same way that an object representation is a composite of visual information-processing types, like lines and colors). A second enterprise is to explain why people can think about and understand the over-arching category “group,” and why they agree about a continuum of groupishness across these tokens. The first enterprise will require a proliferation of types. The second will require some unification or integration of those types. There are many more reduction problems to be solved, in other words, than just the problem of multi-agent conflict dynamics covered here. The principle of computational adequacy – combined with considerations of what is both necessary and sufficient – will have to guide these future efforts.
11. Summary and conclusion
Despite an enormous literature on groups and group dynamics, little attention has been paid to explicit computational theories of how the mind represents and reasons about groups. The goal of this paper has been, in a conceptual, non-technical manner, to propose a simple but non-trivial framework for starting to ask questions about the nature of the underlying representations that make the phenomenon of social groups possible – all described at the level of information-processing. This computational theory, when combined with many more such theories – and followed by extensive task analyses and empirical investigations – will eventually contribute to a full accounting of the information-processing required to represent, reason about, and act in accordance with group representations.
Acknowledgments
The author would like to thank Mina Cikara, Carsten De Dreu, Florian van Leeuwen, William von Hippel, and several anonymous reviewers for helpful comments on previous versions of this manuscript.
Financial support
The author was supported by the Max Planck Society while preparing this manuscript.
Conflict of interest
None.
Target article
Toward a computational theory of social groups: A finite set of cognitive primitives for representing any and all social groups in the context of conflict
Related commentaries (29)
A neuroscientific perspective on the computational theory of social groups
Advantages and limitations of representing groups in terms of recursive utilities
Are we there yet? Every computational theory needs a few black boxes, including theories about groups
Beyond folk-sociology: Extending Pietraszewski's model to large-group dynamics
Can group representations based on relational cues warrant the rich inferences typically drawn from group membership?
Coalitionary psychology and group dynamics on social media
Compassion within conflict: Toward a computational theory of social groups informed by maternal brain physiology
Conciliation and meta-contrast are important for understanding how people assign group memberships during conflict situations
Developmental antecedents of representing “group” behavior: A commentary on Pietraszewski's theory of groups
Group? What group? A computational model of the group needs a psychology of “us” (not “them”)
How do we know who may replace each other in triadic conflict roles?
Interacting with others while reacting to the environment
Internal versus external group conflicts
Latent structure learning as an alternative computation for group inference
Learning agents that acquire representations of social groups
More than one way to skin a cat: Addressing the arbitration problem in developmental science
On vagueness and parochialism in psychological research on groups
Paranoia reveals the complexity in assigning individuals to groups on the basis of inferred intentions
Private versus public: A dual model for resource-constrained conflict representations
Psychological and actual group formation: Conflict is neither necessary nor sufficient
Shadow banning, astroturfing, catfishing, and other online conflicts where beliefs about group membership diverge
Shared intentionality and the representation of groups; or, how to build a socially adept robot
Signals and cues of social groups
Social groups and the computational conundrums of delays, proximity, and loyalty
Societies and other kinds of social groups
The labelled container: Conceptual development of social group representations
Towards a computational network theory of social groups
Triadic conflict “primitives” can be reduced to welfare trade-off ratios
Using laboratory intergroup conflict and riots as a “stress test”
Author response
More “us,” less “them”: An appeal for pluralism – and stand-alone computational theorizing – in our science of social groups