Hostname: page-component-745bb68f8f-lrblm Total loading time: 0 Render date: 2025-02-11T15:06:02.982Z Has data issue: false hasContentIssue false

TTOM in action: Refining the variational approach to cognition and culture

Published online by Cambridge University Press:  28 May 2020

Samuel P. L. Veissière
Affiliation:
Division of Social and Transcultural Psychiatry, Department of Psychiatry, McGill University, Montreal, Quebec, CanadaH3A 1A1. samuel.veissiere@mcgill.camaxwell.ramstead@mcgill.calaurence.kirmayer@mcgill.ca Culture, Mind, and Brain Program, McGill University, Montreal, Quebec, CanadaH3A 1A1 Department of Anthropology, McGill University, Montreal, Quebec, CanadaH3A 2T7
Axel Constant
Affiliation:
Culture, Mind, and Brain Program, McGill University, Montreal, Quebec, CanadaH3A 1A1 Charles Perkins Centre, The University of Sydney, Sydney, New South Wales, Australia2006. axel.constant.pruvost@gmail.com Wellcome Centre for Human Neuroimaging, University College London, London, WC1N 3AR, UK. k.friston@ucl.ac.uk
Maxwell J. D. Ramstead
Affiliation:
Division of Social and Transcultural Psychiatry, Department of Psychiatry, McGill University, Montreal, Quebec, CanadaH3A 1A1. samuel.veissiere@mcgill.camaxwell.ramstead@mcgill.calaurence.kirmayer@mcgill.ca Culture, Mind, and Brain Program, McGill University, Montreal, Quebec, CanadaH3A 1A1 Wellcome Centre for Human Neuroimaging, University College London, London, WC1N 3AR, UK. k.friston@ucl.ac.uk
Karl J. Friston
Affiliation:
Wellcome Centre for Human Neuroimaging, University College London, London, WC1N 3AR, UK. k.friston@ucl.ac.uk
Laurence J. Kirmayer
Affiliation:
Division of Social and Transcultural Psychiatry, Department of Psychiatry, McGill University, Montreal, Quebec, CanadaH3A 1A1. samuel.veissiere@mcgill.camaxwell.ramstead@mcgill.calaurence.kirmayer@mcgill.ca Culture, Mind, and Brain Program, McGill University, Montreal, Quebec, CanadaH3A 1A1 Department of Anthropology, McGill University, Montreal, Quebec, CanadaH3A 2T7

Abstract

The target article “Thinking Through Other Minds” (TTOM) offered an account of the distinctively human capacity to acquire cultural knowledge, norms, and practices. To this end, we leveraged recent ideas from theoretical neurobiology to understand the human mind in social and cultural contexts. Our aim was both synthetic – building an integrative model adequate to account for key features of cultural learning and adaptation; and prescriptive – showing how the tools developed to explain brain dynamics can be applied to the emergence of social and cultural ecologies of mind. In this reply to commentators, we address key issues, including: (1) refining the concept of culture to show how TTOM and the free-energy principle (FEP) can capture essential elements of human adaptation and functioning; (2) addressing cognition as an embodied, enactive, affective process involving cultural affordances; (3) clarifying the significance of the FEP formalism related to entropy minimization, Bayesian inference, Markov blankets, and enactivist views; (4) developing empirical tests and applications of the TTOM model; (5) incorporating cultural diversity and context at the level of intra-cultural variation, individual differences, and the transition to digital niches; and (6) considering some implications for psychiatry. The commentators’ critiques and suggestions point to useful refinements and applications of the model. In ongoing collaborations, we are exploring how to augment the theory with affective valence, take into account individual differences and historicity, and apply the model to specific domains including epistemic bias.

Type
Authors' Response
Copyright
Copyright © The Author(s), 2020. Published by Cambridge University Press

We are grateful to the commentators for their critiques, challenges, and elaborations of our model of human cognition, action, and cultural learning called Thinking Through Other Minds (TTOM). Several commentators provided arguments and examples that address points raised by others, which suggests – to our great satisfaction – that the TTOM model is useful, not only as a step toward an integrative theory of enculturation, but also as a framework for interdisciplinary collaboration and knowledge exchange.

The target article offered an account of the distinctively human capacity to acquire – and think through – cultural knowledge, norms, and practices. To this end, we leveraged recent ideas from theoretical neurobiology to understand the human mind in social and cultural contexts. Our aim was both synthetic – building an integrative model adequate to account for key features of cultural learning and adaptation; and prescriptive – showing how the tools developed to explain brain dynamics can be applied to the emergence of social and cultural ecologies of mind.

Our commentators raised important issues regarding definitions, concepts, and methods, and called for further development of the model. Given space constraints, we cannot respond to every point in each commentary. To address the key issues, we have organized this response thematically into six sections. In what follows, we discuss: (1) refining the concept of culture to show how TTOM and the free-energy principle (FEP) can capture essential elements of human adaptation and functioning; (2) addressing cognition as an embodied, enactive, affective process involving cultural affordances; (3) clarifying the significance of the formalisms related to the FEP and active inference, including (3.1) entropy minimization, (3.2) Bayesian inference, (3.3) Markov blankets, and (3.4) enactivist views of cognition; (4) developing empirical tests and applications of the model; (5) incorporating cultural diversity and context at the levels of (5.1) intra-cultural differences, (5.2) individual differences, and (5.3) the transition to a digital niche; and (6) potential applications to psychiatry.

Our aim is to clarify a picture of human cognition – not simply in terms of a generic “first principle” paradigm – but one designed to account for specific kinds of adaptation that are reflected in the patterns of social and cultural organization that depend on the body, emotions, interpersonal perception, and epistemic biases.

R1. The domain of culture

Several commentators noted the need to clarify the notion of culture that underwrites the TTOM model. Whiten's insightful commentary suggests that accounts of culture should explain the cultural behaviors of non-human animals, which in recent decades have been documented extensively. In particular, Whiten notes features of animal culture that a unified theory of culture will have to account for, including: phylogenetic reach, that is, the ubiquity of transgenerational transmission of learning behaviors in many species; intraspecies pervasiveness, that is, the inclusion of many forms of behavior within culturally learned repertoires, which may exhibit diversity within a specific or community; stability and fidelity over time; and finally, adaptive preferences for certain modes or types of social learning.

Whiten suggests that, from the perspective of animal cultures, some of the features of culture we discuss are optional, for example, inter-group differentiation. However, it is precisely cultural diversity, borrowing, exchange, hybridization, and competition (i.e., differences that arise primarily through cultural learning) that allow the processes of cultural change and elaboration distinct from the rudimentary replication of limited behavioral repertoires. The 4300-year-long “fidelity” in nut-cracking technologies mentioned by Whiten, along with the minor (and similarly stable) regional variations in ape-foraging strategies, reflects local affordances rediscovered in each generation, apparently without intergenerational cultural transmission (Moore Reference Moore2013). On this view, such “fidelity” may actually reflect a rigid dependence on what we have called natural affordances (Ramstead et al. Reference Ramstead, Veissière and Kirmayer2016), without the cumulative intergenerational cultural elaboration that Tomasello has called the “ratchet effect,” which appears to be unique to humans (Tennie et al. Reference Tennie, Call and Tomasello2009).

The potential phylogenetic reach of basic forms of culture raised by Whiten may yield important insights into forms of learning we share with other animals. Our account, however, focuses on the phylogeny of the enhanced theory of mind modalities and cognition–culture iterative loops that appear unique to humans. Central to our account is the cognitive package that we call Thinking Through Other Minds (TTOM). Under the FEP, we construe this as package as comprising a set of abilities and a domain of statistical regularities that are coupled: abilities enabling us to tune to the minds of others and to navigate environmental uncertainty. The TTOM model explains this coupling by appealing to evolved and learned priors about conspecifics and their mental states.

Among hominids, current models for the evolution of TTOM emphasize cooperative breeding; a strategy that likely evolved in the Homo Erectus lineage circa 2 million years ago (MYA) (Hrdy Reference Hrdy2011) that selects for individuals who are skilled at understanding others’ needs, giving care, and eliciting care. Across species, the evolution of cooperative breeding likely follows different timescales and different pathways to similar traits. New World monkeys such as marmosets, for example, who share a last common ancestor (LCA) with humans 35 MYA, are cooperative breeders with better mind-reading and prosocial abilities than the non-cooperative breeding great apes (LCA with humans 5 MYA), and have as such been recognized as better models than chimpanzees for understanding human evolution (Miller et al. Reference Miller, Freiwald, Leopold, Mitchell, Silva and Wang2016). Humans, in turn, have refined skills for detecting the epistemic cues and their precision allowing for more complex forms of cultural transmission.

We use the computational construct of precision of epistemic cues to account for the relative stability, fidelity, and scaffolded elaboration of cultural forms of life over time. To account for preference guiding behavior based on the formal construct of culture acquisition proposed by TTOM, one might appeal to gene-culture coevolutionary explanations. This is what we suggested with the example of prestige biases and regimes of attention in humans. Prestige biases are an example of external component of the regime of attention – passed on as high-precision epistemic cues (scaffolded on, then divorced from physical dominance hierarchies) – whose effect is enabled by genetically inherited predispositions to social learning functioning as an internal component of regimes of attention (see Figure 4 of the target article).

In contrast to Whiten's appeal to animal culture, Vickhoff starts from the example of the arts, forms of human culture that involve elaborate systems of shared customs, codes, and scripts. Vickhoff considers music as an instance of culture, because it is governed by a system of shared expectations, and conforms to a “style” – a type of convention with esthetic value. To this, we would add a third salient characteristic of music, the arts and, indeed, of culture more generally: improvisation and creative invention (Torrance & Schumann Reference Torrance and Schumann2019). All of these aspects are related to TTOM. Vickhoff suggests that, as an enculturated agent, the listener can access the composer's mind as expressed through culturally scripted features of music. Hence, in Vickhoff's words, we not only access culture through other minds but also engage with other minds through culture. Although the TTOM model accounts for the idea of “set of expectations,” it may be less intuitive to apply the framework to thinking about cultural genres or styles. This is so because such uses speak to notions of esthetic value and of creativity and novelty-seeking, which do not have obvious interpretations under TTOM and the FEP.

Creativity is a kind of exploration of new possibilities for perception and action. Similar to other exploratory activities, it is driven by the human propensity for novelty-seeking or curiosity. Several authors have provided accounts of culture and curiosity in terms of the epistemic value of exploring a niche or a larger adaptive landscape to identify current or future possibilities (Friston et al. Reference Friston, FitzGerald, Rigoli, Schwartenbeck and Pezzulo2017a; Reference Friston, Lin, Frith, Pezzulo, Hobson and Ondobaka2017b; Moulin & Souchay Reference Moulin and Souchay2015; Schmidhuber Reference Schmidhuber2006; Schwartenbeck et al. Reference Schwartenbeck, Fitzgerald, Dolan and Friston2013). On this view, the FEP applies to a second-order process of optimization on longer time scales and, in some instances, across many alternate niches (Bengio Reference Bengio, Kowaliw, Bredeche and Doursat2014). A similar argument can be used for the value of creativity to generate an expanded adaptive repertoire of actions and responses. Improvisation, invention, and innovation are basic to human cultures, not only in the domains of esthetic experimentation that have come to be designated as “the arts” in contemporary societies, but equally in the most quotidian activities.

The relationship between culture and creativity in the framework of TTOM needs further development, to be sure, and Vickhoff gives some clues about how to address this. He mentions recent work on synchrony in human action, which is beginning to reveal the mechanisms of micro-coordination of action in real time and its consequences for feelings of emotional attunement and social affiliation (Kiverstein et al. Reference Kiverstein, Miller and Rietveld2019; Parkinson Reference Parkinson2019; Tschacher & Haken Reference Tschacher and Haken2007; Van de Cruys & Wagemans Reference Van de Cruys and Wagemans2011; Van de Cruys et al. Reference Van de Cruys, Chamberlain and Wagemans2017). Vickhoff notes, as well, that the reduction of uncertainty can be pleasurable (Vuust et al. Reference Vuust, Dietz, Witek and Kringelbach2018). The esthetic pleasure of music may arise in part from a balance between the setting up expectations (through rhythm and repetition of melodic and harmonic structures) and violating expectations with novelty, and then re-establishing order by further repetition or thematic resolution (Huron Reference Huron2006; Vuust et al. Reference Vuust, Dietz, Witek and Kringelbach2018). This play with the tensions of confirmation of expectation and surprise then constitutes a microcosm of the larger adaptive tasks of dealing with an often-unpredictable world (Friston & Friston Reference Friston, Friston and Bader2013).

A similar challenge to the view that all human action interactions are motivated by the minimization of uncertainty is spelled out in Mirski, Bickhard, Eck, & Gut's (Mirski et al.) commentary. For these authors, a free-energy account of culture fails to explain the emergence of new or competing normativities, and the varied ways in which many human actions appear to possess no fitness-enhancing or uncertainty-reducing function. They mention “people deciding to die or suffer for some highly abstract cause” as one such candidate exception to our model. First, we should note that the pursuit of “suffering” is typically patterned and prescribed socially, and seen in many cultural domains from rites of initiation, to religion and athletics (Atkinson Reference Atkinson2008; Coakley & Shelemay Reference Coakley and Shelemay2007; Gaines & Farmer Reference Gaines and Farmer1986). We see this as paradigmatic of, rather than an exception to the search for high-precision cultural modeling under the FEP. The most widely accepted account of “righteous violence” of dying for an abstract cause, in turn, uses Terror Management Theory to explain how perceived threats to one's group and belief systems coming from external agents can motivate altruistic death as a group fitness-enhancing strategy (Pyszczynski et al. Reference Pyszczynski, Motyl and Abdollahi2009). In the framework of active inference, altruistic self-sacrifice is motivated by an effort to reduce a perceived increase in the uncertainty of the social world. Cognitive anthropologist Scott Atran's work on suicide terrorism has yielded further clues on how some people come to die for “abstract ideas.” For Atran (Reference Atran2003), strong motives to defend an imagined community are not simply abstract: they are extrapolated from small-scale group bonding, where strong ties of solidarity – installed by military training – produce the motivation or willingness to die for one's friends, seen as an extension of a broader community. These examples can help us understand how new and competing normativities are also competitions for generative models of the world. Conflict over meaning and how the world ought to be invariably occurs within and between groups. We understand social change as occurring under these dynamics of optimizing world models through intra-group and inter-group competitions. We think that the FEP offers a way to account for both social stabilization and social change, and indeed for human historicity itself.

R2. Culture as embodied affective engagement with affordances

In describing human behavior as culturally patterned via regimes of attention that modulate salience and guide action, we aimed to ground our account in so-called 4EA approaches that emphasize the embodied, embedded, extended, enactive, and affective nature of cognition. As pointed out by several commentators, our attention-and-expectation-centered, information-heavy account left out important details on the affective, sensorimotor, physiological, and phenomenological dimensions of this puzzle.

Baggs & Chemero express the concern that our choice of terminology leads us to pursue an approach that is disembodied and inferential rather than embodied and enactive. They note that Vygotsky long ago proposed an approach to cultural learning as developmental engagements between social actors (initially child and parent) that is enabled by language (Vygotsky Reference Vygotsky1980). This developmental trajectory has been well documented (Tomasello et al. Reference Tomasello, Kruger and Ratner1993). The mother's discursive framing of the child's activity becomes the child's self-talk, which in turn becomes inner speech. The narrative self is born in this move from internalized dialog to monolog. Beyond this developmental picture, Baggs & Chemero insist that “language-involving cognition operates according to a different set of norms, and is not merely a more elaborated form of adaptive fitness.”

We agree that the ability to use language is a game changer for cultural learning. It allows humans to explore imaginary fitness landscapes, install higher-order priors without the need for individual lived experience, expand action repertoires by following recipes or instructions, and explicitly name, frame, debate, and deliberate about social norms. Most powerfully, language allows recursion and self-reflexivity. In theorizing how most human priors become saturated with other minds, and in this sense divorced from direct interaction with the world, we agree with Baggs & Chemero that this process can be cashed out under a Vygotskyian model of scaffolding via social learning and the education of attention. We have made this argument elsewhere (Ramstead et al. Reference Ramstead, Veissière and Kirmayer2016).

We strongly disagree, however, with Baggs & Chemero's claim that the affordances construct cannot be used to explain socially and culturally appropriate behavior. We see no reason to think the engagement of humans with their social and cultural worlds cannot be cast in terms of affordances. Indeed, Gibson himself used the concept of affordance to describe human social and cultural behavior (Reference Gibson1986, p. 119ff). The difficulty in accepting that dynamics of enculturation can be affordance-like, as we have argued elsewhere (Ramstead et al. Reference Ramstead, Veissière and Kirmayer2016), stems from the conservative definition of “affordance” adopted by many cognitive scientists, which applies the concept only to possibilities for action that can be “picked up” without instruction and do not require social learning to recognize and use (e.g., Moore Reference Moore2013).

Drawing in part from Chemero's (Reference Chemero2011) work, we have defined a cultural affordance as a relationship between the embodied skills of an encultured agent and relevant aspects of the cultural environment (Ramstead et al. Reference Ramstead, Veissière and Kirmayer2016). In our model, the relevance of an affordance is cashed out in terms of culturally shared priors about salience; and the ability to engage with cultural affordances corresponds to policy selection in active inference. In the target article, we provide an embodied dynamical account of how such engagement with a cultural niche is possible. That account explains what goes on “under the hood” (and indeed, “around the hood” in the environment) in scaffolded learning processes that stretch from imitative learning to explicit rule following and deliberation. It is with this view in mind that we have described culture as involving a set of deontic affordances. From this perspective, the cultural world not only solicits certain modes of attention and action, but also entails the obligation to respond to the world and other people in constrained ways. These obligations are felt in the body, expressed in habitual stances and actions, as well as (sometimes) in elaborated moral language and deliberation.

Clement & Dukes offer the important specification that culturally patterned practices do not simply direct attention in a value-free fashion, but entail the structuring of valence, as well. For these authors, “it is the valence and the intensity of others’ emotional expressions in particular that can be used to detect what is expected from each member” (para. 4). We could not agree more with this description of social learning as fundamentally affective, and culture as deeply axiological – less about “what is,” as Clement and Dukes put it, and more about “what matters and what is meaningful.” We understand this process as operating beyond explicitly normative situations, and extending to the affective qualities of the cultural world. For example, inferring whether someone's style of dress signals high or low social status, or assessing the “intrinsic” desirability of a prospective purchase, involves picking up on what relevant others expect the world to afford – simply learning fixed associations to social cues without this component of TTOM might lead to frequent errors as we move across contexts or niches. In other words, we are able to pick up the multidimensional valences assigned to specific features of the world by local regimes of attention. We can learn these valences as embodied dispositions to respond to cultural affordances, without explicit awareness of the relevant norms.

In their commentary, Smith & Lane show how the TTOM model could be expanded readily to make an explicit place for emotional processing and interoceptive embodied inference. In their approach, emotional processes are cast as emotional policy selection and emotional state inference (Smith et al. Reference Smith, Lane, Parr and Friston2019a). This extension of the formalism rests on an additional layer of parametric depth that biases action toward inference by the agent of its own affective states. We would add that interoceptive inferences are also modeled socially against the affective states of others – indeed, they are sometimes taught more or less explicitly in the way caregivers react to, and thereby teach us to react to and assign predictive meaning to our own internal states and social positions (Gendron & Barrett Reference Gendron and Barrett2018; Hoemann et al. Reference Hoemann, Xu and Barrett2019). This hierarchical and interpersonal structure of an active inference agent fits well with TTOM, which extends all the way from explicit high-level inferences involving emotional concepts to low-level automatic emotional response generation (Seth & Friston Reference Seth and Friston2016).

This model of emotion, when coupled with the concept of deontic policy selection (Constant et al. Reference Constant, Ramstead, Veissière and Friston2019b) discussed in the concluding section of the target article, can get us closer to what Buskell expects of TTOM: a view of characteristically human phenomena (such as radical organizational change), which requires an understanding of affect and normative behavior. We have begun some work in this direction (Constant et al. Reference Constant, Ramstead, Veissière and Friston2019b) showing how concept of deontic policy selection can help account for phenomena such as social conformity (Asch Reference Asch1955; Toelch & Dolan Reference Toelch and Dolan2015). In this model, we provide a formal description of how the acquisition of regimes of attention allows enculturated agents to zero in on what appears to be the most socially relevant response to a situation. This policy selection may have its own affective consequences which can influence subsequent action. This looping process follows directly from the TTOM model augmented with the concept of deontic action selection and the implementation of affect under the FEP by Smith et al. (Reference Smith, Lane, Parr and Friston2019a).

Van de Cruys & Heylighen provide a compelling example of affective social phenomena that is readily implementable with a model of affect coupled to TTOM. They propose that TTOM can help account for social phenomena that evoke, maintain, and are amplified by negative affect (e.g., dogmatism or radicalism). For instance, “guilt [can] also become a hidden cause efficiently explaining away someone's behavior in the eyes of others belonging to the culture.” Learning the mapping between one's actions, others’ suffering and feelings of guilt attaches an affective valence to policies that can guide behavior. Moral emotions such as guilt, then, contribute to social coordination in particular cultural contexts and indeed, as others have suggested, to the unique forms of pro-sociality found in human forms of life (Tomasello Reference Tomasello2009).

The commentary by Allen, Legrand, Correa, & Fardo suggests that our account is overly focused on cognitive and brain-bound prior beliefs, and does not sufficiently address embodied forms of inference that are ongoing at other scales. Indeed, work by Allen's group demonstrates that visceromotor processes and interoception can be usefully cast as forms of active inference (Allen et al. Reference Allen, Levy, Parr and Friston2019). We agree that social and cultural processes such as those described in the TTOM model may be built on a basis of embodied, interoceptive priors some of which have a long phylogenetic history. We would simply add that cultural processes also allow priors to be installed from the top down, through linguistic coding and mimetic processes, so that in any actual instance, culturally learned behavior is likely to be an outcome of both interoceptive and exteroceptive influences and both embodied and linguistic practices (Di Paolo et al. Reference Di Paolo, Cuffari and De Jaegher2018). This puts cultural history and learning on an equal footing with evolutionary and co-evolutionary influences.

The ongoing social dynamics of affective influence are foregrounded in several commentaries. As Mouras reminds us in his commentary, shared affective, motivational, and sensorimotor processes can be observed and experimentally manipulated directly in social interactions. Mouras's discussion of experiments on postural correlates of in-group bias in empathy for pain offers many useful pointers on how to operationalize models of shared agency in TTOM. Mouras and colleagues have made elegant use of the Minimal Group Paradigm, in which in-groups and out-groups are arbitrarily constituted in a lab environment (Tajfel et al. Reference Tajfel, Billig, Bundy and Flament1971), to demonstrate the quick, flexible, yet precise ways in which embodied and affective joint models can emerge as long as an assumption of shared goals is present. The experiments described by Mouras show that empathy for others’ pain can occur quickly once in-group conditions obtain, and that merely imagining fellow group members in pain affects sensorimotor processes with effects such as automatic (nonconscious) stiffening of posture. These findings echo recent studies on physiological correlates of group bonding that evince shared responses in both participants and spectators during high-arousal conditions of group synchrony, such as marching in a stadium (Jackson et al. Reference Jackson, Jong, Bilkey, Whitehouse, Zollmann, McNaughton and Halberstadt2018) and ritual firewalking (Konvalinka et al. Reference Konvalinka, Xygalatas, Bulbulia, Schjødt, Jegindø, Wallot, Van Orden and Roepstorff2011). Taken together, these studies point to an emerging socio-physiology that reveals some of the embodied interactions that undergird TTOM.

Further points raised by commentators help us describe social interactions as shaped by relational dynamics that include developmental histories and processes of attachment and affiliation. Strand's commentary, on the importance of attachment patterns with supportive others as potential modulators of cultural learning, raises important issues, including the claims that: (1) reward learning mechanisms are central to the patterning of culture via Bayesian inference; (2) neural pathways subserving these mechanisms are likely evolutionarily old, and (3) attachment styles are an important locus of cultural differences. In support of this view, Strand cites recent studies showing that collectivistic cultures exhibit a higher prevalence of insecure-anxious individuals, compared to higher rates of insecure-avoidants in individualistic cultures (Yamagishi & Hashimoto Reference Yamagishi and Hashimoto2016). We agree that attachment and reward mechanisms are central in the phylogeny and ontogeny of culture. On this view, cultural evolution has favored modes of interaction that fulfill and leverage our evolved need for rewarding attachments, enabling the construction of adaptive social ties in each generation (Boyer & Liénard Reference Boyer and Liénard2006; Reference Boyer and Liénard2008). We can think of the patterning of culture in terms of kinds of affiliative relationships, communal activities, rituals, and ceremonies that serve a general function of social uncertainty minimization, and contribute to specific, adaptive cooperative action, whereas offering a protective buffer against the risks of loneliness and rejection.

Strand's comments are also helpful to understand another strange loop in cultural evolution: attachments are central to the patterning of culture, yet are themselves culturally patterned. In Yamagishi and Hashimoto's (Reference Yamagishi and Hashimoto2016) niche construction model of cross-cultural differences in attachment, an avoidant style is understood as adaptive in individualistic cultures (in which help-seeking is socially costly), whereas anxious attachments confer social adaptiveness in cultures that require high levels of cooperativity and social deference (where individualism is socially costly). Of note here, both “individualism” and “collectivism” are collectively patterned via social norms and deontic cultural affordances that demand different levels of autonomy in specific contexts.

Pezzulo, Barca, Maisto, & Donnarumma's (Pezzulo et al.) insightful commentary on social epistemic action further assists us in specifying that, in addition to being rewarding for its own sake, human joint action often entails conveying cues for the sake of others. Pezzulo et al. explain that unlike epistemic actions (aimed at updating one's contextual beliefs about relevant features of the world) and pragmatic actions (acting on the world after resolving contextual uncertainty), social epistemic actions serve the function of directing others’ attention to patterns and regularities that we want them to infer. This is precisely the phenomenon we have referred to as regimes of attention (Ramstead et al. Reference Ramstead, Veissière and Kirmayer2016). The authors point out that although such actions can be verbal, humans routinely employ a wealth of kinetic and sensorimotor cues (what they cleverly call “motionese”) to convey the internal states and intentions they want others to infer, or to “signal more directly what and where salient information is.” Pezzulo et al. rightly emphasize the importance of social epistemic actions on “faster timescales” (e.g., during teaching, learning, and real-time communication) but we also find the notion useful for an account of the evolution of human sociality on longer co-evolutionary and developmental timescales. The existence of species-wide ostensive or indexical cues, such as pointing (to direct others’ attention), or holding out a hand (to signal helping behavior) offers strong evidence for evolutionarily old strategies for communicating universally legible intentions and emotions to others. The presence of laughter and vocal crying in apes, humans, and pre-verbal human infants offers further evidence for phylogenetically old and involuntary (or automatic) drives for social epistemic actions. Indeed, both actions occur under weak or absent voluntary control, and are best described as forms of honest signaling that reveal true internal states (Provine Reference Provine2017). For example, an experiment using short audio clips found that hearing joint laughter was sufficient for participants from 24 diverse language groups to differentiate pairs of friends from strangers (Bryant et al. Reference Bryant, Fessler, Fusaroli, Clint, Aarøe, Apicella, Petersen, Bickham, Bolyanatz, Chavez, De Smet, Díaz, Fančovičová, Fux, Giraldo-Perez, Hu, Kamble, Kameda, Li, Luberti, Prokop, Quintelier, Scelza, Shin, Soler, Stieger, Toyokawa, van den Hende, Viciana-Asensio, Yildizhan, Yong, Yuditha and Zhou2016).

This view of cultural transmission as entailing multiple forms of learning falling on a spectrum from automatic “picking up” of information to explicit teaching and deliberation assists us in taking up Gweon's helpful pointers for “a more complete picture of social learning.” Gweon points out, building on Cisbra and Gergely's (Reference Csibra and Gergely2009) Natural Pedagogy paradigm, that children are intrinsically motivated to identify high-utility, high-quality information from reliable sources, and are subsequently motivated to teach that information to their peers. We agree with Gweon that this view of children as active agents of both Bayesian inference and attention-directing communication is key to understanding cultural transmission. In this line, we welcome Michael & de Bruin's invitation to consider the importance of mind-shaping mechanisms and to clarify how the TTOM model of enculturation accounts for a full implicit-to-explicit inference spectrum. We find the postulated existence of evolutionarily old, developmentally early behavior-influencing mind shaping mechanisms (De Bruin and Strijbos Reference De Bruin and Strijbos2020; Zawidzki Reference Zawidzki2013) fully compatible with the account we have outlined above. Our account of social cognition recognizes multiple levels and instances of inference from the “quick and embodied” to the effortful and deliberative. In doing so, we join previous “multi-system” efforts by Christensen and Michael (Reference Christensen and Michael2016) intent on resolving tensions between theory–theory, simulation, and embodied accounts of mind-shaping and mindreading.

R3. Formal clarifications: What the FEP adds to our understanding of cultural learning

The TTOM model gains precision and explanatory power by leveraging the active inference framework. Unfortunately, some of the key features of active inference have been misunderstood in the recent literature, as well as in some of the commentaries. Clarifying these misunderstandings is crucial to appreciate how TTOM can be applied to model cultural learning.

R3.1. Optimization and “entropy minimization”

One frequent confusion concerns the construct of entropy and its relationship with variational free energy in the active inference framework. The commentary by Fortier-Davy, for example, characterizes our view in the paper as the claim that “humans tend to minimize entropy” or, still more strongly, that “humans seek to minimize entropy” [emphasis added]. We note that nowhere in the text do we actually make either of these claims. However, statements such as Fortier-Davy's, although inaccurate, are often presented in the literature as a criticism of active inference. These critiques treat active inference as if it were equivalent to entropy minimization. The accompanying critique, then, is that, because there is a large body of evidence from psychology and social science that human behaviors do not always (or usually) act to minimize entropy, active inference must not apply globally to human cognition.

Certainly, entropy reduction and FEP minimization are related, but the link is more nuanced than simple equivalence. Variational free energy is an upper bound on surprisal; and assuming that the system we are considering is measurable (i.e., is equipped with a nonequilibrium steady state), the time average of surprise will converge toward entropy. This means that it is true that a system that is able to track and minimize free energy from moment to moment will also place an upper bound on entropy on average and over time (equivalently, place a lower bound on model evidence or marginal likelihood). At this first level of analysis, which considers only moment-to-moment dynamics, Fortier-Davy's claim about the relationship between the FEP and entropy seems to hold. However, there is more to this story.

The relationship between entropy and variational free energy is enriched when considering the enactive aspect of self-evidencing, that is, by the consideration of planning sequences of actions. Contemporary formulations of active inference work not merely with variational free energy but with expected free energy, which is the (average) expectation value of free energy for a given policy (Friston et al. Reference Friston, FitzGerald, Rigoli, Schwartenbeck and Pezzulo2017a; Reference Friston, Lin, Frith, Pezzulo, Hobson and Ondobaka2017b). If we consider a probability distribution over the space of available policies, minimizing expected free energy actually means that we will end up maximizing the entropy of that distribution: in effect, a maximally entropic or flat distribution over the space of policies means that the agent is “keeping its options open.” Thus, it follows from our formal account that humans (and all creatures) pursue actions that increase the entropy or spread of their available action repertoire. This idea is reflected in recent work on the epistemic value of actions (Parr & Friston Reference Parr and Friston2017a; Pezzulo et al. Reference Pezzulo, Cartoni, Rigoli, Pio-Lopez and Friston2016).

Further, the idea that living creatures minimize the entropy of their states is not contradictory to the idea that humans (and other animals) will seek out novel, high entropy stimuli. Indeed, this is just what active inference agents do when they select the policy that minimizes expected free energy. This, again, follows from the formal construction of expected free energy (Parr & Friston Reference Parr and Friston2017a; Pezzulo et al. Reference Pezzulo, Cartoni, Rigoli, Pio-Lopez and Friston2016). One way to express the expected free energy of a policy is as risk minus novelty (the same formula used in economics); and another is as pragmatic value plus epistemic value. Both of these formulations account for the novelty-seeking behavior that is typical of agents such as humans.

An agent that acts to minimize expected free energy will first explore the world and resolve uncertainty by seeking out high-entropy (and therefore, information rich) stimuli, before acting on the pragmatic value of a policy. Indeed, information is defined as the amount of uncertainty resolved by making an observation. As such, the most informative observation is the one that resolves the most amount of uncertainty, which means that agents may seek out the most entropic stimuli in order to derive the greatest amount of information. Finally, recent active inference formulations parameterize expected free energy (Hesp et al. Reference Hesp, Smith, Allen, Friston and Ramstead2019); in this setting, agents have beliefs or preferences about how surprised they should typically be. These qualifications about the relationship between the entropy and active inference naturally account for the findings discussed by Fortier-Davy that humans tend to prefer medium-entropy rather than low-entropy stimuli.

A broader concern about the utility of TTOM is raised by Overgaard, who finds the model ambiguous insofar as it seems to conflate a simple description of the ethnographic reality of what people do (namely, the obvious truth that thought and action occur cooperatively with other people) with a more specific computational model of specific kinds of learning (about which he is dubious or noncommittal). We think there is no real ambiguity here, but a simple progression from description to explanation. TTOM begins with the descriptive facts that Overgaard acknowledges but moves on to account for what is going on in these interactions – and the phenomena of cultural learning. To do this, TTOM incorporates a model of implicit learning that involves our brains and bodies interacting with one another and the world (e.g., as our neural networks are tuned to predict our social and designed environment through rolling cycles of sensorimotor action and perception). The subtlety is that TTOM is a theoretical model of cultural learning that describes cultural learning and cognition as following particular constraints (formalized mathematically in the target article). Yet at the same time, TTOM is something that people “do” insofar as those equations capture the dynamics of neuronal and social ecologies that constitute the sort of enculturated beings that we are. TTOM is at once a name for and a map of what we do.

R3.2. Building on Bayes: Cultural learning, optimization, and adaptation

Colombo gives us the opportunity to clarify another, often ill understood fact about the FEP and active inference. Colombo poses the following problem: if “utility (or adaptive value) of an outcome is equivalent to its probability [then] complying with social norms [which entails maximising the probability of certain outcomes] always has adaptive value.” Obviously, we agree with Colombo that this is not the case. Sociocultural norms and dynamics do not need to promote individual fitness or cultural progress. Indeed, norms and culturally prescribed goals can also lead to death or societal collapse (Diamond Reference Diamond2010). If cultural progress occurs, it is most likely because cultural norms promoted changes that yielded population-level advantages in response to environmental pressures over long stretches of time. Conformity is useful to social groups and, indeed, the tendency to conform varies both among individuals within a group and, on average, between groups. But, the norms to which individuals learn to conform need not benefit everyone individually, and could even involve harm for some individuals or subgroups, yet persist because they enhance population-level fitness. The discrepancies, conflicts, and trade-offs between these utilities at different levels and timescales can account for the persistence of norms that are not beneficial to the individual or to some segment of a social group.

Colombo's main concern, however, is with another kind of conflation between descriptive expectations, normative expectations, and preferences for conformity to social norms. The FEP is a normative framework in the sense that it states what all organisms must have done, given that they have existed for some duration (i.e., minimized their free energy), in the same spirit that natural selection will tend to maximize (expected) fitness at the population level. The FEP is normative in the sense that it tells us about the conditions that must have been met by environment-sensitive systems, if they have maintained themselves at the nonequilibrium steady state. The FEP is not normative in the sense that it tells us what all organisms ought to do, or what will necessarily happen to them. Thus, it leaves plenty of space for thinking about things such as branching processes, spandrels, maladaptive traits, constraints, trade-offs, and mismatches – all the stuff we need to understand the place of adaptive value in the contingent history and development (ontogenetic or evolutionary) of the system of interest. The observation that particular suboptimal or maladaptive norms obtain is not a problem but an observation that forces us to explore the impact of initial and boundary conditions and historical trajectories on the constitution of humans and their cultural niches.

A related objection is raised by Zefferman & Smaldino, who point out that our model does not sufficiently account for the learning and cognitive biases that drive cultural transmission. According to Zefferman and Smaldino, the FEP offers little explanatory power when compared to these learning rules. We agree that cultural evolution likely favored multiple rules reflecting diverse mechanisms, optimized over time, or recruited for specific kinds of cultural learning and adaptive tasks. In the target article, we made a case for prestige, expertise, and in-group attentional biases as obvious candidates for high-precision learning rules driving cultural evolution. Rules that may be maladaptive (in that they are mismatched with the environment, or confer fitness on certain traits or groups at the expense of other rules that are more adaptive) or that may not operate under active inference do pose a challenge to our model. For example, Perreault, Moya, and Boyd argue that “many adaptive problems are difficult because the environment does not provide clear cues to the best behavior. What is the best design for a bow? What causes malaria? It is not clear what decision rule will be favored by selection when the environmental cue does not allow accurate inference” (Perreault et al. Reference Perreault, Moya and Boyd2012). Our work so far has spelled a general model of the architecture of cultural learning systems as whole (see Figure 4 of the target article) at the expense of an exhaustive description of the specific rules that emerge in the system. We welcome the challenge to enhance our model with a better account of invariant rules, which are likely to be domain specific.

The commentary by Mirski et al. also addresses the need to specify the mechanisms of cultural learning but misses another key point about the general TTOM framework – by misunderstanding how new local “rules” can emerge in the patterning of culture. They suggest that approaches to cognition based on active inference are incorrect because they cannot explain how non-phylogenetic priors, such as those learned through experience, could come to guide the behavior of agents. This ignores the fact that our model accommodates the learning of (empirical) priors at various timescales, and most notably through immersion in coordinated action and experience – a view of learning in context often called empirical Bayes (Friston et al. Reference Friston, FitzGerald, Rigoli, Schwartenbeck, O'Doherty and Pezzulo2016; Kass & Steffey Reference Kass and Steffey1989; MacKay Reference MacKay1992). Mirski et al. merely stipulate that the FEP cannot account for normative phenomena and do not engage the framework and formalism that we provide precisely for online learning of such patterns.

R3.3. The Markov blankets of culture

In conjunction with the FEP, TTOM relies on the notion of Markov blankets as ways of demarcating cognitive systems and their environments. Thomas Parr adds helpful refinements to the cultural affordance model we deploy in our paper by focusing on the ways in which humans (and those studying them) draw Markov blankets. Parr brings to the fore three perspectives on the ways in which we, as scientists, choose to draw Markov blankets around systems of interest to build a computational model. First, there is the selection of the blanketed system itself (the set of internal states and their blanket), which licenses an inferential interpretation of the system dynamics. This interpretation chooses a set of systemic or internal states that infer external, non-systemic states through their vicarious coupling via the blanket states. Second, once such a blanketed system is carved out, one needs to select which blanket states will drive the activity of sensory states; this is the question of attention. Third, the system must also select which parts of the environment are most relevant to its policy selection; this is the issue of salience, novelty, and the more general issue of selective sampling.

An important empirical question to understand the behavior of any organism, on Parr's view, is to identify the relevant “influences from the outside in” that modulate “implicit choices made by internal states of a Markov blanket as to which blanket states most influence their dynamics.” A central argument in our account is that for Homo sapiens, “the outside” is culturally encoded with group-level, action-generating expectations about the world that predict differentiated (hence patterned) experiences of, and action in the world. Through developmental scaffolding via shared modulations of attention (“enculturation”), the most relevant buffer of statistical regularities for humans gradually becomes other people's expectations about the world and how to function in it optimally given a set of social norms and environmental constraints (Veissière Reference Veissière2018). It is this buffering dynamics of a shared Markov blanket (Poirier et al. Reference Poirier, Faucher and Bourdon2019) between the human groups and the world that we have called TTOM.

R3.4. Addressing enactivist objections

Several objections to the active inference approach are raised from enactivist perspectives echoing those of Baggs & Chemero. In the case of the commentaries by Hutto, and Kiverstein & Rietveld, these objections are based on a very narrow reading of computation, information, and representation. Active inference is a theory of belief-guided action and related information flows. Information, in this context, is not a cognitivist or Cartesian construct that would render our theory incompatible with models focused on dynamic coping and attunement. Rather, variational free energy is an information theoretic measure that emerges from dynamic interactions between a Markov-blanketed system (with its own set of beliefs or generative model) and its embedding environment – which as we have argued, crucially includes culture via ecologies of other minds. We have known since the pioneering work of Pattee (Reference Pattee1977) and Kelso (Reference Kelso1994) that informational and dynamical descriptions are complementary, not contradictory. Variational free energy quantifies precisely the extent to which a system becomes a model of its environment through their reciprocal dynamic interactions. Second, and most importantly, where we decide to draw a Markov blanket has consequences for our inferential interpretation of its dynamics. For human systems, this means distinguishing between two cases: (1) those in which two agents (as blanketed systems) are inferring each other; and (2) those in which a higher-order system, which has two agents as its internal states or component parts, infers aspects of the external world. We believe TTOM applies to both cases; in the first case, it provides a new take on Theory of Mind abilities of agents (i.e., their capacity to infer each others’ mental states); in the second, it provides a new account of cultural dynamics as a form of group inference (Clark Reference Clark, Metzinger and Wiese2017b; Kirchhoff et al. Reference Kirchhoff, Parr, Palacios, Friston and Kiverstein2018). The flexibility of this formalism licenses our use of it as a model of human dynamic interactions across spatial and temporal scales.

Hutto's commentary suggests that the TTOM model is making a “spectatorial assumption,” in that it suggests that humans do not have direct access to another's mind and that they must employ inferential abilities to access the mental states of others. We accept this characterization of our view. It seems obvious to us. Even if one were to focus on the phenomenology of interpersonal engagement, it also seems like a plain fact about experience that human agents never have direct, unmediated access to the minds of other agents. Humans have a rich phenomenological experience, on the basis of which we must infer the mental states of others (i.e., facets of their own experience). This follows quite simply from the basic setup of the problem that all living things face: they only have access to the environment through their sensory states (including interoception), and must reconstruct the causes of their sensations – other persons included – in an inferential matter (Hohwy Reference Hohwy2016). This, however, does not mean that human agents are cut-off from their social world. Nor does it mean that all knowledge of others involves explicit inference. Indeed, as we have argued elsewhere (Ramstead et al. Reference Ramstead, Badcock and Friston2018), the multiscale active inference formulation entails that – to the extent that they form a higher-order social ensemble – systems that are segregated at one scale are integrated into a higher-order dynamics. The regulated coupling of internal and external states via the dynamics of blanket states implies a semantics and indeed semantic content in the strongest sense (Constant et al. Reference Constant, Clark and Friston2019a). This inherently representational semantics of active inference is frequently missed by enactivist interpretations.

R4. Applications and empirical tests of the model

We welcome comments from Brown, Brusse, Huebner, & Pain (Brown et al.), who speak from the vantage point of cognitive archeology. They agree that unifying models can help “dissolve disputes by bringing rival positions under a single theoretical framework,” but remain unconvinced that the FEP can offer such a framework. In particular, Brown et al. express their doubt that our model can generate testable predictions. This concern is shared by Dołęga, Schlicht, & Dennett (Dołęga et al.), who argue that our account does not sufficiently distinguish between the levels of explanation. They suggest that our model is best understood as applying to Marr's computational level of explanation (i.e., characterizing the information processing task). Of course, this specification has consequences for theory building. For example, by showing that one can cast both mindreading and niche construction as forms of active inference, we show that there is no need to choose between competing alternatives. Although our account shows how internalist and externalist accounts can both be justified in some contexts, Dolega et al. claim that it does not provide us with a way of deciding between the two options for any given phenomenon. We are thus, in their view, over-inclusive and gain explanatory scope at the cost of explanatory power and precision. These charges are well put and pose an important challenge to the usefulness of the TTOM model. However, we think the apparent empirical limitations of the model reflect its current formulation in generic terms. Detailed examples in particular domains are needed to produce specific hypotheses and test the fit of models with experimental data. Moreover, plenty of evidence (both neural and behavioral) is already available to demonstrate the viability of the principles that ground our model as applied to individual cognition (e.g., Friston Reference Friston2010; Keller & Mrsic-Flogel Reference Keller and Mrsic-Flogel2018).

The methods we employ to formulate TTOM are borrowed from theoretical neurobiology. These basic principles have been applied a wide range of problems and tested against empirical data, and we know they can explain and reliably predict many features of individual human behavior (Cullen et al. Reference Cullen, Davey, Friston and Moran2018; Mirza et al. Reference Mirza, Adams, Mathys and Friston2016). The application of these methods to cognitive systems beyond individual humans and their brains, however, is more recent. The challenge is to figure out what should count as empirical evidence for an ecologically extended, and spatiotemporally scalable model like TTOM. We think the paleoanthropological record holds some clues that may help us answer these questions, whereas addressing a second point raised by Brown et al. about what they perceive as our model's failure to address debates between the internalist and externalist approaches to cognition.

The TTOM model, to clarify, proposes an “interactionist” view of cognition as at once internal to individual brains and bodies, and extended, embedded, and enhanced through external features of the social world. Work in paleo-anthropology – that emphasizes the co-evolution of human cognition with their anthropogenic environment modifications – provides explicit examples of this process. For example, Dietrich Stout's analysis of cumulative cultural evolution in the lower paleolithic suggests that capacity for Theory of Mind was significantly enhanced through selective pressure to teach and learn the making of stone tools (Stout Reference Stout2011; Stout et al. Reference Stout, Passingham, Frith, Apel and Chaminade2011). In other words, tool production played a crucial role in selecting for better perspective-taking abilities. Using ethnographic evidence from the Ju/’hoansi Bushmen, Wiessner (Reference Wiessner2014) argued that the invention of fire, and the ritual practice of firelight talks at nighttime played an important role (in both phylogeny and ontogeny) in “evoking higher orders of theory of mind via the imagination, conveying attributes of people in broad networks (virtual communities), and transmitting the ‘big picture’ of cultural institutions that generate regularity of behavior, cooperation, and trust at the regional level.” On this interactionist view, then, fire also played a role in the evolution of cognition. On more recent accounts of the “Broad Spectrum Revolution,” a further leap in human cognitive, technological, and cooperative foraging skills occurred around 40,000 years ago after anthropogenic depletion of megafauna provided new selective pressures for diversification of hunting and gathering strategies (Sterelny Reference Sterelny2007; Reference Sterelny2011; Zeder Reference Zeder2012).

These diverse examples point to multiple ways in which humans have learned to resolve uncertainty through various TTOM-enhanced, environmentally mediated pathways. We have outlined a unifying account of human cultural co-evolution, which can be operationalized under the FEP. However, detailed predictions and testable hypothesis will have to take on board historical contingency and boundary conditions.

R5. Modeling diversity and difference in TTOM

TTOM models the acquisition of culture by a generic agent in a stable, relatively homogeneous social niche. However, the reality in most contemporary societies is one of the high levels of diversity. In our responses to Mirski et al., we have discussed how social change, emerging normativities, and social conflict can be described as processes of group selection and optimization. Many levels of difference and variation remain to be examined in our account.

Intra-cultural and inter-individual diversity, as well as non-conformity and intra-individual change (when individuals change their mind over time or in response to changes in context) raise thorny questions for TTOM. Whether our ongoing transition to a digital world with unpresented access to billions of minds is a game-changer for TTOM is another pertinent question. The commentaries by Christopoulos & Hong, Bouizegarene, and Clark are especially helpful in addressing these essential refinements to the model.

R5.1. Cultural diversity and contextual variation

In their commentary on the multicultural mind, Christopoulos & Hong remind us that humans are not simply passive recipients of the cultural models through which they interact with their world, but instead actively construct their world through a variety of learned strategies. Christopoulos & Hong discuss the case of individuals exposed to multiple cultural environments, who thereby acquire an intuitive understanding that “the very same behavior could have different causes, interpretations and consequences” depending on the cultural context in which it is manifested. We might add that such individuals may also possess reflective knowledge that the same environment affords different actions to different people. “Cultural-frame switching,” thus, can be conceptualized as the ability to switch between different possibilities for thought, affect, and action by leveraging priors from different regimes of attention. This capacity is particularly evident in the case of migrants who integrate into their culture of adoption, but also in those who learn new ways of seeing and doing through travel, or who grow up with a “home” culture different that the dominant culture of the larger society around them. Christopoulos & Hong's observation that many individuals do not fit neatly within a single cultural model, we think, applies to our understanding of “culture” more generally because most people are exposed to more than one cultural framework.

The encoding of the external world with skill-bound, value-laden patterned possibilities for attention and action is only made possible via TTOM. Whether some or any affordances are entirely TTOM-free is an open question. Fortier-Davy, for example, disputes our claim that differences in optical illusions across culture provide an example of TTOM. On Fortier-Davy's account, the attunement of visual priors depends on external features of the environment alone. Of course, these physical environments are humanly constructed and so bear the traces of other minds but, as Fortier-Davy avers, no expectations about those minds may be necessary to learn some of the perceptual biases or expectations associated with specific cultural contexts. However, for more complex inferences the process of TTOM is essential. Thus, Fortier-Davy presents the scenario of individuals growing up amid “complex and ambiguous scenes,” which he understands as structuring a “holistic perceptual style” that may induce vulnerability to “illusions requiring context-independent scrutiny.” Cultural differences in automatic attention to contextual versus isolated information are well documented (Kitayama et al. Reference Kitayama, Duffy, Kawamura and Larsen2003). Context-dependent perceptual styles are typically found in more collectivistic cultures in which strong attention to social context is primed from early childhood. Attention to complex systems of causality in the external world (such as weather patterns or the distribution of plant and animal species relevant to hunter–gatherer cultures) is also learned socially via immersion in particular cultural contexts and the education of attention guided by more experienced role models. Clearly, other minds in such settings are always instrumental in the structuring of perceptual priors. We remain convinced that for all culturally proficient humans, perceptual priors, policies, and actions are patterned in a similar fashion.

A fully worked out, testable Bayesian model of culture – operating via TTOM – would need to identify the relevant “cultural base-rates” to which people outsource their priors in different situations. Most people (even those not explicitly raised in different cultures) likely draw on a variety of cultural, subcultural, class, educational, political, and esthetic models to navigate their world from context to context and task to task. In each case, the evocation and deployment of particular cultural repertoires depends on the interaction of the individual's learning history with particular contextual affordances, norms, and expectations. How might a testable model intent on predicting such a person's behavior identify a set of relevant, switchable cultural base rates according to situation? The Minimal Group Paradigm experiments mentioned by Mouras, as we discussed above, clearly show that humans can switch group-based frames of references and allegiances in quick, effortless, and seemingly arbitrary ways (such as being randomly assorted into blue or red T-Shirt groups). The limits of such flexibility remain poorly understood. Are there locally stable, conventionally constituted base-rates (such as those associated with age, group-affiliation, gender, or class) that will tend to prevail over other markers of group identity? We are not aware of any studies testing these important questions.

R5.2. Individual variation

Bouizegarene's commentary similarly calls for refinements to our generic account of cognition and culture – this time by appealing to individual differences in conformity and receptivity to cultural affordances. Discussing individual differences in normative identity styles, Bouizegarene asks how our account could explain how “some cultured agents seek and tolerate the uncertainty of questioning their identity beyond social norms and voluntarily go about a long process of thinking autonomously about themselves, rather than using norms as an antidote to this uncertainty?” These, we agree, are pertinent questions that a fine-grained variational account of culture must address. First, we should clarify that a higher tolerance for individual uncertainty and questioning, in relation to social norms, does not entail a complete divorce from social norms. Indeed, beyond the intrinsically social (and hence patterned) dimensions of language, meaning, and collective goals in which questioning takes place, such questioning of norms is made possible only in relation to social norms. A fuller niche construction account of sociocultural evolution, thus, could cast the co-evolution of a spectrum of personality traits with a population from “conservative” to “novelty-seeking” (e.g., conscientiousness, anxious attachments, and agreeableness on the one hand, openness to experience, thrill-seeking, and oppositionality on the other) as a group-level solution to ensure optimal adaptability to environmental change, and optimal conservation of cumulative culture.

R5.3 Transition to the digital niche

Clark's questions about human–machine interactions and the “thermodynamics” of digitally mediated human life are important for a fuller articulation of our model that can address the changing techno-cultural landscape. For example, the emerging problem of “smartphone” or “screen” addiction may reflect new dynamics of information foraging and TTOM. We have argued elsewhere (Stendel et al., Reference Stendel, Ramstead, Veissiére, Kirmayer, Worthman, Kitayama, Lemelson and Cummingsin press; Veissière & Stendel Reference Veissière and Stendel2018) that a hyper-abundance of informational uncertainty online solicits the hyper-activation of evolutionarily old attentional biases for social and group-fitness-enhancing information. In turn, this affords an addictive relationship with screens through a constant search for social rewards, social comparison, and high-precision cultural information. We have termed these dynamics the “hypernatural monitoring hypothesis” (Veissiére & Stendel Reference Veissière and Stendel2018). The human minds’ limitations on processing vast amounts of information online have recently been described as “bottlenecking” mechanisms that favor belief-consistent, negatively valenced, predictive information of a social nature (Hill Reference Hills2019). These recent mechanisms of digital niche construction have been proposed as candidate explanations for the rise of new social challenges, such as increasing extremism, political polarization, and the proliferation of misinformation (Hill Reference Hills2019). The TTOM model may provide a way to formulate some of the online dynamics that contribute to these social problems.

The socially leveraged processes of epistemic foraging that underpin TTOM can be described with a few simple algorithms. The bottlenecking mechanisms of informational uncertainty minimization observed on the Internet show us just how much information about threats and group affiliations matter to human minds. Still, much work remains to be done to typologize the attentional biases (“choices,” or “policies”) that underpin TTOM. Future work will need to distinguish between biases directly geared toward other minds (such as attentional preferences for eyes, faces, group affiliation, and propositional attitudes), those that harness, enrich, and “anthropomorphize” evolutionarily older, developmentally earlier biases, and those that are not about other minds at all. Thus, we can think of automatic mechanisms that track prestige, social status, and reputation (Henrich & Gil-White Reference Henrich and Gil-White2001) as “recycling” general epistemic-foraging mechanisms (found in all living organisms) scaffolded into dominance hierarchies (found in all social mammals). These high-precision cues offer relevant information about which fitness-enhancing model to track and learn from. For humans, these dynamics operate both automatically and self-reflectively through symbolic and status cues grounded in normatively configured hierarchies governing optimal moral standing and social functioning. Similarly, we can think of evolutionarily old threat-detection modalities such as the negativity bias (Vaish et al. Reference Vaish, Grossmann and Woodward2008) and pollution avoidance mechanisms (Stevenson et al. Reference Stevenson, Case, Oaten, Stafford and Saluja2019) as becoming re-encoded via TTOM through such symbolically enriched processes as superstition, xenophobia, bullying non-conformists, hypochondria, paranoia, magical thinking, conspiracy theories, and the myriad other metaphors and narrative models that postulate the existence of “dark,” “malefic” forces and agents as the “cause” of negative internal states and social problems (Boyer Reference Boyer2018).

R6. Applications of TTOM to psychiatry

In addition to a better understanding of social problems, we think our model can help advance the study of psychopathology by highlighting interactions between the social and neural dimensions of various disorders.

In their wide-ranging commentary, Dumas, Gozé, & Micoulaud-Franchi (Dumas et al.) provide a very useful extension of our discussion of the shared affective and phenomenological processes that underwrite human experience. To advance an “interactive turn in psychiatric semiology,” Dumas et al. call for a unified computational framework that could account both for sociocultural variations in psychiatric symptomatology (including how patients interpret and enact different explanatory models of illness, and how clinicians leverage different explanatory models in their interactions with patients) and current neuroscientific findings on physiological disturbances in the brain. Indeed, they go beyond our original argument, to suggest that the TTOM model can contribute to the development of methods that examine how the whole extended cultural phenotype of humans interacts with the whole genotype in what they call social physiology. We welcome this ambitious project, and applaud Dumas et al.'s broader discussion of multi-scale approaches to psychiatric disorders grounded in active inference – in particular, their comments on neurodevelopmental conditions such as autism and schizophrenia that have been described as minimal self-impairments, and that may be operating at more basic levels than the cognitive-behavioral loops that are the focus of the current TTOM model.

The potential relevance of TTOM to psychiatry is also brought out by Lifshitz & Luhrmann in their commentary on the ways in which culture can structure the affective valence and content of hallucinations. In addition to pointing out, as Gold and Gold (Reference Gold and Gold2015) have done elsewhere, that all types of hallucinations appear to involve social scenarios and “the relevant others that humans think through” to guide their existence, Lifshitz and Luhrmann provide compelling examples of sensorially rich “hallucinations” (interacting with imagined agents who are not actually present) in contexts such as religious practice that do not involve any pathology or underlying brain dysfunction. This points to the far-reaching ways in which culture can affect perception. Their review of recent findings on the cross-cultural patterning of experience in schizophrenia also shows how basic TTOM impairments, which likely do entail dysfunctions of an organic nature (and, as such, might lead to behavior recognized as dysfunctional in any cultural context), predict widely different degrees of distress depending on the specific cultural information conveyed in hallucinations, and on local cultural assumptions about the nature of affliction. Lifshitz and Luhrmann thus raise important questions on the reach and limits of cultural influences on phenomenological plasticity of mental disorders in general.

Finally, we welcome the suggestions for extending TTOM offered by Bolis & Schilbach. We find their dialectical account of the interactions and relationships between the agents compelling and appreciate their call to extend our model to take into account the diversity of human experience, including neurodiversity. The idea that difficulties in interaction between the neurotypical and neurodiverse individuals result from the misalignment between individuals, rather than simply from some underlying deficit in the neurodiverse population, has far-reaching implications for our understanding of and attitudes toward individual differences and difficulties in adaptation. Their suggestion to focus more closely on the dynamics of real-time interactions among humans points to fruitful avenues for empirically testing the TTOM model.

R7. Conclusion: The future of TTOM

We opened this discussion by presenting the “puzzle” of culture for a species characterized by immense diversity in skill sets, along with ways of thinking, feeling, and perceiving the world. We sought to clear the conceptual muddle and dispel any just-so story to describe the patterning of culture around evolved capacities for shared attention. Our model can thus be read as extending earlier multidisciplinary efforts to investigate the ways in which human language, systems of meaning, kinship, social institutions, norms and organization reflect, extend, and are constrained by basic evolved structures of the human mind. In addition to outlining how the brain and socially constructed niches are dynamically coupled in cultural learning, we have argued that these dynamics can be usefully operationalized under the FEP.

By highlighting the patterned dynamics through which the world comes to afford different things to different groups (and that lead different groups to be treated in different ways by other groups), our model offers a naturalistic account that may help operationalize some views typically understood as “socioconstructivist.” This should not be read as a radical endorsement of culture as an anything-goes process, entirely divorced from natural and biological constraints. The evolutionary and developmental acquisition of cultural affordances, as we have argued, builds on a set of attentional biases for coalitional intention-tracking, threat-avoidance, and prestige-cued, and social fitness-enhancing information – where the latter maximizes an individual's access to relevant skills, explanatory models, values, moral status, social recognition, and social support. All cultures and cultural subgroups operate with these dynamics.

Although we are convinced that many human affordances are collectively modulated via TTOM, our model does not deny the existence and importance of other external and natural affordances, many of which may be configured by cultural activities that construct our material world and local niches. Once a body is equipped with culture-bound skills, and once a world is layered with culture-bound meaning, patterned dynamics of cooperative action and improvisation become possible.

After engaging with our commentators’ provocative critiques and suggestions for refinement we are left with the sense that the future of TTOM looks bright. In ongoing collaborations, we are exploring how to augment the theory with affective valence, take into account individual differences and historicity, and begin to model specific domains such as epistemic bias. Once again, we are deeply grateful for this creative colloquy and look forward to continued TTOM.

References

Allen, M., Levy, A., Parr, T. & Friston, K. J. (2019) In the body's eye: The computational anatomy of interoceptive inference. BioRxiv 603928. https://doi.org/10.1101/603928.CrossRefGoogle Scholar
Asch, S. E. (1955) Opinions and social pressure. Scientific American 193(5):3135.CrossRefGoogle Scholar
Atkinson, M. (2008) Triathlon, suffering and exciting significance. Leisure Studies 27(2):165–80.CrossRefGoogle Scholar
Atran, S. (2003) Genesis of suicide terrorism. Science 299(5612):15341539.CrossRefGoogle ScholarPubMed
Bengio, Y. (2014) Evolving culture versus local minima. In: Growing adaptive machines: Combining development and learning in artificial neural networks, eds. Kowaliw, T., Bredeche, N. & Doursat, R., pp. 109138. Studies in Computational Intelligence. Springer.Google Scholar
Boyer, P. (2018) Minds make societies: How cognition explains the world humans create. Yale University Press.Google Scholar
Boyer, P. & Liénard, P. (2006) Why ritualized behavior? Precaution systems and action parsing in developmental, pathological and cultural rituals. The Behavioral and Brain Sciences 29(6):595613, discussion 613–50.CrossRefGoogle ScholarPubMed
Boyer, P. & Liénard, P. (2008) Ritual behavior in obsessive and normal individuals: Moderating anxiety and reorganizing the flow of action. Current Directions in Psychological Science 17(4):291–94.CrossRefGoogle Scholar
Bryant, G. A., Fessler, D. M., Fusaroli, R., Clint, E., Aarøe, L., Apicella, C. L., Petersen, M. B., Bickham, S. T., Bolyanatz, A., Chavez, B., De Smet, D., Díaz, C., Fančovičová, J., Fux, M., Giraldo-Perez, P., Hu, A., Kamble, S. V., Kameda, T., Li, N. P., Luberti, F. R., Prokop, P., Quintelier, K., Scelza, B. A., Shin, H. J., Soler, M., Stieger, S., Toyokawa, W., van den Hende, E. A., Viciana-Asensio, H., Yildizhan, S. E., Yong, J. C., Yuditha, T. & Zhou, Y. (2016) Detecting affiliation in colaughter across 24 societies, Proceedings of the National Academy of Sciences 113(17):4682–87.CrossRefGoogle ScholarPubMed
Chemero, A. (2011) Radical embodied cognitive science. MIT press.Google Scholar
Christensen, W. & Michael, J. (2016) From two systems to a multi-systems architecture for mindreading. New Ideas in Psychology 40:4864.CrossRefGoogle Scholar
Clark, A. (2017b) How to knit your own Markov blanket. In Philosophy and predictive processing, eds. Metzinger, T. K. & Wiese, W.. Frankfurt am Main: MIND Group.Google Scholar
Coakley, S. & Shelemay, K. K. (2007) Pain and its transformations: The interface of biology and culture. Harvard University Press.Google Scholar
Constant, A., Clark, A. & Friston, J. (2019a) Representation wars: Enacting an armistice through active inference. PhilSci Archive preprint. http://philsci-archive.pitt.edu/16641/.Google Scholar
Constant, A., Ramstead, M. J. D., Veissière, S. P. L. & Friston, K. J. (2019b) Regimes of expectations: An active inference model of social conformity and decision making. Frontiers in Psychology 10:679. https://doi.org/10.3389/fpsyg.2019.00679.CrossRefGoogle Scholar
Csibra, G. & Gergely, G. (2009) Natural pedagogy. Trends in Cognitive Sciences 13(4):148–53.CrossRefGoogle ScholarPubMed
Cullen, M., Davey, B., Friston, K. J. & Moran, R. J. (2018) Active inference in OpenAI Gym: A paradigm for computational investigations into psychiatric illness. Biological Psychiatry: Cognitive Neuroscience and Neuroimaging 3(9):809–18. https://doi.org/10.1016/j.bpsc.2018.06.010.Google ScholarPubMed
De Bruin, L. C. & Strijbos, D. W. (2020) Does confabulation pose a threat to first-person authority? Mindshaping, self-regulation and the importance of self-know-how. Topoi 39(1):151–61. doi:10.1007/s11245-019-09631-y.CrossRefGoogle Scholar
Di Paolo, E. A., Cuffari, E. C. & De Jaegher, H. (2018) Linguistic bodies: The continuity between life and language. MIT Press.CrossRefGoogle Scholar
Diamond, J. (2010) Two views of collapse. Nature 463(7283):880–81.CrossRefGoogle Scholar
Friston, K. J. (2010) The free-energy principle: A unified brain theory? Nature Reviews Neuroscience 11(2):127–38. https://doi.org/10.1038/nrn2787.CrossRefGoogle ScholarPubMed
Friston, K. J., FitzGerald, T., Rigoli, F., Schwartenbeck, P., O'Doherty, J. & Pezzulo, G. (2016) Active inference and learning. Neuroscience and Biobehavioral Reviews 68:862–79.CrossRefGoogle Scholar
Friston, K., FitzGerald, T., Rigoli, F., Schwartenbeck, P. & Pezzulo, G. (2017a) Active inference: A process theory. Neural Computation 29(1):149. https://doi.org/10.1162/NECO_a_00912.CrossRefGoogle Scholar
Friston, K. J. & Friston, D. A. (2013) A free energy formulation of music generation and perception: Helmholtz revisited. In Sound-perception-performance. ed. Bader, R., pp. 4369. Springer, Heidelberg: Springer International Publishing.CrossRefGoogle Scholar
Friston, K. J., Lin, M., Frith, C. D., Pezzulo, G., Hobson, J. A. & Ondobaka, S. (2017b) Active inference, curiosity and insight. Neural Computation 29(10):2633–83.CrossRefGoogle Scholar
Gaines, A. D. & Farmer, P. E. (1986) Visible saints: Social cynosures and dysphoria in the Mediterranean tradition. Culture, Medicine and Psychiatry 10(4):295330.CrossRefGoogle ScholarPubMed
Gendron, M. & Barrett, L. F. (2018) Emotion perception as conceptual synchrony. Emotion Review 10(2):101–10.CrossRefGoogle Scholar
Gibson, J. J. (1986) The ecological approach to visual perception. Lawrence Erlbaum.Google Scholar
Gold, J. & Gold, I. (2015) Suspicious minds: How culture shapes madness. Free Press.Google Scholar
Henrich, J. & Gil-White, F. J. (2001) The evolution of prestige: Freely conferred deference as a mechanism for enhancing the benefits of cultural transmission. Evolution and Human Behavior 22(3):165–96.CrossRefGoogle ScholarPubMed
Hesp, C., Smith, R., Allen, M., Friston, K. & Ramstead, M. (2019) Deeply felt affect: The emergence of valence in deep active inference. https://doi.org/10.31234/osf.io/62pfd.CrossRefGoogle Scholar
Hills, T. T. (2019) The dark side of information proliferation. Perspectives on Psychological Science 14(3):323–30.CrossRefGoogle ScholarPubMed
Hoemann, K., Xu, F. & Barrett, L. F. (2019) Emotion words, emotion concepts, and emotional development in children: A constructionist hypothesis. Developmental Psychology 55(9):1830–49.CrossRefGoogle ScholarPubMed
Hohwy, J. (2016) The self-evidencing brain. Noûs 50(2):259–85. doi:10.1111/nous.12062.CrossRefGoogle Scholar
Hrdy, S. B. (2011) Mothers and others. Harvard University Press.Google Scholar
Huron, D. B. (2006) Sweet anticipation: Music and the psychology of expectation. MIT Press.CrossRefGoogle Scholar
Jackson, J. C., Jong, J., Bilkey, D., Whitehouse, H., Zollmann, S., McNaughton, C. & Halberstadt, J. (2018) Synchrony and physiological arousal increase cohesion and cooperation in large naturalistic groups. Scientific Reports 8(1):127.CrossRefGoogle ScholarPubMed
Kass, R. E. & Steffey, D. (1989) Approximate Bayesian inference in conditionally independent hierarchical models (parametric empirical Bayes models). Journal of the American Statistical Association 84(407):717–26.CrossRefGoogle Scholar
Keller, G. B. & Mrsic-Flogel, T. D. (2018) Predictive processing: A canonical cortical computation. Neuron 100(2):424–35.CrossRefGoogle ScholarPubMed
Kelso, J. A. S. (1994) The informational character of self-organized coordination dynamics. Human Movement Science 13(3):393413.CrossRefGoogle Scholar
Kirchhoff, M., Parr, T., Palacios, E., Friston, K. & Kiverstein, J. (2018) The Markov blankets of life: Autonomy, active inference and the free energy principle. Journal of the Royal Society Interface 15(138):111. 20170792.CrossRefGoogle ScholarPubMed
Kitayama, S., Duffy, S., Kawamura, T. & Larsen, J. T. (2003) Perceiving an object and its context in different cultures: A cultural look at new look. Psychological Science 14(3):201–6.CrossRefGoogle Scholar
Kiverstein, J., Miller, M. & Rietveld, E. (2019) The feeling of grip: Novelty, error dynamics and the predictive brain. Synthese 196(7):2847–69. https://doi.org/10.1007/s11229-017-1583-9.CrossRefGoogle Scholar
Konvalinka, I., Xygalatas, D., Bulbulia, J., Schjødt, U., Jegindø, E. M., Wallot, S., Van Orden, G. & Roepstorff, A. (2011) Synchronized arousal between performers and related spectators in a fire-walking ritual. Proceedings of the National Academy of Sciences 108(20):8514–19.CrossRefGoogle Scholar
MacKay, D. J. (1992) Information-based objective functions for active data selection. Neural Computation 4(4):590604.CrossRefGoogle Scholar
Miller, C. T., Freiwald, W. A., Leopold, D. A., Mitchell, J. F., Silva, A. C. & Wang, X. (2016) Marmosets: A neuroscientific model of human social behavior. Neuron 90(2):219–33.CrossRefGoogle ScholarPubMed
Mirza, M. B., Adams, R. A., Mathys, C. D. & Friston, K. J. (2016) Scene construction, visual foraging, and active inference. Frontiers in Computational Neuroscience 10(56):116. https://doi.org/10.3389/fncom.2016.00056.CrossRefGoogle ScholarPubMed
Moore, R. (2013) Social learning and teaching in chimpanzees. Biology and Philosophy 28(6):879901.CrossRefGoogle Scholar
Moulin, C. & Souchay, C. (2015) An active inference and epistemic value view of metacognition. Cognitive Neuroscience 6(4):221–22.CrossRefGoogle ScholarPubMed
Parkinson, B. (2019) Intragroup emotion convergence: Beyond contagion and social appraisal. Personality and Social Psychology Review, October, 1088868319882596.CrossRefGoogle Scholar
Parr, T. & Friston, K. J. (2017a) Uncertainty, epistemics and active inference. Journal of the Royal Society Interface 14(136):20170376. https://doi.org/10.1098/rsif.2017.0376.CrossRefGoogle Scholar
Pattee, H. H. (1977) Dynamic and linguistic modes of complex systems. International Journal of General Systems 3(4):259–66.CrossRefGoogle Scholar
Perreault, C., Moya, C. & Boyd, R. (2012) A Bayesian approach to the evolution of social learning. Evolution and Human Behavior 33(5):449459.CrossRefGoogle Scholar
Pezzulo, G., Cartoni, E., Rigoli, F., Pio-Lopez, L. & Friston, K. J. (2016) Active inference, epistemic value, and vicarious trial and error. Learning & Memory 23(7):322–38.CrossRefGoogle ScholarPubMed
Poirier, P., Faucher, L. & Bourdon, J.-N. (2019) Cultural blankets: Epistemological pluralism in the evolutionary epistemology of mechanisms. Journal for General Philosophy of Science September, 116. https://doi.org/10.1007/s10838-019-09472-8.Google Scholar
Provine, R. R. (2017) Laughter as an approach to vocal evolution: The bipedal theory. Psychonomic Bulletin & Review 24(1):238–44.CrossRefGoogle ScholarPubMed
Pyszczynski, T., Motyl, M. & Abdollahi, A. (2009) Righteous violence: killing for God, country, freedom and justice. Behavioral Sciences of Terrorism and Political Aggression 1(1):1239.CrossRefGoogle Scholar
Ramstead, M. J. D., Badcock, P. B. & Friston, K. J. (2018) Answering Schrödinger's question: A free-energy formulation. Physics of Life Reviews 24:116. https://doi.org/10.1016/j.plrev.2017.09.001.CrossRefGoogle ScholarPubMed
Ramstead, M. J. D., Veissière, S. P. L. & Kirmayer, L. J. (2016) Cultural affordances: Scaffolding local worlds through shared intentionality and regimes of attention. Frontiers in Psychology 7:1090.CrossRefGoogle ScholarPubMed
Schmidhuber, J. (2006) Developmental robotics, optimal artificial curiosity, creativity, music, and the fine arts. Connection Science 18(2):173–87.CrossRefGoogle Scholar
Schwartenbeck, P., Fitzgerald, T., Dolan, R. J. & Friston, K. (2013) Exploration, novelty, surprise, and free energy minimization. Frontiers in Psychology 4(October):710.CrossRefGoogle ScholarPubMed
Seth, A. K. & Friston, K. J. (2016) Active interoceptive inference and the emotional brain. Philosophical Transactions of the Royal Society B: Biological Sciences 371(1708):20160007.CrossRefGoogle ScholarPubMed
Smith, R., Lane, R. D., Parr, T. & Friston, K. J. (2019a) Neurocomputational mechanisms underlying emotional awareness: insights afforded by deep active inference and their potential clinical relevance. Neuroscience & Biobehavioral Reviews 107:473–91.CrossRefGoogle Scholar
Stendel, M., Ramstead, M. J. D. & Veissiére, S. (in press) Internet sociality. In Culture, mind, and brain: Emerging concepts, methods, and applications, chapter 20. eds. Kirmayer, L. J., Worthman, C. M., Kitayama, S., Lemelson, R., & Cummings, C. A.. Cambridge, UK: Cambridge University Press.Google Scholar
Stevenson, R. J., Case, T. I., Oaten, M. J., Stafford, L. & Saluja, S. (2019) A proximal perspective on disgust. Emotion Review 11(3):209–25.CrossRefGoogle Scholar
Sterelny, K. (2011) From hominins to humans: how sapiens became behaviourally modern. Philosophical Transactions of the Royal Society B: Biological Sciences 366(1566):809–22.CrossRefGoogle ScholarPubMed
Sterelny, K. (2007) Social intelligence, human intelligence and niche construction. Philosophical Transactions of the Royal Society B: Biological Sciences 362(1480):719–30.CrossRefGoogle ScholarPubMed
Stout, D. (2011) Stone toolmaking and the evolution of human culture and cognition. Philosophical Transactions of the Royal Society B: Biological Sciences 366(1567):1050–59.CrossRefGoogle ScholarPubMed
Stout, D., Passingham, R., Frith, C., Apel, J. & Chaminade, T. (2011) Technology, expertise and social cognition in human evolution. European Journal of Neuroscience 33(7):1328–38.CrossRefGoogle ScholarPubMed
Tajfel, H., Billig, M. G., Bundy, R. P. & Flament, C. (1971) Social categorization and intergroup behaviour. European Journal of Social Psychology 1(2):149–78. https://doi.org/10.1002/ejsp.2420010202.CrossRefGoogle Scholar
Tennie, C., Call, J. & Tomasello, M. (2009) Ratcheting up the ratchet: On the evolution of cumulative culture. Philosophical Transactions of the Royal Society of London, Series B: Biological Sciences 364(1528):2405–15.CrossRefGoogle ScholarPubMed
Toelch, U. & Dolan, R. J. (2015) Informational and normative influences in conformity from a neurocomputational perspective. Trends in Cognitive Sciences 19(10):579–89.CrossRefGoogle ScholarPubMed
Tomasello, M. (2009) Why we cooperate. MIT Press.CrossRefGoogle Scholar
Tomasello, M., Kruger, A. C. & Ratner, H. H. (1993) Cultural learning. Behavioral and Brain Sciences 16(3):495511.CrossRefGoogle Scholar
Torrance, S. & Schumann, F. (2019) The spur of the moment: What jazz improvisation tells cognitive science. AI & Society 34(2):251–68.CrossRefGoogle Scholar
Tschacher, W. & Haken, H. (2007) Intentionality in non-equilibrium systems? The functional aspects of self-organized pattern formation. New Ideas in Psychology 25(1):115.CrossRefGoogle Scholar
Vaish, A., Grossmann, T. & Woodward, A. (2008) Not all emotions are created equal: The negativity bias in social–emotional development. Psychological Bulletin 134(3):383403.CrossRefGoogle ScholarPubMed
Van de Cruys, S., Chamberlain, R. & Wagemans, J. (2017) Tuning in to art: A predictive processing account of negative emotion in art (commentary). The Behavioral and Brain Sciences, June 40. http://research.gold.ac.uk/19928/.CrossRefGoogle Scholar
Van de Cruys, S. & Wagemans, J. (2011) Putting reward in art: A tentative prediction error account of visual art. I-Perception 2(9):1035–62.CrossRefGoogle ScholarPubMed
Veissière, S. (2018) Cultural Markov blankets? Mind the other minds gap! Comment on “Answering Schrödinger's question: A free-energy formulation” by Maxwell James Désormeau Ramstead et al. Physics of Life Reviews 24:4749.CrossRefGoogle Scholar
Veissière, S. P. & Stendel, M. (2018) Hypernatural monitoring: a social rehearsal account of smartphone addiction. Frontiers in Psychology 9:141.CrossRefGoogle ScholarPubMed
Vuust, P., Dietz, M. J., Witek, M. & Kringelbach, M. L. (2018) Now you hear it: A predictive coding model for understanding rhythmic incongruity. Annals of the New York Academy of Sciences 1423(1):1929.CrossRefGoogle Scholar
Vygotsky, L. S. (1980) Mind in society: The development of higher psychological processes. Harvard University Press.CrossRefGoogle Scholar
Wiessner, P. W. (2014) Embers of society: Firelight talk among the Ju/'hoansi Bushmen. Proceedings of the National Academy of Sciences 111(39):14027–35.CrossRefGoogle ScholarPubMed
Yamagishi, T. & Hashimoto, H. (2016) Social niche construction. Current Opinion in Psychology 8:119–24. http://dx.doi.org/10.1016/j.copsyc.2015.10.00.CrossRefGoogle ScholarPubMed
Zawidzki, T. W. (2013) Mindshaping: A new framework for understanding human social cognition. MIT Press.CrossRefGoogle Scholar
Zeder, M. A. (2012) The broad spectrum revolution at 40: resource diversity, intensification, and an alternative to optimal foraging explanations. Journal of Anthropological Archaeology 31(3):241–64.CrossRefGoogle Scholar