Hostname: page-component-745bb68f8f-f46jp Total loading time: 0 Render date: 2025-02-07T04:49:55.822Z Has data issue: false hasContentIssue false

On the neural implausibility of the modular mind: Evidence for distributed construction dissolves boundaries between perception, cognition, and emotion

Published online by Cambridge University Press:  05 January 2017

Leor M. Hackel
Affiliation:
Department of Psychology, New York University, New York, NY 10003. leor.hackel@nyu.edu
Grace M. Larson
Affiliation:
Department of Psychology, Northwestern University, Evanston, IL 60208. gracelarson2017@u.northwestern.edu
Jeffrey D. Bowen
Affiliation:
Department of Psychological and Brain Sciences, University of California, Santa Barbara, Santa Barbara, CA 93106. bowen@psych.ucsb.edu
Gaven A. Ehrlich
Affiliation:
Department of Psychology, Syracuse University, Syracuse, NY 13244. gaehrlic@syr.edu
Thomas C. Mann
Affiliation:
Department of Psychology, Cornell University, Ithaca, NY 14853. tcm79@cornell.edu
Brianna Middlewood
Affiliation:
Department of Psychology, Pennsylvania State University, University Park, PA 16801. blm266@psu.educarlosgarrido@psu.edu
Ian D. Roberts
Affiliation:
Department of Psychology, The Ohio State University, Columbus, OH 43210. roberts.1134@osu.edu
Julie Eyink
Affiliation:
Department of Psychological and Brain Sciences, Indiana University, Bloomington, IN 47405. jeyink@indiana.edu
Janell C. Fetterolf
Affiliation:
Department of Psychology, Rutgers University, Piscataway, NJ 08854. j.fetterolf@gmail.com
Fausto Gonzalez
Affiliation:
Department of Psychology, University of California, Berkeley, Berkeley, CA 94720. fjgonzal@berkeley.edu
Carlos O. Garrido
Affiliation:
Department of Psychology, Pennsylvania State University, University Park, PA 16801. blm266@psu.educarlosgarrido@psu.edu
Jinhyung Kim
Affiliation:
Department of Psychology, Texas A&M University, College Station, TX 77840. jhkim82@tamu.edu
Thomas C. O'Brien
Affiliation:
Department of Psychology, University of Massachusetts, Amherst, Amherst, MA 01003. tcobrien@psych.umass.edu
Ellen E. O'Malley
Affiliation:
Department of Psychology, State University of New York, Albany, Albany, NY 12222. ellen.e.omalley@gmail.com
Batja Mesquita
Affiliation:
Center for Social and Cultural Psychology, University of Leuven, B-3000 Leuven, Belgium.mesquita@psy.kuleuven.be
Lisa Feldman Barrett
Affiliation:
Department of Psychology, Northeastern University, Boston, MA 02115. l.barrett@neu.edu Department of Psychiatry and the Martinos Center for Biomedical Imaging, Massachusetts General Hospital/Harvard Medical School, Boston, MA 02114.

Abstract

Firestone & Scholl (F&S) rely on three problematic assumptions about the mind (modularity, reflexiveness, and context-insensitivity) to argue cognition does not fundamentally influence perception. We highlight evidence indicating that perception, cognition, and emotion are constructed through overlapping, distributed brain networks characterized by top-down activity and context-sensitivity. This evidence undermines F&S's ability to generalize from case studies to the nature of perception.

Type
Open Peer Commentary
Copyright
Copyright © Cambridge University Press 2016 

Firestone & Scholl (F&S) rely on an outdated view of the mind to argue against top-down influences on perception. We highlight three of their assumptions that are untenable given contemporary neuroscience evidence: that the brain is modular, reflexively stimulus-driven, and context-independent. This evidence undermines their leap from critiques of individual studies to the conclusion that cognition does not affect perception.

The brain is not modular

F&S assume that the words cognition and perception refer to distinct types of mental processes (“natural kind” categories; Barrett Reference Barrett2009) localized to spatially distinct sets of neurons in the brain, sometimes called modules or mental organs (Fodor Reference Fodor1983; Gall Reference Gall1835; Pinker Reference Pinker1997). As an intuition pump, the authors ask readers to “imagine looking at an apple in a supermarket and appreciating its redness” (sect. 1, para. 3). “That is perception,” they suggest, compared with appreciating an apple's price, which they argue is “cognition.” This modular view assumes that the brain's visual processing is “encapsulated” from nonperceptual influences. For example, F&S propose that context effects on perception are fully encapsulated within the visual system and therefore are not a meaningful example of top-down effects.

This approach promotes using phenomenology to guide scientific insight, which epitomizes naive realism – the belief that one's experiences reveal the objective realities of the world (Hart et al. Reference Hart, Tullett, Shreves and Fetterman2015; Ross & Ward Reference Ross, Ward, Reed, Turiel and Brown1996). The distinctive experiences of seeing and thinking do not reveal a natural boundary in brain structure or function. The idea that the brain contains separate “mental organs” stems from an ancient view of neuroanatomy (see Finger Reference Finger2001 for a history of neuroanatomy). Modern neuroanatomy reveals that the brain is better understood as one large, interconnected network of neurons, bathed in a chemical system, that can be parsed as a set of broadly distributed, dynamically changing, interacting systems (Marder Reference Marder2012; Sporns Reference Sporns2011; van den Heuvel & Sporns Reference van den Heuvel and Sporns2013). These systems are domain general: Their interactions constitute mental phenomena that we consider distinct, such as perception, cognition, emotion, and action (for discussions, see Anderson Reference Anderson2014; Barrett Reference Barrett2009; Barrett & Satpute Reference Barrett and Satpute2013; Lindquist & Barrett Reference Lindquist and Barrett2012; Pessoa 2014; Yeo et al. Reference Yeo, Krienen, Eickhoff, Yaakub, Fox, Buckner, Asplund and Chee2015). For example, Figure 1 displays a meta-analytic summary of more than 5,600 neuroimaging studies from the Neurosynth database (www.neurosynth.org; Yarkoni et al. Reference Yarkoni, Poldrack, Nichols, Van Essen and Wager2011) showing brain “hot spots” that evidence a consistent increase in activity across a wide variety of tasks spanning the domains of perception, cognition, emotion, and action (for other evidence, see Yeo et al. Reference Yeo, Krienen, Eickhoff, Yaakub, Fox, Buckner, Asplund and Chee2015). Seemingly distinct mental phenomena are implemented as dynamic brain states, not as individual, static mental organs, violating the assumption that the mind has intuitive “joints.”

Figure 1. Results of a forward inference analysis, revealing hot spots in the brain that are active across 5,633 studies from the Neurosynth database. Activations are thresholded at FWE P<0.05. Figure taken from Clark-Polner et al. (Reference Clark-Polner, Wager, Satpute, Barrett, Barrett, Lewis and Haviland-Jones2016).

The brain is not reflexively “stimulus driven.”

F&S assume that perception is reflexively driven by sensory inputs from the world that are commonly referred to as “bottom-up input.” For example, they describe cross-modal effects and context effects on perception as occurring “reflexively” based on visual input alone. But again, neuroanatomy is inconsistent with claims of reflexiveness. Cortical cytoarchitecture is linked to information flow within the brain (see Barbas, Reference Barbas2015; for discussion, see Chanes & Barrett, Reference Chanes and Barrett2016) and shows how representations of the past, created in the vast repertoire of connectivity patterns within the cortex (referred to as “top-down” and colloquially called “memory” or “cognition”), are always involved in perception, and often dominate. Vision is largely a top-down affair (e.g., Gilbert & Li Reference Gilbert and Li2013). For example, by most estimates, only 10% of the synapses from incoming neurons to primary visual cortex originate in the thalamus, which brings sensory input from the retina; the remaining 90% of these synapses originate in the cortex itself (Peters Reference Peters2002). Indeed, a bottom-up, reactive brain would be metabolically expensive and anatomically infeasible (e.g., see Sterling & Laughlin Reference Sterling and Laughlin2015). F&S dismiss top-down connections as irrelevant to their argument, because knowledge of anatomical connections is “common ground for all parties” (sect. 2.4, para. 2) and so these connections cannot be “revolutionary.” The issue of their novelty is irrelevant, however: F&S are arguing a position that violates the functional architecture of the brain.

Top-down anatomical connections are consistent with a predictive brain that models the world through active inference (e.g., Bar Reference Bar2007; Barrett & Simmons Reference Barrett and Simmons2015; Clark Reference Clark2013; den Ouden et al. Reference Den Ouden, Kok and De Lange2012; Friston Reference Friston2010; Rao & Ballard Reference Rao and Ballard1999). This process not only allows for, but also is predicated on, the existence of top-down effects. Specifically, the brain generatively synthesizes past experiences to continually construct predictions about the world, estimating their Bayesian prior probabilities relative to incoming sensory input. The brain then refines predictions accordingly. This means that top-down influences typically drive perception, and are constrained or corrected by incoming sensory inputs, rather than the other way around. Indeed, when humans and nonhuman animals change their expectations, sensory neurons change their firing patterns (e.g., Alink et al. Reference Alink, Schwiedrzik, Kohler, Singer and Muckli2010; Egner et al. Reference Egner, Monti and Summerfield2010; Makino & Komiyama Reference Makino and Komiyama2015; for a discussion, see Chanes & Barrett, Reference Chanes and Barrett2016).

Although F&S acknowledge unconscious, reflexive inference in the visual system, they dispute the idea that “cognitive inferences” shape perception. Their distinction between reflexive visual inference and cognitive inference again advocates for a boundary that is rooted in naive realism, between reflex and volition (Descartes 1649/Reference Descartes1989). It has long been known that the main distinction between automatic and controlled processing (or System 1 and System 2) is primarily phenomenological (for a discussion, see Barrett et al. Reference Barrett, Tugade and Engle2004). Because the brain's control networks are involved in processing prediction error (applying attention to neurons to shape which inputs from the world are considered information and which are noise (Barrett & Simmons Reference Barrett and Simmons2015; Behrens et al. Reference Behrens, Woolrich, Walton and Rushworth2007; Gottlieb Reference Gottlieb2012; Pezzulo Reference Pezzulo2012), they are always engaged to some degree; relative differences in activity should not be confused with “activation” and “deactivation” or “on” and “off”). It is more consistent with neuroanatomy to assume a continuum of brain modes, with one end characterized by brain states constructed primarily with prediction (e.g., phenomena called “memory,” “daydreaming,” “mind wandering,” etc.), and the other end characterized by brain states where prediction error dominates (e.g., novelty-processing, learning, etc.), with a range of gradations in between. Evidence that the brain is active, not merely reactive, undermines the idea that perception is a bottom-up reflex isolated from cognition.

The brain is context-dependent

In discussing their “El Greco” fallacy, F&S implicitly assume that top-down effects would uniformly influence all elements in a visual field. This represents a fundamental misunderstanding of how the brain constructs Bayesian inferences, and in particular, reveals an underappreciated role of context. The authors argue that if an aperture looks more narrow when passing through it holding a wide rod (Stefanucci & Geuss Reference Stefanucci and Geuss2009), then a second aperture that a participant adjusts to match the perceived width of the first (but is not required to pass through) should look similarly narrow, at least while one is holding the same rod. Following the authors' logic, these two distortions should cancel out, leaving no measurable impact of holding the rod on width estimates. However, the first aperture is meant to be passed through, whereas the second is not, and it is well known that requirements for action strongly shape moment-to-moment processing (Cisek & Klaska Reference Cisek and Klaska2010). To assume that the top-down influence on width estimates would be the same and therefore cancel out under these distinct conditions suggests a misunderstanding of top-down effects.

Context shapes Bayesian “priors” that inform the brain's sensory predictions (Clark Reference Clark2013; Friston Reference Friston2010). As a consequence, sensory neurons behave differently when involved in contextually distinct perceptual tasks (Gilbert & Sigman Reference Gilbert and Sigman2007). Requirements for action (dictated by the task context) exert a top-down influence on the processing of visual information. In light of these considerations, research questions should shift from, “Are there top-down effects on perception?” in a global, undifferentiated way to “In what contexts and at what level in a hierarchy of predictions do different top-down effects emerge in a nuanced way?”

We agree with F&S that some studies of top-down effects may capture processes that are not traditionally categorized as perception (e.g., the impact of demand characteristics on judgment), and that studies designed for disconfirmation are an important part of theory testing. But the main thrust of their critique is based on folk categories of perception and cognition as reified in a modular, reactive, and context-insensitive brain. These assumptions, and the conclusion they support, are untenable given the considerable neuroscientific evidence that processing in the human brain is distributed, active, and exquisitely sensitive to context.

ACKNOWLEDGMENTS

This manuscript was supported by a US National Institute on Aging grant (R01AG030311), a US National Institute of Child Health and Human Development grant (R21 HD076164), and contracts from the US Army Research Institute for the Behavioral and Social Sciences (contracts W5J9CQ12C0049 and W5J9CQ11C0046) to Barrett. The views, opinions, and findings contained in this article are those of the authors and should not be construed as an official position, policy, or decision of the US National Institutes of Health or Department of the Army unless so designated by other documents.

References

Alink, A., Schwiedrzik, C. M., Kohler, A., Singer, W. & Muckli, L. (2010) Stimulus predictability reduces responses in primary visual cortex. The Journal of Neuroscience 30(8):2960–66.Google Scholar
Anderson, M. L. (2014) After phrenology: Neural reuse and the interactive brain. MIT Press.CrossRefGoogle Scholar
Bar, M. (2007) The proactive brain: Using analogies and associations to generate predictions. Trends in Cognitive Sciences 11(7):280–89.CrossRefGoogle ScholarPubMed
Barbas, H. (2015) General cortical and special prefrontal connections: Principles from structure to function. Annu Rev Neurosci 38:269–89.CrossRefGoogle ScholarPubMed
Barrett, L. F. (2009) The future of psychology: Connecting mind to brain. Perspectives on Psychological Science 4(4):326–39.CrossRefGoogle Scholar
Barrett, L. F. & Satpute, A. B. (2013) Large-scale brain networks in affective and social neuroscience: Towards an integrative functional architecture of the brain. Current Opinion in Neurobiology 23(3):361–72.Google Scholar
Barrett, L. F. & Simmons, W. K. (2015) Interoceptive predictions in the brain. Nature Reviews Neuroscience 16:419–29.Google Scholar
Barrett, L. F., Tugade, M. M. & Engle, R. W. (2004) Individual differences in working memory capacity and dual-process theories of the mind. Psychological Bulletin 130:553–73. PMCID: PMC1351135.Google Scholar
Behrens, T. E. J., Woolrich, M. W., Walton, M. E. & Rushworth, M. F. A. (2007) Learning the value of information in an uncertain world. Nature Neuroscience 10:1214–21.Google Scholar
Chanes, L. & Barrett, L. F. (2016) Redefining the role of limbic areas in cortical processing. Trends in Cognitive Sciences 20(2):96106.CrossRefGoogle ScholarPubMed
Cisek, P. & Klaska, J. F. (2010) Neural mechanisms for interacting with a world full of action choices. Annual Review of Neuroscience 33:269–98.CrossRefGoogle ScholarPubMed
Clark, A. (2013) Whatever next? Predictive brains, situated agents, and the future of cognitive science. Behavioral and Brain Sciences 36(3):181204.CrossRefGoogle ScholarPubMed
Clark-Polner, E., Wager, T. D., Satpute, A. B. & Barrett, L. F. (2016) “Brain-based fingerprints?” Variation, and the search for neural essences in the science of emotion. In: The handbook of emotion, fourth edition, ed. Barrett, L. F., Lewis, M. & Haviland-Jones, J. M., pp. 146–65. Guilford.Google Scholar
Den Ouden, H. E., Kok, P. & De Lange, F. P. (2012) How prediction errors shape perception, attention, and motivation. Frontiers in Psychology 3:548.Google Scholar
Descartes, R. (1989) On the passions of the soul. Hackett (Original work published 1649).Google Scholar
Egner, T., Monti, J. M. & Summerfield, C. (2010) Expectation and surprise determine neural population responses in the ventral visual stream. The Journal of Neuroscience 30(49):16601–608.CrossRefGoogle ScholarPubMed
Finger, S. (2001) Origins of neuroscience: A history of explorations into brain function. Oxford University Press.Google Scholar
Fodor, J. A. (1983) Modularity of mind: An essay on faculty psychology. MIT Press.CrossRefGoogle Scholar
Friston, K (2010) The free-energy principle: A unified brain theory? Nature Reviews Neuroscience 11:127–38.Google Scholar
Gall, F. J. (1835) On the functions of the brain and of each of its parts: With observations on the possibility of determining the instincts, propensities, and talents, or the moral and intellectual dispositions of men and animals, by the configuration of the brain and head, vol. 1. Marsh, Capen & Lyon.Google Scholar
Gilbert, C. D. & Li, W. (2013) Top-down influences on visual processing. Nature Reviews Neuroscience 14(5):350–63.Google Scholar
Gilbert, C. D. & Sigman, M. (2007) Brain states: Top-down influences in sensory processing. Neuron 54(5):677–96.Google Scholar
Gottlieb, J. (2012) Attention, learning, and the value of information. Neuron 76(2):281–95.CrossRefGoogle ScholarPubMed
Hart, W., Tullett, A. M., Shreves, W. B. & Fetterman, Z. (2015) Fueling doubt and openness: Experiencing the unconscious, constructed nature of perception induces uncertainty and openness to change. Cognition 137:18.Google Scholar
Lindquist, K. A. & Barrett, L. F. (2012) A functional architecture of the human brain: Emerging insights from the science of emotion. Trends in Cognitive Sciences 16(11):533–40.CrossRefGoogle ScholarPubMed
Makino, H. & Komiyama, T. (2015) Learning enhances the relative impact of top-down processing in the visual cortex. Nature Neuroscience 18(8):1116–22.Google Scholar
Marder, E. (2012) Neuromodulation of neuronal circuits: Back to the future. Neuron 76(1):111.Google Scholar
Pessoa, L. (2015) Précis on The Cognitive-Emotional Brain . Behavioral and Brain Sciences 38:e71.Google Scholar
Peters, A. (2002) Examining neocortical circuits: Some background and facts. Journal of Neurocytology 31 (3–5):183–93.Google Scholar
Pezzulo, G. (2012) An active inference view of cognitive control. Frontiers in Psychology 3:478.Google Scholar
Pinker, S. (1997) How the mind works. Norton.Google Scholar
Rao, R. P. & Ballard, D. H. (1999) Predictive coding in the visual cortex: A functional interpretation of some extra-classical receptive-field effects. Nature Neuroscience 2(1):7987.CrossRefGoogle ScholarPubMed
Ross, L. & Ward, A. (1996) Naïve realism in everyday life: Implications for social conflict and misunderstanding. In: Values and knowledge, ed. Reed, E. S., Turiel, E. & Brown, T., pp. 103–35. Erlbaum.Google Scholar
Sporns, O. (2011) The human connectome: A complex network. Annals of the New York Academy of Sciences 1224(1):109–25.Google Scholar
Stefanucci, J. K. & Geuss, M. N. (2009) Big people, little world: The body influences size perception. Perception 38:1782–95.Google Scholar
Sterling, P. & Laughlin, S. (2015) Principles of neural design. MIT Press.Google Scholar
van den Heuvel, M. P. & Sporns, O. (2013) Network hubs in the human brain. Trends in Cognitive Sciences 17(12):683–96.CrossRefGoogle ScholarPubMed
Yarkoni, T., Poldrack, R. A., Nichols, T. E., Van Essen, D. C. & Wager, T. D. (2011) Large-scale automated synthesis of human functional neuroimaging data. Nature Methods 8(8):665–70.CrossRefGoogle ScholarPubMed
Yeo, B. T., Krienen, F. M., Eickhoff, S. B., Yaakub, S. N., Fox, P. T., Buckner, R. L., Asplund, C. L. & Chee, M. W. (2015) Functional specialization and flexibility in human association cortex. Cerebral Cortex 25(10):3654–72.CrossRefGoogle ScholarPubMed
Figure 0

Figure 1. Results of a forward inference analysis, revealing hot spots in the brain that are active across 5,633 studies from the Neurosynth database. Activations are thresholded at FWE P<0.05. Figure taken from Clark-Polner et al. (2016).