Towards a universal neurobiological architecture for learning to read

Marcin Szwed; Fabien Vinckier; Laurent Cohen; Stanislas Dehaene

doi:10.1017/S0140525X12000283

Published online by Cambridge University Press: 29 August 2012

Marcin Szwed: Affiliation:
Zakład Psychofizjologii, Instytut Psychologii (Department of Psychophysiology, Institute of Psychology), Jagiellonian University, 31120 Kraków, Poland; Département de Psychologie, Aix-Marseille Université, 13284 Marseille, France; Laboratoire de Psychologie Cognitive, CNRS, UMR 6146, 13284 Marseille, France. mfszwed@gmail.com
Fabien Vinckier: Affiliation:
Faculté de Médecine Pitié-Salpêtrière, IFR 70, Université Pierre et Marie Curie (University of Paris 6), 75013 Paris, France; Institut National de la Santé et de la Recherche Médicale, Institut du Cerveau et de la Moelle Épinière, UMRS 975, 75013 Paris, France. fabien.vinckier@gmail.com
Laurent Cohen: Affiliation:
Faculté de Médecine Pitié-Salpêtrière, IFR 70, Université Pierre et Marie Curie (University of Paris 6), 75013 Paris, France; Institut National de la Santé et de la Recherche Médicale, Institut du Cerveau et de la Moelle Épinière, UMRS 975, 75013 Paris, France; Département de Neurologie, Groupe Hospitalier Pitié-Salpêtrière, and Assistance Publique–Hôpitaux de Paris, 75651 Paris, France. laurent.cohen@psl.ap-hop-paris.fr
Stanislas Dehaene: Affiliation:
Collège de France, 75005 Paris, France; Cognitive Neuroimaging Unit, Institut National de la Santé et de la Recherche Médicale, 91191 Gif sur Yvette, France; Division of Life Sciences, Institute of Bioimaging, Neurospin, Commissariat à l'Energie Atomique, 91191 Gif sur Yvette, France; Université Paris 11, 91405 Orsay, France. stanislas.dehaene@cea.fr; www.unicog.org

Abstract

Letter-position tolerance varies across languages. This observation suggests that the neural code for letter strings may also be subtly different. Although language-specific models remain useful, we should endeavor to develop a universal model of reading acquisition which incorporates crucial neurobiological constraints. Such a model, through a progressive internalization of phonological and lexical regularities, could perhaps converge onto the language-specific properties outlined by Frost.

Type: Open Peer Commentary
Information: Behavioral and Brain Sciences, Volume 35, Issue 5, October 2012, pp. 308–309

DOI: https://doi.org/10.1017/S0140525X12000283
Copyright: Copyright © Cambridge University Press 2012

“Cmabirdge” reads almost as well as “Cambridge,” but only in some languages. Ram Frost is right in pointing out that tolerance to letter-position swaps is not a universal feature of reading. His hypothesis that writing systems “optimally represent the languages' phonological spaces” (sect. 3, para. 1) is appealing and is indeed a crucial consideration when discussing the possibility of spelling reform – some variations in writing systems may be more “rational” than they first appear (Dehaene 2009, pp. 32–37). Does it follow, however, that current open-bigram models of orthographic processing are, in Ram Frost's words, “ill-advised”? And what is the best strategy to achieve a “universal model of reading”?

From a neuroscientific perspective, much insight can be gained from limited models that consider in detail not only the problems raised by a specific script and language, but also the neurobiological constraints on how the brain might solve them. Our bigram neuron hypothesis, which postulates that the left occipitotemporal visual word form area (VWFA) may contain neurons tuned to ordered letter pairs, was presented in this context as a useful solution to position-invariant recognition of written words in English, French, and related Roman scripts (Dehaene et al. 2005). A functional magnetic resonance imaging (fMRI) experiment aimed at testing the predictions of this model demonstrated that reading indeed relies on a hierarchy of brain areas sensitive to increasingly complex properties, from individual letters to bigrams and to higher-order combinations of abstract letter representations (Vinckier et al. 2007). These regions form a gradient of selectivity through the occipitotemporal cortex, with activation becoming more selective for higher-level stimuli towards the anterior fusiform region (Fig. 1) (see also Binder et al. 2006). Interestingly, a similar gradient may also exist in Chinese script (Chan et al. 2009). It would be important to probe it in Hebrew readers.

Figure 1. Hierarchical Coding of Letter Strings in the Ventral Visual Stream. Top: Design and examples of stimuli used, with an increasing structural similarity to real words. Bottom: fMRI results. The image illustrates the spatial layout of sensitivity of the occipitotemporal cortex to letter strings of different similarity to real words. Activations become more selective for higher-level stimuli (i.e., stimuli more similar to real words) toward the anterior fusiform regions. This is taken as evidence for a hierarchy of brain areas sensitive to increasingly complex properties, from individual letters to bigrams and to higher-order combinations of letters. (Adapted from Vinckier et al. 2007.)
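To make the notion of an open-bigram code concrete, the following minimal sketch represents a letter string as the set of its ordered letter pairs and compares two strings by the overlap of those sets. It is an illustration only, not the model of Dehaene et al. (2005): the function names, the `max_gap` parameter (how many intervening letters a pair may span), and the Jaccard overlap measure are all assumptions introduced here.

```python
def open_bigrams(word, max_gap=2):
    """Return the set of ordered letter pairs in `word`.

    A pair (first letter, later letter) is included when at most `max_gap`
    letters intervene between them. `max_gap` is an illustrative assumption.
    """
    word = word.lower()
    n = len(word)
    return {
        word[i] + word[j]
        for i in range(n)
        for j in range(i + 1, min(i + max_gap + 2, n))
    }


def bigram_overlap(a, b, max_gap=2):
    """Jaccard overlap between the open-bigram sets of two strings."""
    set_a = open_bigrams(a, max_gap)
    set_b = open_bigrams(b, max_gap)
    return len(set_a & set_b) / len(set_a | set_b)


if __name__ == "__main__":
    target = "cambridge"
    # Transposed letters keep many ordered pairs intact ...
    print("transposed:", bigram_overlap("cmabirdge", target))
    # ... whereas replacing the same four letters destroys most of them.
    print("substituted:", bigram_overlap("cxybzwdge", target))
```

Under these assumptions, a transposed string such as “cmabirdge” retains more of the target's ordered pairs than a string in which the same positions are replaced by unrelated letters, which is the kind of letter-position tolerance at issue. Setting `max_gap=0` yields strictly contiguous bigrams, and so a code that is less tolerant of transpositions.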

We agree with Frost that developing a more general, language-universal model of reading acquisition is a major goal for future research. However, crucially, we would add that such a universal model should incorporate strong constraints from brain architecture and not just linguistics. Existing connectionist models typically incorporate few neurobiological constraints and, as a result, provide information-processing solutions that need not be realistic at the brain level. Reading is a ventral visual stream process that “recycles” existing visual mechanisms used for object recognition (Dehaene 2009; Szwed et al. 2009; 2011; however, see Reich et al. 2011). As such, it is heavily constrained by the limitations of the visual brain, for example, the necessity to process information step by step through distinct visual areas with increasing receptive fields (V1, V2, V3, V4, V5, LO, MT …). Implementing these constraints into general models has proven very challenging so far (although see Mozer 1987). Indeed, important advances in the field have been predominantly guided by narrow, language-specific theories that hardwire these constraints into their architectures. Nevertheless, the vast neurobiological knowledge about these regions should ultimately be tapped by a more general model. Starting from a generic, biologically realistic neuronal architecture, and using realistic synaptic plasticity rules, the future model would converge on a specific architecture for the VWFA in any language. It could include a Bayesian implementation of the informative fragments model, which falls close to predicting the real-life responses of ventral visual stream neurons involved in object recognition (Ullman 2007).

Would such a model, once developed, substantiate Frost's claim that the internal code for letter strings varies strongly across languages, depending on their phonology and word structure? Here, we should clear up a frequent confusion. During online processing, when an actual word is read by a fluent reader, magnetoencephalography (MEG) experiments, with their high temporal resolution, have shown that the first major response of the visual system, peaking roughly 130 msec after seeing a word, is determined overwhelmingly by the frequency of letter combinations that make up a word, whereas lexical and phonological effects come into play much later (Simos et al. 2002; Solomyak & Marantz 2010). Thus, in adults, the VWFA may reflect a relatively isolated stage of orthographic processing that is essentially immune to phonological and semantic influences (Dehaene & Cohen 2011; but see Price & Devlin 2011). However, this is not to say that, in the course of learning, the acquired orthographic code cannot be influenced by the needs of the phonological and semantic systems to which the VWFA ultimately projects. The anatomical localization of the VWFA is strongly influenced, not only by bottom-up visual constraints (Hasson et al. 2002), but also by the lateralization of the target spoken language (Pinel & Dehaene 2009). MEG shows that, in English readers, the visual word form system decomposes the words' morphology into prefixes, roots, and affixes about 170 msec after stimulus onset (Solomyak & Marantz 2010). Such decomposition is automatic and operates even with pseudo-affixed words like “brother” that can be falsely decomposed into “broth” and “er” (Lewis et al. 2011). Thus, the visual system has internalized orthographic units that are relevant to morphological and lexical knowledge. Although not yet demonstrated, we consider it likely that the VWFA also codes for frequent substrings that facilitate the mapping onto phonemes, such as “th” or “ain” in English. Indeed, this hypothesis may explain why English reading, with its complex grapheme–phoneme mappings, causes greater activation in the VWFA than does Italian reading (Paulesu et al. 2000).
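The form-based decomposition described above can be illustrated with a toy sketch. It strips a candidate suffix whenever the remainder matches a known stem, without consulting meaning, so that “brother” is split in the same way as a genuinely suffixed word. The suffix list, the stem list, and the function name are hypothetical and chosen purely for illustration; nothing here is taken from the cited MEG studies.

```python
# Hypothetical, deliberately tiny inventories used only for illustration.
SUFFIXES = ("er", "ing", "ed", "ly")
STEMS = {"broth", "teach", "farm"}


def decompose(word):
    """Split `word` into (stem, suffix) on orthographic form alone.

    Returns None when no listed suffix leaves a listed stem. Meaning is
    never consulted, so pseudo-affixed words are split like real ones.
    """
    for suffix in SUFFIXES:
        if word.endswith(suffix):
            stem = word[: -len(suffix)]
            if stem in STEMS:
                return stem, suffix
    return None


if __name__ == "__main__":
    for w in ("teacher", "brother", "brothel"):
        print(w, decompose(w))
```

In this sketch, “teacher” and “brother” both receive a stem-plus-suffix parse, whereas “brothel” does not, because “el” is not among the listed suffixes.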

In this context, we have no difficulty in accepting Frost's argument that the optimal neural code for letter strings might have to be much less tolerant to letter swaps in Hebrew than in English. This view predicts root detectors in the more anterior part of the VWFA of Hebrew readers and sharper tuning curves for letter and bigram detectors. Testing such predictions for scripts other than Latin is an important goal for future neuroimaging experiments. A readily available tool is fMRI repetition suppression, which has proven sensitive to subtle properties of object, number, and letter tuning (Dehaene et al. 2004; Grill-Spector et al. 1999). Alternatively, multivariate pattern analysis may provide more direct access to the fine-tuning characteristic of the VWFA (Braet et al. 2012).

ACKNOWLEDGMENT

MS was funded by an Iuventus Plus grant from the Polish Ministry of Science and Higher Education and by European Research Council (ERC) Advanced Grant 230313.

References

Binder, J. R., Medler, D. A., Westbury, C. F., Liebenthal, E. & Buchanan, L. (2006) Tuning of the human left fusiform gyrus to sublexical orthographic structure. NeuroImage 33(2):739–48.

Braet, W., Wagemans, J. & Op de Beeck, H. P. (2012) The visual word form area is organized according to orthography. NeuroImage 59(3):2751–59.

Chan, S. T., Tang, S. W., Tang, K. W., Lee, W. K., Lo, S. S. & Kwong, K. K. (2009) Hierarchical coding of characters in the ventral and dorsal visual streams of Chinese language processing. NeuroImage 48(2):423–35.

Dehaene, S. (2009) Reading in the brain: The science and evolution of a human invention. Penguin Viking.

Dehaene, S. & Cohen, L. (2011) The unique role of the visual word form area in reading. Trends in Cognitive Sciences 15(6):254–62.

Dehaene, S., Cohen, L., Sigman, M. & Vinckier, F. (2005) The neural code for written words: A proposal. Trends in Cognitive Sciences 9:335–41.

Dehaene, S., Jobert, A., Naccache, L., Ciuciu, P., Poline, J. B., Le Bihan, D. & Cohen, L. (2004) Letter binding and invariant recognition of masked words: Behavioral and neuroimaging evidence. Psychological Science 15(5):307–13.

Grill-Spector, K., Kushnir, T., Edelman, S., Avidan, G., Itzchak, Y. & Malach, R. (1999) Differential processing of objects under various viewing conditions in the human lateral occipital complex. Neuron 24(1):187–203.

Hasson, U., Levy, I., Behrmann, M., Hendler, T. & Malach, R. (2002) Eccentricity bias as an organizing principle for human high-order object areas. Neuron 34(3):479–90.

Lewis, G., Solomyak, O. & Marantz, A. (2011) The neural basis of obligatory decomposition of suffixed words. Brain and Language 118(3):118–27.

Mozer, M. C. (1987) Early parallel processing in reading: A connectionist approach. In: Attention and performance XII: The psychology of reading, ed. Coltheart, M., pp. 83–104. Erlbaum.

Paulesu, E., McCrory, E., Fazio, F., Menoncello, L., Brunswick, N., Cappa, S. F., Cotelli, M., Cossu, G., Corte, F., Lorusso, M., Pesenti, S., Gallagher, A., Perani, D., Price, C., Frith, C. D. & Frith, U. (2000) A cultural effect on brain function. Nature Neuroscience 3(1):91–96.

Pinel, P. & Dehaene, S. (2009) Beyond hemispheric dominance: Brain regions underlying the joint lateralization of language and arithmetic to the left hemisphere. Journal of Cognitive Neuroscience 22(1):48–66.

Price, C. J. & Devlin, J. T. (2011) The interactive account of ventral occipitotemporal contributions to reading. Trends in Cognitive Sciences 15:246–53.

Reich, L., Szwed, M., Cohen, L. & Amedi, A. (2011) A ventral visual stream reading center independent of visual experience. Current Biology 21:363–68.

Simos, P. G., Breier, J. I., Fletcher, J. M., Foorman, B. R., Castillo, E. M. & Papanicolaou, A. C. (2002) Brain mechanisms for reading words and pseudowords: An integrated approach. Cerebral Cortex 12(3):297–305.

Solomyak, O. & Marantz, A. (2010) Evidence for early morphological decomposition in visual word recognition. Journal of Cognitive Neuroscience 22(9):2042–57.

Szwed, M., Cohen, L., Qiao, E. & Dehaene, S. (2009) The role of invariant features in object and visual word recognition. Vision Research 49:718–25.

Szwed, M., Dehaene, S., Kleinschmidt, A., Eger, E., Valabregue, R., Amadon, A. & Cohen, L. (2011) Specialization for written words over objects in the visual cortex. NeuroImage 56(1):330–44.

Ullman, S. (2007) Object recognition and segmentation by a fragment-based hierarchy. Trends in Cognitive Sciences 11(2):58–64.

Vinckier, F., Dehaene, S., Jobert, A., Dubus, J. P., Sigman, M. & Cohen, L. (2007) Hierarchical coding of letter strings in the ventral stream: Dissecting the inner organization of the visual word form system. Neuron 55:143–56.
