Orthographic influences on division of labor in learning to read Chinese and English: Insights from computational modeling*

JIANFENG YANG; HUA SHU; BRUCE D. McCANDLISS; JASON D. ZEVIN

doi:10.1017/S1366728912000296

Orthographic influences on division of labor in learning to read Chinese and English: Insights from computational modeling*

Published online by Cambridge University Press: 14 September 2012

JIANFENG YANG ,

HUA SHU ,

BRUCE D. McCANDLISS and

JASON D. ZEVIN

Show author details

JIANFENG YANG: Affiliation:
Key Laboratory of Behavioral Science, Institute of Psychology, Chinese Academy of Sciences, Beijing 100101, China
HUA SHU: Affiliation:
State Key Laboratory of Cognitive Neuroscience and Learning, Beijing Normal University, Beijing 100875, China
BRUCE D. McCANDLISS: Affiliation:
Department of Psychology, Vanderbilt University Peabody College of Education and Human Development, Nashville, TN 37240, USA
JASON D. ZEVIN*: Affiliation:
Sackler Institute for Developmental Psychobiology, Weill Cornell Medical College, NY 10021, USA
*: Address for correspondence: Jason D. Zevin, Box 140, Sackler Institute for Developmental Psychobiology, Weill Cornell Medical College, New York, NY 10021, USAjdz2001@med.cornell.edu

Article contents

Abstract
Simulation 1: Modeling differential division of labor between Chinese and English in monolingual models
Simulation 2: Modeling Chinese–English bilingualism
General discussion
Footnotes
References

Rights & Permissions

Abstract

Learning to read in any language requires learning to map among print, sound and meaning. Writing systems differ in a number of factors that influence both the ease and the rate with which reading skill can be acquired, as well as the eventual division of labor between phonological and semantic processes. Further, developmental reading disability manifests differently across writing systems, and may be related to different deficits in constitutive processes. Here we simulate some aspects of reading acquisition in Chinese and English using the same model for both writing systems. The contribution of semantic and phonological processing to literacy acquisition in the two languages is simulated, including specific effects of phonological and semantic deficits. Further, we demonstrate that similar patterns of performance are observed when the same model is trained on both Chinese and English as an “early bilingual”. The results are consistent with the view that reading skill is acquired by the application of statistical learning rules to mappings among print, sound and meaning, and that differences in the typical and disordered acquisition of reading skill between writing systems are driven by differences in the statistical patterns of the writing systems themselves, rather than differences in cognitive architecture of the learner.

Keywords

computational modeling reading dyslexia

Type: Research Article
Information: Bilingualism: Language and Cognition , Volume 16 , Issue 2: Computational Modeling of Bilingualism , April 2013 , pp. 354 - 366

DOI: https://doi.org/10.1017/S1366728912000296 [Opens in a new window]
Copyright: Copyright © Cambridge University Press 2012

Most research on the development of reading has been done in English. This raises questions about whether insights and models of reading skill and its development will generalize to other languages and writing systems. In fact, reading acquisition and use appear to be quantitatively and qualitatively different across writing systems, in ways that have prompted some theorists to propose that different cognitive architectures would be required to understand reading in different writing systems (Coltheart, Curtis, Atkins & Haller, Reference Coltheart, Curtis, Atkins and Haller1993; Coltheart, Rastle, Perry, Langdon & Ziegler, Reference Coltheart, Rastle, Perry, Langdon and Ziegler2001; Frost, Reference Frost, Snowling and Hulme2005; Perfetti, Liu & Tan, Reference Perfetti, Liu and Tan2005). An alternative view is that these differences may be better understood in terms of statistical properties of the writing system (Ziegler & Goswami, Reference Ziegler and Goswami2005) and the impact these may have on the “division of labor” between semantic and phonological processing in reading (Seidenberg, Reference Seidenberg, Miller and Eimas1995). The models reported here apply the same basic architecture and learning rules to two very different writing systems – English and Chinese – in order to test these two possibilities. This is also a first-order question in the modeling of biliteracy for these two writing systems, because if the languages cannot be accommodated in a single functional architecture, it will have important consequences for the modeling of biliteracy in these two languages.

The difference between Chinese and English can be understood in terms of the statistical properties of spelling-to-sound and spelling-to-meaning mappings. Although English has something of an “outlier” writing system in mapping from print to sound (Malone, Reference Malone1925; Venezky, Reference Venezky1999), it has an alphabet of letters that correspond roughly to individual speech sounds (Venezky, Reference Venezky1970). In contrast, Chinese has an extremely “deep” orthography (Frost, Katz & Bentin, Reference Frost, Katz and Bentin1987) in that the pronunciation of a character cannot be computed sound-by-sound from its constituent parts (DeFrancis, Reference DeFrancis1989), although probabilistic cues to pronunciation do exist (Li & Kang, Reference Li and Kang1993; Zhu, Reference Zhu and Yuan1988) and are used by both children learning to read and adult readers (Lee, Tsai, Su, Tzeng & Hung, Reference Lee, Tsai, Su, Tzeng and Hung2005; Shu, Meng, Chen, Luan & Cao, Reference Shu, Meng, Chen, Luan and Cao2005). Chinese is sometimes characterized as a logographic system, in contrast to alphabetic systems, because of the morphemic (Leong, Reference Leong and Downing1973) or even morphosyllabic (e.g., DeFrancis, Reference DeFrancis1989; Mattingly, Reference Mattingly, Frost and Katz1992) mappings characters afford. Characters, as basic writing units, map onto morphemes – not phonemes – in the spoken language. Furthermore, Chinese characters typically contain a “semantic radical” that provides some probabilistic information that aids in the translation from orthography to semantics. Alphabetic writing systems rarely contain semantic information that is not somehow encoded phonologically. Even where there is ambiguity about spelling-to-sound for morphological forms (final -s and -ed in English), it cannot be said that these convey no phonological information at all, in the way that semantic components of Chinese characters do (see Frost, in press; Mirkovic, MacDonald & Seidenberg, Reference Mirkovic, MacDonald and Seidenberg2005, for discussion).

There is clear evidence among alphabetic orthographies that shallower systems are easier to learn than deeper ones, as reflected by both word and non-word performance in beginning readers (e.g., Ellis & Hooper, Reference Ellis and Hooper2001; Goswami, Gombert & de Barrera, Reference Goswami, Gombert and Barrera1998; Seymour, Aro & Erskine, Reference Seymour, Aro and Erskine2003). Differences between alphabetic orthographies and Chinese are starker still: the average English-reading child can recognize 3000–5000 words after the first grade (White, Graves & Slater, Reference White, Graves and Slater1990) whereas Chinese-reading children can typically read fewer than 800 characters with the same amount of schooling (Xing, Shu & Li, Reference Xing, Shu and Li2004). Thus, the overall consistency of mappings from units in the writing system to their phonological counterparts has clear effects on the rate at which reading skill can be acquired.

Another consequence of orthographic depth for the acquisition of reading is that related language skills (such as semantic and phonological processing) contribute differentially to reading success across writing systems. Shallow orthographies are characterized by weak effects of semantic variables in skilled reading (e.g., Bates, Burani, D'amico & Barca, Reference Bates, Burani, D'amico and Barca2001), and a limited contribution of semantic processing skills to the development of reading (McBride-Chang, Cho, Liu, Wagner, Shu, Zhou, Cheuk & Muse, Reference McBride-Chang, Cho, Liu, Wagner, Shu, Zhou, Cheuk and Muse2005; Saiegh-Haddad & Geva, Reference Saiegh-Haddad and Geva2008). In relatively “deep” orthographies, such as English, semantic knowledge plays some role in reading aloud, particularly in the reading of words whose spellings are highly atypical (Strain, Patterson & Seidenberg, Reference Strain, Patterson and Seidenberg1995, Reference Strain, Patterson and Seidenberg2002) and there is some evidence for a role of semantic processing abilities in beginning reading skill (Carlisle, Reference Carlisle2000, Reference Carlisle2003; Nation & Snowling, Reference Nation and Snowling1999). In part because print-to-sound cues are even less reliable in Chinese, the role of semantic processing in reading aloud is greater, and the contribution of semantics to the development of Chinese reading is particularly important (Shu, McBride-Chang, Wu & Liu, Reference Shu, McBride-Chang, Wu and Liu2006; Shu, Peng & McBride-Chang, Reference Shu, Peng and McBride-Chang2008).

The differential contribution of semantic and phonological processing across writing systems may also explain differences in the manifestation of reading disability across languages. In English, there is evidence for subtypes of developmental dyslexia: “phonological dyslexics” have specific difficulty with decoding and “surface dyslexics” have specific difficulty with atypically spelled words, but relatively spared performance on regular words and non-words (e.g., Manis, Seidenberg, Doi, McBride-Chang & Petersen, Reference Manis, Seidenberg, Doi, McBride-Chang and Petersen1996). These subtypes are often explained as resulting from distinct pre-existing deficits: in semantic processing for the developmental delay/surface dyslexics and phonological processing for the phonological dyslexics. The reading performance of children with developmental surface dyslexia is very similar to that of younger normal readers with respect to the relative difficulty of pseudowords, regular words and irregular-inconsistent words (Manis et al., Reference Manis, Seidenberg, Doi, McBride-Chang and Petersen1996). Their specific difficulty reading words with unusual spelling-to-sound correspondences may thus be associated either with semantic deficits or with a general delay in the development of reading skill (Nation & Snowling, Reference Nation and Snowling1998; Plaut, McClelland, Seidenberg & Patterson, Reference Plaut, McClelland, Seidenberg and Patterson1996). In contrast, developmental phonological dyslexia is associated with deficits in phonological processing (Stanovich & Siegel, Reference Stanovich and Siegel1994). In English, phonological dyslexics present with a reading impairment that is most pronounced for non-words, but, in milder cases, can leave exception word reading more or less intact (Castles & Coltheart, Reference Castles and Coltheart1993).

In Chinese, semantic and phonological processing deficits impact reading in different ways. Poor semantic processing is associated with difficulties reading all types of words, even those with more typical spelling-to-sound correspondences, although reading of atypically spelled words does suffer relatively more (Shu et al., Reference Shu, Meng, Chen, Luan and Cao2005). Children with phonological deficits are also impaired relative to age-matched controls on reading of all words, but the impairment is greater for words with typical spelling-to-sound correspondences, with the result that phonological dyslexics do not show the usual advantage for regular-consistent over irregular-inconsistent words (Shu et al., Reference Shu, Meng, Chen, Luan and Cao2005). In sum, there are gross differences between writing systems in the relative contribution of phonological and semantic processing abilities to the development of reading skill that appear to be driven largely by the consistency of print-to-sound mappings across languages. In previous studies, we have demonstrated that the same basic architecture and learning rules appropriate to English could model the acquisition and use of reading skill in Chinese, and simulate both effects that are directly analogous to English and effects that are specific to Chinese (Yang, McCandliss, Shu & Zevin, Reference Yang, McCandliss, Shu and Zevin2009).

Here, we simulate typical and disordered reading acquisition in English and Chinese by applying the same functional architecture (modified to represent the inputs and outputs for each language) and learning rules for both writing systems. The models implement the theory that reading is acquired via a process of statistical learning of mappings among spelling, sound and meaning, and test the hypothesis that differences in the patterns of typical and disordered reading development across writing systems may be explained in terms of differences in the statistical properties of the writing systems rather than by differences in functional architecture. In a second simulation, we model the simultaneous acquisition of Chinese and English, to examine whether the same learning trajectories and sequelae of pre-literate deficits would be observed across languages learned by the same individual. Simulation 2 addresses a first-order question in the modeling of Chinese/English biliteracy: Can English and Chinese are learned in the same set of mappings among orthography, phonology and semantics? Or do they require fundamentally different processing assumptions? Further, we can examine how the two languages interact when learned by the same system. It is possible that learning these two very different writing systems at the same time will lead to differences in how reading skill is acquired and used in both writing systems, but it is also possible that when both languages are learned at the same time, the outcome is equivalent to monolingual learning of each. Either outcome would have important consequences for understanding bilingualism and biliteracy in reading development.

Simulation 1: Modeling differential division of labor between Chinese and English in monolingual models

Here we examine development of typical and disordered reading in two parallel models implementing the same functional architecture for English and Chinese. Both models have feed-forward connections from an orthographic input layer to a phonological attractor network (Harm & Seidenberg, Reference Harm and Seidenberg1999), supplemented with a semantic input layer that functions mainly to provide a secondary source of input about word identity that is particularly useful for words with ambiguous spelling-to-sound mappings (Plaut et al., Reference Plaut, McClelland, Seidenberg and Patterson1996). Following Plaut (Reference Plaut1997), we used random bit patterns to capture this contribution of word-specific knowledge to generating a correct pronunciation. While this has the disadvantage of not providing a realistic representation of the similarity of the meanings of words within a language, it has the advantage of permitting us to use the same semantic patterns for both languages, thus allowing a direct investigation of the role of properties of the print-to-sound system on the division of labor.

Methods

Architecture

The same basic architecture (Figure 1) was used for two models: one for Chinese and one for English. Each model had an orthographic input layer designed to represent the spellings of words in the appropriate writing system, fully connected to a hidden layer with 100 units, which was in turn fully connected to a phonological output layer designed to represent the pronunciations of words in that language. The phonological output layer was fully connected both directly to itself and to 50 cleanup units, permitting the formation of attractor states, following Harm and Seidenberg (Reference Harm and Seidenberg1999). The English representations of orthography and phonology were adapted from the scheme of Harm and Seidenberg (Reference Harm and Seidenberg2004): 101 units were used to represent 10 slots of letters in the orthographic layer and 200 units were used for eight slots to represent phonemes in phonological layer. The Chinese orthographic representation consisted of 270 units based on a linguistic description of Chinese orthography including radicals, number of strokes and radical position, adapted from Xing et al. (Reference Xing, Shu and Li2004) by excluding slots that explicitly coded the location of the phonetic component (see Yang et al., Reference Yang, McCandliss, Shu and Zevin2009, for details). Ninety-two units were used to code each Chinese syllable, which includes five slots: one onset slot, three rime slots, and a fifth slot for tone. As in Zhao & Li's (Reference Zhao and Li2009) PatPho system, each phoneme slot was encoded with the same basic featural representation, but with a slightly different configuration for the two languages (e.g., Chinese has palatal and retroflex in addition to bilabial, alveolar and velar, used in the English models).

Figure 1. Architecture of the monolingual Model.

A second input layer was included to simulate the contribution of semantics in print-to-sound translation. Semantic patterns were 3000 random bit patterns clustered into 120 categories over 200 semantic features. Categories were created by generating a set of 120 prototypes, in which each feature had a probability of 0.1 of being active. Each prototype was then used to generate 25 exemplars by randomly selecting 10% of all features and resetting their probability of activation to 0.05, under the constraint that each exemplar differ from all other exemplars by at least three features. A subset of 2881 patterns was assigned randomly to the words in the English training corpus. A subset of 2689 patterns from the English training patterns were selected and randomly assigned to Chinese characters. In both versions of the model, the semantic input layer was connected to the output layer via 100 hidden units.

Training

Training was carried out in the same way for the English and Chinese versions of the model. We first pre-trained the phonological attractor net to an error threshold of 0.01, and the final weights (240K in Chinese and 60K in the English model) of phonological attractor net were embedded in the reading model. To avoid “catastrophic interference”, interleaved training (Hetherington & Seidenberg, Reference Hetherington and Seidenberg1989) on phonological processing and reading was adopted. Training mixed 10% “listening” trials, on which only the phonological attractor was trained, with 90% “reading” trials, on which the whole model was trained. A learning rate of 0.005 and momentum of 0.9 were used. Online learning was used with the continuous recurrent back-propagation algorithm (Pearlmutter, Reference Pearlmutter1995). Each word was selected according to the training probability transformed via square root compression.

The Chinese training corpus of 2689 characters consisted of 2390 characters from a set of naming norms (Liu, Shu & Li, Reference Liu, Shu and Li2007) and 299 additional items from phonetic families represented in the testing materials. Frequency estimates were taken from the Modern Chinese Frequency Dictionary (Language and Teaching Institute of Beijing Language College, 1986). The English training corpus consisted of 2881 monosyllabic words assigned frequencies taken from the Marcus, Santorini and Marcinkiewicz (Reference Marcus, Santorini and Marcinkiewicz1993) norms, which are based on 43 million tokens from The Wall Street Journal.

In both languages, two subtypes of developmental dyslexia were simulated by applying decay to either the hidden units from orthography to phonology (to simulate phonological dyslexia, hereafter PD) or the hidden units from semantics to phonology (to simulate surface dyslexia, hereafter SD). Decay on each weight ω was reduced in magnitude according to the formula Δω = −ω×σ, where σ was the decay constant. In order to simulate a wide range of deficit severity, 20 different decay values were used, varying from 0.25 × 10⁻⁵ to 5 × 10⁻⁵ in steps of 0.25 × 10⁻⁵. Unimpaired models were also run 20 times. Each run of the model used a different random seed for the initial randomization of weights and selection order of stimuli.

Testing

Naming accuracy was computed to test the model's performance. It was determined by applying a winner-take-all scoring system: for each slot on the output layer, we determined which phoneme was closest to the pattern on the output at the final time tick and reported this as the model's pronunciation.

Test items were drawn from studies of consistency, regularity and frequency effects in the two languages: the 120 Chinese test items were from Yang et al. (Reference Yang, McCandliss, Shu and Zevin2009), and the 144 English test items were those used by Plaut et al. (Reference Plaut, McClelland, Seidenberg and Patterson1996) from Taraban and McClelland (Reference Taraban and McClelland1987). In both languages, the items were sets of regular-consistent, regular-inconsistent and exception words matched for frequency, phonetic family size and other Chinese script properties, such as structure type, the number of strokes and radicals.

The definition of regularity in English and Chinese is slightly different. In English, regular words are those that can be pronounced correctly by rule (although there is some discrepancy between rule sets, due to disagreements about whether rules for units larger than single graphemes are considered, see e.g., Andrews & Scarratt, Reference Andrews and Scarratt1998; Zevin & Seidenberg, Reference Zevin and Seidenberg2006). In the current study, “regular” words are those with pronunciations consistent with the rule set of the Dual-Route Cascade model of word reading (Coltheart et al., Reference Coltheart, Rastle, Perry, Langdon and Ziegler2001) which has a large number of multi-grapheme rules, but nonetheless counts many highly inconsistent items as “regular”. In Chinese, a character is considered regular if its pronunciation matches the pronunciation of its phonetic component when this occurs as a single character (see Peng & Yang, Reference Peng and Yang1997; Yang et al., Reference Yang, McCandliss, Shu and Zevin2009). In both languages, exception words or characters are just those that are not considered regular. Consistency is defined essentially the same way in both languages – completely consistent words share the pronunciation of some critical sub-lexical component with all of the words that contain that component – although the sub-lexical structures of the two languages are of course different. In English, regular inconsistent words were items such as DOLL and BROTH, that have exception words as neighbors (e.g., POLL and BOTH). In Chinese, consistency (like regularity) is defined at the level of the phonetic component. Characters that are regular but contain a phonetic component that is pronounced in different ways in different (exception) characters are regular and inconsistent. Simulations of surface and phonological dyslexia in Chinese children used the items from the original study (Shu et al., Reference Shu, Meng, Chen, Luan and Cao2005).

Results

Overall performance across languages, for typical and disordered reading models

Figure 2 shows the models’ accuracy over time for all items in the training set. For the typically developing model, on average, the English model reached 90% overall accuracy after 292K trials (SD = 10.1K) and the Chinese model reached 90% accuracy after 665K trials (SD = 15.7K). A 3 (Deficit: Typical, PD, SD) × 2 (Language: Chinese and English) ANOVA with maximum naming accuracy as the dependent variable revealed significant main effects of Deficit, F(2,114) = 187.93, MSE = .20, p < .01, and of Language, F(1,114) = 8.90, MSE = .01, p < .01, as well as an interaction between the two, F(2,114) = 119.08, MSE = .13, p < .01. The interaction arises because there was a greater effect of PD in the English model (81.0% accuracy) than in the Chinese model (89.4%), and the reverse pattern for SD, with a very modest effect in English (98.7% accuracy) but a large effect in Chinese (84.7%). Accuracy in the typical model was nearly perfect (> 99%) for both languages.

Figure 2. Learning trajectories of English and Chinese models show differential effects phonological (dashed line) and semantic (gray line) impairment across languages.

Reading deficits in English

To further investigate the patterns of reading disability resulting from particular patterns of deficit, we conducted a 2 (Regularity: regular-consistent, irregular-inconsistent) × 3 (Deficit: Typical, PD, SD) ANOVA analysis on maximum naming accuracy. The main effect of regularity was significant, F(1,57) = 54.91, MSE = .02, p < .01, as was the main effect of deficit, F(2,57) = 26.74, MSE = .05, p < .01, and the interaction of the two, F(2,57) = 14.41, MSE = .01, p < .01. As seen in Figure 3, regular-consistent words were read more accurately than irregular-inconsistent words, and the Typical model's overall performance (100%) was significantly better than the PD model (93.0%), p < .01, and marginally better than the SD model (97.7%), p = .056. The interaction between deficit and stimulus condition arose because performance on all items was impaired in the PD model, whereas the SD model was impaired only in irregular word reading. In the Typical model, all words were named accurately. Semantic impairment had no impact on the regular-consistent items (100% accuracy), but resulted in reduced accuracy for the irregular-inconsistent items (95.4%). In contrast, the PD model was impaired for both regular-consistent and irregular-inconsistent items (94.8% and 91.3% accuracy, respectively).

Figure 3. Learning trajectories for different stimulus types in the English model. Regular consistent items are impacted only by phonological impairment (dashed line) whereas both phonological and semantic impairment (gray line) impacted irregular-inconsistent items.

Poor nonword reading is a particular hallmark of phonological dyslexia in English, but the status of “non-character” reading in Chinese (i.e., whether it reflects normal reading processes or meta-linguistic guessing) is a topic of debate (Shu et al., Reference Shu, Meng, Chen, Luan and Cao2005; Weekes, Yin, Su & Chen, Reference Weekes, Yin, Su and Chen2006). Because of the higher degree of arbitrariness in spelling-to-sound mappings, it is hard to create non-characters in Chinese. We therefore tested nonword reading in a separate set of statistical tests for English. Nonword reading was strongly influenced by deficit, F(2,57) = 44.09, MSE = .06, p < .01. Post-hoc tests showed no effect of SD on nonword reading (86.2% accuracy, compared to 86.1% accuracy for the Typical model), p = .86, and a large effect of PD (77.0%), p < .01.

Reading deficits in Chinese

In parallel with the analysis of the English model, we conducted a 2 (Regularity: regular-consistent, irregular-inconsistent) × 3 (Deficit: Typical, PD, SD) ANOVA with maximum accuracy as the dependent variable in Chinese. The main effect of regularity was significant, F(1,57) = 124.23, MSE = .11, p < .01, as was the main effect of deficit, F(2,57) = 111.93, MSE = .17, p < .01, and the interaction of the two, F(2,57) = 96.12, MSE = .09, p < .01. As seen in Figure 4, regular-consistent words were read more accurately than irregular-inconsistent words, and the Typical model's overall performance was significantly better than both the PD model, p < .01 than the SD model p < .01. The interaction between deficit and stimulus condition arose because performance on the word classes was differentially impacted by semantic and phonological impairments. In the Typical model, all words were named accurately (100%). Semantic impairment influenced the naming accuracy more for irregular-inconsistent (79.75%) than regular-consistent (96.5%) words, t(19) = 12.67, p < .01. In contrast, the PD model performed equally poorly on both regular-consistent (90.5%) and irregular-inconsistent (89.0%) words, t(19) = 1.55, p = .14.

Figure 4. Learning trajectories for different stimulus types in the Chinese model. Both regular-consistent and irregular-inconsistent items are impacted by both phonological (dashed line) and semantic (gray line) impairments.

Simulating three cases of Chinese dyslexia

Shu et al. (Reference Shu, Meng, Chen, Luan and Cao2005) reported three cases of developmental dyslexia in Chinese, along with data on the children's phonological and semantic processing abilities. One case, Child L (age 9:0, male), was classified as surface dyslexic because of his relatively specific impairment on exception words. The two remaining cases (J, 10:8, and Q, 12:2, both male) were phonological dyslexics. One important feature of this study is that semantic and phonological processing skills were also tested independently. Child L's reading impairment was accompanied by frank impairments in morphological awareness, a meta-linguistic task used to assess semantic processing in Chinese readers (McBride-Chang, Shu, Zhou, Wat & Wagner, Reference McBride-Chang, Shu, Zhou, Wat and Wagner2003), but his performance on phonological awareness tasks was within normal range, whereas J and Q showed the converse pattern.

Simulation of case studies was undertaken by identifying a point in training at which the appropriate model (SD for Child L, PD for J and Q) achieved the same overall accuracy (on all test items) as the case being simulated. Data from all three are shown in Figure 5. The SD model attained overall performance of 44% after an average of 364K (SD = 58K) trials. At this point in training, the SD model's ability to read words was strongly influenced by stimulus regularity – 61.25% accuracy for regular items vs. 50.63% for irregular, t(19) = 7.64, p < .01.

Figure 5. Simulations of three case studies. Child L has semantic deficits, and shows a strong regularity effect, whereas children J and Q have phonological deficits and show no regularity effect.

A diagnostic feature of phonological dyslexia in Chinese is the lack of a regularity effect, seen in both of the cases under consideration here. Child J's overall accuracy was 49%, which the PD model reached after 433K (SD = 88K) trials. At this point in training, the model exhibited no effect of regularity with 59.1% and 58.1% accuracy for regular and irregular characters, respectively, t(19) = 1.00, p = .33. Child Q's overall accuracy was 73%, which the PD model reached after 910K (SD = 192K) trials. At this point in training, the model did not show evidence for a regularity effect – 79.8% and 78.6% accuracy for regular and irregular items, t(19) = 1.58, p = .13.

Discussion

When the same functional architecture is trained to read English and Chinese, distinct patterns of typical and atypical development are observed across languages. Gross differences in the rate of learning of the two writing systems are clearly captured by the models, as are differences in the patterns of deficits observed in reading disability. Specifically, the same constitutive deficits (in phonological and semantic processing) have distinct effects that are language-specific, suggesting that these patterns are driven by statistical properties of the writing systems themselves, and not by differences in the basic architecture of reading across languages.

In both English and Chinese, phonological deficits have relatively broad effects, and are a key factor in predicting reading disability (McBride-Chang & Zhong, Reference McBride-Chang, Zhong, Li, Tan, Bates and Tzeng2006; Snowling, Reference Snowling2000; Vellutino & Fletcher, Reference Vellutino, Fletcher, Margaret and Snowling2005). This is captured in the overall pattern of effects in the two models. Further, language-specific features of phonological dyslexia are also observed. In English, children with phonological difficulties have particular difficulty with nonword pronunciation (Castles & Coltheart, Reference Castles and Coltheart1993; Manis et al., Reference Manis, Seidenberg, Doi, McBride-Chang and Petersen1996; Temple & Marshall, Reference Temple and Marshall1983). This was also found in the English model. In contrast, nonword reading is a difficult task for even skilled Chinese readers, and is rarely tested in development, but there is a specific pattern that is a hallmark of phonological dyslexia – the reduced size of the regularity effect observed by Shu et al. (Reference Shu, Meng, Chen, Luan and Cao2005). This effect was also captured in the model.

Semantic deficits had strikingly different effects across writing systems. In English, semantic support is mainly necessary for irregular-inconsistent items, and deficits in semantic processing have relatively specific effects on these items (Castles & Coltheart, Reference Castles and Coltheart1993, Reference Castles and Coltheart1996; Manis et al., Reference Manis, Seidenberg, Doi, McBride-Chang and Petersen1996). In Chinese, in contrast, effects of semantic deficits are quite general, impacting both regular-consistent and irregular-inconsistent items nearly equally. Again, this is consistent with case observations from Shu et al. (Reference Shu, Meng, Chen, Luan and Cao2005), and is also generally consistent with the relatively strong correlation of morphological awareness with reading ability (Ku & Anderson, Reference Ku and Anderson2003; McBride-Chang, Cho et al., Reference McBride-Chang, Cho, Liu, Wagner, Shu, Zhou, Cheuk and Muse2005).

Simulation 2: Modeling Chinese–English bilingualism

Although they shared many features, the models in Simulation 1 differed in important ways, because their phonological and orthographic representations were language-specific. Here we explore whether the same model, when trained to read both English and Chinese will show similar patterns of results to parallel models described in Simulation 1. We did this by training a single model with a single phonological output attractor, a single semantic system, and two orthographic input layers, one for each language.