Hostname: page-component-745bb68f8f-kw2vx Total loading time: 0 Render date: 2025-02-11T04:59:32.086Z Has data issue: false hasContentIssue false

Computational mechanisms of development? Connectionism and bilingual lexical representation

Published online by Cambridge University Press:  11 November 2021

Ping Li*
Affiliation:
TheHong KongPolytechnic University, Hong Kong
Xiaowei Zhao
Affiliation:
Emmanuel College, Boston, USA
*
*Address for correspondence: Ping Li, Department of Chinese and Bilingual Studies, Faculty of Humanities, The Hong Kong Polytechnic University, Hung Hom, Hong Kong. Email: pi2li@polyu.edu.hk
Rights & Permissions [Opens in a new window]

Abstract

Type
Peer Commentaries
Copyright
Copyright © The Author(s), 2021. Published by Cambridge University Press

The Ontogenesis Model (OM) of Bordag, Gor and Opitz (Reference Bordag, Gor and Opitz2021) is a good example of applying computational thinking to the study of key issues in bilingual lexical representation. With the introduction of two critical computational concepts – namely, Multidimensionality and Fuzziness – this model aims at explaining bilingual lexical representation through a theoretical framework of ‘ontogenesis’. A key feature of the model – that distinguishes itself from other bilingual computation-minded models such as BIA and BIA+, as the authors argue – is the focus on development of lexical representation (hence, ontogenesis in the name of the model). This focus is particularly important, and since the earliest days of BIA and BIA+ models, there has been an urgent need for the field to move from ‘proficient bilingual model’ to ‘developmental bilingual model’ (Li, Reference Li2002).

While the OM provides a good illustration of the complexity and the dynamic nature of bilingual lexical representation, it falls short of delivering the key promise on accounting for the developmental processes and underlying principles of bilingual lexical representation. In our view, there are significant gaps with the current formulation of the OM model as either a theoretical or an analytic framework. First and foremost, it largely dismisses the significant amount of work in the past two decades that has been devoted to address precisely the same questions that OM asks, the connectionist models of bilingual representation (see a few examples in Thomas, Reference Thomas1997; French, Reference French, Gernsbacher and Derry1998; French & Jacquet, Reference French and Jacquet2004; Li & Farkas, Reference Li, Farkas, Heredia and Altarriba2002; Lewy & Grosjean, Reference Lewy, Grosjean and Grosjean2008; Zhao & Li, Reference Zhao and Li2010, Reference Zhao and Li2013; Peñaloza, Grasemann, Dekhtyar, Miikkulainen & Kiran, Reference Peñaloza, Grasemann, Dekhtyar, Miikkulainen and Kiran2019; along with Shook & Marian, Reference Shook and Marian2013; Kiran, Graesman, Sandberg & Miikkulainen, Reference Kiran, Graesman, Sandberg and Miikkulainen2013 and other articles in the 2013 special issue published by this journal). Bordag et al. write, “other frameworks, e.g., connectionist models, model non-optimal representations via, e.g., non-final weights and optimize them via re-weighting of connections due to new input… fuzziness in the OM refers to imprecise lexical encoding due to a broad range of linguistic and cognitive factors and the learning conditions.” This statement, on the one hand, does not do justice to the significant amount of work inspired by connectionism that has indeed incorporated ‘the broad range of linguistic and cognitive factors and learning conditions’, and on the other, reflects a misunderstanding of what the weight updating and optimization in connectionist networks really are. As Shirai's (Reference Shirai2018) recent synthesis demonstrates (see especially Chapters 3 and 4), connectionist bilingual lexical representations have indeed attempted to incorporate a wide range of linguistic and cognitive factors, including the ones not even discussed in the OM model such as working memory and its impact on individual difference (see also Ellis, Reference Ellis, Doughty and Long2003; Wen, Biedroń & Skehan, Reference Wen, Biedroń and Skehan2017). Further, weight updating and optimization are mechanisms used by the brain's neural circuit to accomplish the process of learning and development, and they are based on realistic biological principles that provide the necessary mechanisms for a cognitively plausible computational account, which brings us to the next point.

Second, in defending connectionism and its success in modeling developmental L2 lexical representation, we should also point out that OM lacks precisely the kind of computational mechanistic account provided by connectionist models in disentangling the complex interactions among the key linguistic and cognitive factors (see Grant, Legault & Li, Reference Grant, Legault, Li and Schwieter2019; Li, Reference Li2013; Li & Zhao, Reference Li, Zhao, de Groot and Hagoort2017 for discussion). Indeed, the BIA and BIA+ models are based on connectionist-like mechanisms (e.g., the original IA models by McClelland & Rumelhart, Reference McClelland and Rumelhart1981 were precursors to connectionism; see Li & Zhao, Reference Li, Zhao and Aronoff2020). Rather than dismissing the connectionist architecture, the OM should be able to benefit from integrating computational mechanisms of connectionist models. The OM aims at modeling three key dimensions of linguistic domains, mappings, and networks (including IntraNetwork and InterNetwork), but it is unclear, unlike in connectionist models, how these dimensions can be actually modeled and implemented computationally, and what plausible mechanisms are to be deployed in the modeling enterprise so that the authors and others can verify and test. As an example, Figure 3 of Bordag et al. nicely illustrates the progressively enlarged network connections, but the OM provides no quantitative methods to actually model such progression or developmental changes. In our view, these dimensions match well with the computational architecture of the connectionism-based DevLex-II (Li, Zhao & MacWhinney, Reference Li, Zhao and MacWhinney2007; Zhao & Li, Reference Zhao and Li2010), a multi-layer neural network model with three connected self-organizing maps representing basic linguistic contents (phonology, semantic, and the articulatory sequence) of the bilingual lexicon. The DevLex model was originally designed as a developmental model for L1 (Li & Farkas, Reference Li, Farkas, Heredia and Altarriba2002; Li et al., Reference Li, Zhao and MacWhinney2007) and DevLex-II is focused on the development of bilingual lexicon, having the same goal as the OM. Through computational mechanisms such as self-organization (SOM; Kohonen, Reference Kohonen2001) and Hebbian Learning (Hebb, Reference Hebb1949), DevLex-II can explicitly model the development of bilingual lexical representation and empirical patterns of priming (Zhao & Li, Reference Li2013), key aspects that the OM is designed for explaining. Connectionist models, including DevLex-II and others starting from the late 1990's (e.g., Thomas, Reference Thomas1997; French, Reference French, Gernsbacher and Derry1998), have aimed to provide the kind of computational mechanisms that the OM currently lacks, particularly with regard to such concepts as the word association networks within (OM's IntraNetwork) and between languages (OM's InterNetwork).

Finally, a crucial concept of the OM is its Fuzzy Lexical Representation (FLR) account. This view is highly consistent with what has been proposed by many connectionist language development models, in both L1 (e.g., Plunkett & Marchman, Reference Plunkett and Marchman1996; Li et al., Reference Li, Zhao and MacWhinney2007) and L2 (Hernandez, Li & MacWhinney, Reference Hernandez, Li and MacWhinney2005; Li & Zhao, 2013; Zhao & Li, Reference Zhao and Li2021). Specifically, Hernandez et al. (Reference Hernandez, Li and MacWhinney2005; see Fig. 1) proposed the concept of ‘parasitism’: according to which, factors such as age of acquisition and proficiency – and, in particular, competition/interaction between L1 and L2 – are responsible for fuzzy L2 lexical-semantic representations, which in turn lead to inaccurate lexical comprehension and production in both semantic and phonological domains. This leads to the question of why OM chooses to ignore L1-L2 interaction, given that the dynamic interaction of L2 with L1 is a core process that drives the outcome fuzzy L2 representation, based on both theoretical perspectives and neurocognitive evidence (e.g., Claussenius-Kalman, Hernandez & Li, Reference Claussenius-Kalman, Hernandez and Li2021; Zhang, Yang, Wang & Li, Reference Zhang, Yang, Wang and Li2020).

Despite our defending connectionism in this commentary, we believe that OM provides a good platform for further discussion and investigation of core issues in bilingual lexical representation, but the integration of connectionist principles into its architecture could enhance its plausibility and generalizability.

References

Bordag, D, Gor, K and Opitz, A (2021) Ontogenesis Model of the L2 Lexical Representation. Bilingualism: Language and Cognition 117. doi:10.1017/S1366728921000250Google Scholar
Claussenius-Kalman, H, Hernandez, A and Li, P (2021) Expertise, Ecosystem, and Emergentism: Dynamic developmental bilingualism. Brain and Language 222, 105013. https://doi.org/10.1016/j.bandl.2021.105013.CrossRefGoogle ScholarPubMed
Ellis, NC (2003) Constructions, chunking, and connectionism: The emergence of second language structure. In Doughty, CJ and Long, MH (eds.), The handbook of second language acquisition Malden, MA: Blackwell Publishing Ltd pp. 63103.CrossRefGoogle Scholar
French, RM (1998) A simple recurrent network model of bilingual memory. In Gernsbacher, MA and Derry, SJ (eds.), Proceedings of the 20th Annual Conference of the Cognitive Science Society (pp. 368373). Mahwah, NJ: Erlbaum.Google Scholar
French, RM and Jacquet, M (2004) Understanding bilingual memory: Models and data. Trends in Cognitive Science 8, 8793.10.1016/j.tics.2003.12.011CrossRefGoogle ScholarPubMed
Grant, A, Legault, J and Li, P (2019) What do bilingual models tell us about the neurocognition of multiple languages? In Schwieter, J (ed.), The handbook of the neuroscience of multilingualism. Wiley-Blackwell, pp. 4874.CrossRefGoogle Scholar
Hernandez, A, Li, P and MacWhinney, B (2005) The emergence of competing modules in bilingualism. Trends in Cognitive Sciences 9(5), 220225.10.1016/j.tics.2005.03.003CrossRefGoogle ScholarPubMed
Hebb, D (1949). The organization of behavior: A neuropsychological theory. New York, NY: Wiley.Google Scholar
Kiran, S, Graesman, U, Sandberg, C and Miikkulainen, R (2013) A computational account of bilingual aphasia rehabilitation. Bilingualism: Language and Cognition 16 (Special Issue), 325342. doi:10.1017/S1366728912000533CrossRefGoogle ScholarPubMed
Kohonen, T (2001) Self-organizing maps (3rd ed.). Berlin: Springer.10.1007/978-3-642-56927-2CrossRefGoogle Scholar
Lewy, N and Grosjean, F (2008) The Lewy and Grosjean BIMOLA model. In Grosjean, F (ed.), Studying bilinguals. Oxford: Oxford University Press, pp. 201210.Google Scholar
Li, P (2002) Bilingualism is in dire need of formal models. Bilingualism: Language and Cognition 5(3), 213. doi:10.1017/S1366728902253018CrossRefGoogle Scholar
Li, P (2013) Computational modeling of bilingualism: How can models tell us more about the bilingual mind? Bilingualism: Language and Cognition 16(2), 241245. doi:10.1017/S1366728913000059CrossRefGoogle Scholar
Li, P and Farkas, I (2002) A self-organizing connectionist model of bilingual processing. In Heredia, R and Altarriba, J (eds.), Bilingual sentence processing. North-Holland: Elsevier Science Publisher, pp. 5985.CrossRefGoogle Scholar
Li, P and Zhao, X (2013) Self-organizing map models of language acquisition. Frontiers in Psychology. 4(828). doi: 10.3389/fpsyg.2013.00828Google Scholar
Li, P and Zhao, X (2017) Computational modeling. In de Groot, AMB and Hagoort, P (eds.), Research methods in psycholinguistics and the neurobiology of language: A practical guide. Hoboken, NJ: John Wiley & Sons, Inc, pp. 208229.Google Scholar
Li, P and Zhao, X (2020) Connectionism. In Aronoff, M (ed.), Oxford bibliographies online (Linguistics). Oxford University Press. (www.oxfordbibliographies.com; originally published 2012, last updated 2020)Google Scholar
Li, P, Zhao, X and MacWhinney, B (2007) Dynamic self-organization and early lexical development in children. Cognitive Science: A Multidisciplinary Journal 31, 581612.CrossRefGoogle ScholarPubMed
McClelland, J and Rumelhart, D (1981). An interactive activation model of context effects in letter perception: Part 1. An account of basic findings. Psychological Review 88, 375407.CrossRefGoogle Scholar
Peñaloza, C, Grasemann, U, Dekhtyar, M, Miikkulainen, R and Kiran, S (2019) BiLex: A computational approach to the effects of age of acquisition and language exposure on bilingual lexical access. Brain and Language 195, 104643. https://doi.org/10.1016/j.bandl.2019.104643CrossRefGoogle Scholar
Plunkett, K and Marchman, VA (1996) Learning from a connectionist model of the acquisition of the English past tense. Cognition 61(3), 299308. https://doi.org/10.1016/s0010-0277(96)00721-4CrossRefGoogle ScholarPubMed
Shook, A and Marian, V (2013) The bilingual language interaction network for comprehension of speech. Bilingualism: Language and Cognition 16(2), 304324.10.1017/S1366728912000466CrossRefGoogle Scholar
Shirai, Y (2018) Connectionism and second language acquisition. Routledge. https://doi.org/10.4324/9780203118085CrossRefGoogle Scholar
Thomas, MSC (1997) Connectionist networks and knowledge representation: The case of bilingual lexical processing. PhD thesis, Oxford University.Google Scholar
Wen, Z, Biedroń, A and Skehan, P (2017). Foreign language aptitude theory: Yesterday, today and tomorrow. Language Teaching 50(1), 131. doi:10.1017/S0261444816000276CrossRefGoogle Scholar
Zhang, X, Yang, J, Wang, R and Li, P (2020) A neuroimaging study of semantic representation in first and second languages. Language, Cognition, and Neuroscience 35, 12231238. https://doi.org/10.1080/23273798.2020.1738509CrossRefGoogle Scholar
Zhao, X and Li, P (2010) Bilingual lexical interactions in an unsupervised neural network model. International Journal of Bilingual Education and Bilingualism. 13, 505524.10.1080/13670050.2010.488284CrossRefGoogle Scholar
Zhao, X and Li, P (2013) Simulating cross-language priming with a dynamic computational model of the lexicon. Bilingualism: Language and Cognition 16, 288303.CrossRefGoogle Scholar
Zhao, X and Li, P (2021) Fuzzy or Clear? A Computational Approach Towards Dynamic L2 Lexical-Semantic Representation. Frontiers in Communication. Ms. Under review10.3389/fcomm.2021.726443CrossRefGoogle Scholar