Hostname: page-component-745bb68f8f-kw2vx Total loading time: 0 Render date: 2025-02-06T02:39:58.087Z Has data issue: false hasContentIssue false

The importance of formal modelling for the development of cognitive theory

Published online by Cambridge University Press:  08 November 2021

Randall K. Jamieson*
Affiliation:
University of Manitoba, Winnipeg, Canada
Brendan T. Johns
Affiliation:
McGill University, Montreal, Canada
Vanessa Taler
Affiliation:
University of Ottawa, Ottawa, Canada
Michael N. Jones
Affiliation:
Indiana University, Bloomington, USA
*
Address for correspondence: Randall K. Jamieson Department of Psychology University of Manitoba Winnipeg, MB, Canada R3T 2N2 Email: randy.jamieson@umanitoba.ca
Rights & Permissions [Opens in a new window]

Abstract

Type
Peer Commentaries
Copyright
Copyright © The Author(s), 2021. Published by Cambridge University Press

Bordag, Gor, and Opitz (Reference Bordag, Gor and Opitz2021) present the Ontogenesis Model of L2 Lexical Representation (OM) as an explanation of how words in a second language are represented in memory. According to the model, a word is represented by six quantities: three indicating how well its orthography, phonology, and semantics are learned and three representing how strongly those representations are interconnected. Over time, a word's orthographic, phonologic, and semantic representations become better learned and inter-connected (see Figure 4 of the article). As outlined in the paper, that framework provides insight into a broad range of lexical behaviours and empirical regularities from the study of second-language acquisition.

The OM is a timely and valuable contribution. It presents a broad and integrative approach to think about lexical representation. It confronts complexities of multilingual representation. It acknowledges the necessity of developing formal models to handle the scale of a problem as large as language. However, the OM stops short of its principal goal.

Although the OM is a theory of lexical representation, it does not define what a lexical representation is or how a lexical representation develops as a function of language experience. Rather, it assumes words have lexical representations; and focuses, instead, on modelling how quickly and how completely those representations are learned. Thus, if one asks what the OM knows about <CAT>, it can report how well <CAT> is learned but cannot report what <CAT> means. In that sense, the OM is more like a theory of associative learning than a theory of mental representation.

As a point of comparison, consider current models of lexical representation. Theories such as HAL (Lund & Burgess, Reference Lund and Burgess1996), LSA (Landauer & Dumais, Reference Landauer and Dumais1997), BEAGLE (Jones & Mewhort, Reference Jones and Mewhort2007), Word2Vec (Mikolov, Sutskever, Chen, Corrado & Dean, Reference Mikolov, Sutskever, Chen, Corrado and Dean2013), and GloVe (Pennington, Socher & Manning, Reference Pennington, Socher and Manning2014) are fully specified models that, when applied to a record of natural language (e.g., newspapers, novels, or internet chatter), derive a unique vector to represent each word. Once derived, those vectors can be decomposed into their representational elements (Hollis & Westbury, Reference Hollis and Westbury2016), applied, and compared to human lexical behaviour. Consistent with the OM's goals, those theories track changes in lexical behaviour as a function of age (Montag, Jones & Smith, Reference Montag, Jones and Smith2015), reading history (Aujla, Reference Aujlain press), culture (Johns & Jamieson, Reference Johns and Jamieson2019), and multilingual language exposure (Johns, Sheppard, Jones & Taler, Reference Johns, Sheppard, Jones and Taler2016). Those representations can also stand in for human knowledge in computer models of memory (Johns, Jones & Mewhort, Reference Johns, Jones and Mewhort2021), decision making (Bhatia, Reference Bhatia2017), language production (Johns, Jamieson, Crump, Jones & Mewhort, Reference Johns, Jamieson, Crump, Jones and Mewhort2020), and cognitive disorder (Johns et al., Reference Johns, Taler, Pisoni, Farlow, Hake, Kareken, Unverzagt and Jones2018). Thus, whereas the OM provides a framework for thinking about lexical representations, modern theories already articulate methods to directly and computationally derive those representations.

In summary, the OM is an intriguing theoretical framework to explain second-language learning. However, its breadth of vision obscures its specificity of explanation. Our goal is to encourage a formalization of the OM, perhaps grounded in a theory we have named or in a different kind of theory that we have not named (Griffiths, Steyvers & Tenenbaum, Reference Griffiths, Steyvers and Tenenbaum2007; Jamieson, Avery, Johns & Jones, Reference Jamieson, Avery, Johns and Jones2018; Kwantes, Reference Kwantes2005). No matter the outcome, a formal expression of the OM will allow researchers to interrogate the model, compare it to state-of-the-art language models, and leverage its insights to make discoveries. We are hopeful that future work can meet that goal.

References

Aujla, H (2021) Language experience predicts semantic priming of lexical decision. Canadian Journal of Experimental Psychology. 75, 235–244. https://doi.org/10.1037/cep0000255CrossRefGoogle ScholarPubMed
Bhatia, S (2017) Associative judgment and vector space semantics. Psychological Review 124, 120.CrossRefGoogle ScholarPubMed
Bordag, D, Gor, K and Opitz, A (2021) Ontogenesis Model of the L2 Lexical Representation. Bilingualism: Language and Cognition, 117. doi:10.1017/S1366728921000250Google Scholar
Griffiths, TL, Steyvers, M and Tenenbaum, JB (2007) Topics in semantic representation. Psychological Review 114, 211244.CrossRefGoogle ScholarPubMed
Hollis, G and Westbury, C (2016) The principals of meaning: Extracting semantic dimensions from co-occurrence models of semantics. Psychonomic Bulletin & Review 23, 17441756.CrossRefGoogle ScholarPubMed
Jamieson, RK, Avery, JE, Johns, BT and Jones, MN (2018) An instance theory of semantic memory. Computational Brain & Behavior 1, 119136.CrossRefGoogle Scholar
Johns, BT and Jamieson, RK (2019) The influence of time and place on lexical behavior: A distributional analysis. Behavior Research Methods 51, 2483–2453.CrossRefGoogle ScholarPubMed
Johns, BT, Jamieson, RK, Crump, MJC, Jones, MN and Mewhort, DJK (2020) Production without rules: Using an instance memory model to exploit structure in natural language. Journal of Memory and Language, 104165.CrossRefGoogle Scholar
Johns, BT, Jones, MN and Mewhort, DJK (2021) A continuous source reinstatement model of true and false recognition. Canadian Journal of Experimental Psychology 75, 118.CrossRefGoogle Scholar
Johns, BT, Sheppard, C, Jones, MN and Taler, V (2016) The role of semantic diversity in lexical organization across aging and bilingualism. Frontiers in Psychology 7, 703.CrossRefGoogle Scholar
Johns, BT, Taler, V, Pisoni, DB, Farlow, MR, Hake, AM, Kareken, DA, Unverzagt, FW and Jones, MN (2018) Cognitive modeling as an interface between brain and behavior: Measuring the semantic decline in mild cognitive impairment. Canadian Journal of Experimental Psychology 72, 117126.CrossRefGoogle ScholarPubMed
Jones, MN and Mewhort, DJK (2007) Representing word meaning and order information in a composite holographic lexicon. Psychological Review 114, 137.CrossRefGoogle Scholar
Kwantes, PJ (2005) Using context to build semantics. Psychonomic Bulletin & Review 12, 703710.CrossRefGoogle ScholarPubMed
Landauer, TK and Dumais, ST (1997) A solution to Plato's problem: The latent semantic analysis theory of the acquisition, induction, and representation of knowledge. Psychological Review 104, 211240.CrossRefGoogle Scholar
Lund, K and Burgess, C (1996) Producing high-dimensional semantic spaces from lexical co-occurrence. Behavior Research Methods, Instruments, & Computers 28, 203208.CrossRefGoogle Scholar
Mikolov, T, Sutskever, I, Chen, K, Corrado, GS and Dean, J (2013) Distributed representations of words and phrases and their compositionality. In Advances in Neural Information Processing Systems (pp. 3111–3119).Google Scholar
Montag, JL, Jones, MN and Smith, LB (2015) The Words Children Hear: Picture Books and the Statistics for Language Learning. Psychological science 26, 14891496.CrossRefGoogle ScholarPubMed
Pennington, J, Socher, R and Manning, CD (2014) GloVe: Global vectors for word representation. Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, 1532–1543.CrossRefGoogle Scholar