THE LEMMA DILEMMA: HOW SHOULD WORDS BE OPERATIONALIZED IN RESEARCH AND PEDAGOGY?

Stuart Webb

doi:10.1017/S0272263121000784

Published online by Cambridge University Press: 17 December 2021

Stuart Webb*, University of Western Ontario
*Corresponding author. E-mail: swebb27@uwo.ca.

Abstract

Recently there has been some debate about the appropriacy of different lexical units in pedagogy and research (e.g., Brown et al., 2020; Dang & Webb, 2016a; Kremmel, 2016; Laufer & Cobb, 2020; McLean, 2018; Nation, 2016; Nation & Webb, 2011; Vilkaitė-Lozdienė & Schmitt, 2020). The lexical unit (word types, lemmas, flemmas, word families) needs to be considered when developing wordlists, vocabulary tests, and vocabulary learning programs. It is also central to the lexical profiles of text and corpora, which indicate the vocabulary learning targets associated with understanding different types of discourse. Perhaps most importantly, the lexical unit of words found in vocabulary learning resources such as word lists and tests may affect their pedagogical value. The aim of this article is to highlight aspects of research and pedagogy that are affected by lexical units and describe issues that should be considered when operationalizing words in studies of vocabulary and learning resources.

Type: Critical Commentary
Information: Studies in Second Language Acquisition, Volume 43, Issue 5, December 2021, pp. 941–949

DOI: https://doi.org/10.1017/S0272263121000784
Copyright: © The Author(s), 2021. Published by Cambridge University Press

Words are defined and categorized in many ways. One approach to defining words relates to the different forms in which they occur. The most common of the classifications related to word form are the word type, lemma, flemma, and word family. Word types consist of each unique word form. If we operationalize words as word types, then record, records, and unrecorded are different words. Lemmas are made up of a headword and its inflections, all of which have the same part of speech. If we classify words as lemmas, then a headword (e.g., record) and its inflections (records, recorded, and recording) would be categorized as the same word. Flemmas are a more recent classification type and are similar to lemmas but do not take part of speech into consideration. For example, record and records would make up one lemma as a noun, and record, records, recorded, and recording would make up another lemma as a verb; however, the items in both of these lemmas would be included in one flemma. Word families are made up of a headword, its inflections, and derivations.¹ If we use word family as the category, then we would also include derivations such as prerecord, recorder, and unrecorded along with their inflections (prerecords, prerecorded, prerecording, records, recording, recordings, recorder, recorders) within the word family for the headword record. Thus, word types provide the narrowest definition of words in these examples, while word families provide the broadest definition.
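The four classifications can be illustrated with a minimal Python sketch. The hand-coded table below (word form, part of speech, lemma headword, family headword) is an assumption made for illustration from the record examples above; it is not drawn from any published word list, and the name count_units is hypothetical.

```python
# Hand-coded illustration only: each entry is
# (word form, part of speech, lemma headword, word family headword).
# The analysis is an assumption based on the "record" examples in the text.
ENTRIES = [
    ("record", "noun", "record", "record"),
    ("records", "noun", "record", "record"),
    ("record", "verb", "record", "record"),
    ("records", "verb", "record", "record"),
    ("recorded", "verb", "record", "record"),
    ("recording", "verb", "record", "record"),
    ("recorder", "noun", "recorder", "record"),
    ("recorders", "noun", "recorder", "record"),
    ("prerecord", "verb", "prerecord", "record"),
    ("unrecorded", "adjective", "unrecorded", "record"),
]


def count_units(entries):
    """Count the distinct units the entries fall into under each definition."""
    word_types = {form for form, _, _, _ in entries}        # unique forms
    lemmas = {(head, pos) for _, pos, head, _ in entries}   # headword + part of speech
    flemmas = {head for _, _, head, _ in entries}           # part of speech ignored
    families = {family for _, _, _, family in entries}      # inflections + derivations
    return {
        "word types": len(word_types),
        "lemmas": len(lemmas),
        "flemmas": len(flemmas),
        "word families": len(families),
    }


counts = count_units(ENTRIES)
print(counts)
```

Under these assumptions the ten entries reduce to 8 word types, 5 lemmas, 4 flemmas, and 1 word family, which mirrors the narrowest-to-broadest ordering described above.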

The greatest value of larger lexical units may lie in pedagogy. Presenting headwords together with their inflections and derivations may provide a shortcut to lexical development. It is likely easier to learn different forms of the same words than to learn the same number of unrelated words. Moreover, learning headwords together with their related forms is likely beneficial for learning the inflectional and derivational systems. The greatest value of smaller lexical units may lie in research. Measuring knowledge of smaller lexical units should provide more precise findings than when using larger units because the smaller the lexical unit that is used on a vocabulary test, the more representative that test is of the vocabulary that is assessed. Moreover, ranking words according to their frequency in language is more precise when using smaller units because the ranking is more representative of the headwords in the list.

Although the preceding discussion promotes the value of larger lexical units for pedagogy and smaller lexical units for research, it would likely be misleading to suggest that one lexical unit is most appropriate for all contexts, whether within or across research and pedagogy. The reason for this is that there are several factors that likely affect the value of a lexical unit. The most significant factors might be vocabulary size, morphological knowledge, and proficiency, with each of these factors interrelated to some degree (Bertram et al., 2000; Nagy et al., 1989; Wysocki & Jenkins, 1987). Smaller lexical units appear more sensible with less proficient learners who are unable to recognize the similarities between different forms of a word. In contrast, larger lexical units appear more sensible with more proficient learners who have gained knowledge of the inflectional and derivational systems. Thus, the proficiency of learners should be reported in any discussions of the appropriacy of different lexical units.

The type of lexical knowledge, receptive or productive, is another factor likely to affect the value of the lexical unit. The most commonly presented argument for using word families as the lexical unit is that if learners have knowledge of the form-meaning connections of a family member (e.g., pleasant) as well as knowledge of the morphological system, then they may be able to understand other unfamiliar members of the family (e.g., pleasantly, unpleasant) when they are encountered in context (Nation, 2016; Nation & Webb, 2011; Vilkaitė-Lozdienė & Schmitt, 2020). There is support from L1 research for this argument (Wysocki & Jenkins, 1987). However, there are no studies that have investigated the extent to which derivatives of known L2 headwords can be successfully inferred during reading, listening, and viewing. In contrast, research tends to indicate that both L1 speakers and L2 learners find it challenging to produce all of the derivatives of headwords (Iwaizumi & Webb, 2021; Schmitt & Zimmerman, 2002). Moreover, being able to use a word correctly does not ensure that other morphologically related forms of that word can be used correctly. Thus, researchers tend to agree that word families are not an appropriate lexical unit for measuring productive knowledge (Nation, 2016; Nation & Webb, 2011; Vilkaitė-Lozdienė & Schmitt, 2020).

How might the lexical unit affect research and pedagogy?

L2 learning

There is little research investigating the degree to which the lexical unit influences L2 vocabulary learning. Moreover, there is also a lack of clarity about the extent to which words and their inflected and derived forms are taught and learned together. The similarity between inflected and derived forms should make it easier to learn the different members of lemmas and word families than to learn unrelated words. However, this variation in form may increase the difficulty of learning words encountered in L2 input, at least initially before the morphological system is learned. For example, it is reasonable to question whether encountering different inflected and derived forms of unfamiliar words when reading or listening, rather than encountering the same unfamiliar word type repeatedly, affects comprehension and incidental vocabulary learning gains. This is because variation in the forms of unfamiliar items may make it less likely that they are recognized and understood (Reynolds, 2013). The degree to which derived and inflected forms of known words are recognized when they are encountered in meaningful contexts is a useful avenue for further research.

An advantage to researching vocabulary learning using larger lexical units may be that it has greater ecological validity than using smaller lexical units. Teachers and learners are unlikely to control for word form variation during the learning process except in the early stages of lexical development when learners lack knowledge of inflectional and derivational affixes. Once learners have gained knowledge of the English inflectional system and some knowledge of the highest frequency affixes, encounters with different inflected and derived forms are likely to be viewed as opportunities to further develop and strengthen vocabulary knowledge. The disadvantage of researching vocabulary learning using larger lexical units is a lack of clarity of findings (Reynolds, 2013; Reynolds & Wible, 2014). Research investigating L2 vocabulary learning has rarely reported whether word types, lemmas, or word families were learned. Reynolds and Wible (2014) found that within incidental vocabulary learning research the lexical unit differed, with some studies using word types (e.g., Rott, 1999), and other studies using lemmas (e.g., Webb, 2007) and word families (e.g., Pellicer-Sánchez & Schmitt, 2010). Moreover, Reynolds (2015) found some evidence that variation in word form during reading impacted vocabulary learning. Fewer words that varied in form over two to four encounters were learned than those that had no variation in form, with no difference in the amount of learning between inflected and derived forms. This is a useful starting point for further research. Examining the degree to which morphological complexity affects vocabulary learning and retention would be a useful area for further research. There would also be value in investigating other questions such as: To what extent are the inflected and derived forms of words learned together? Is it more effective to learn the same members of a word family together or apart? To what extent do learners with different vocabulary sizes have knowledge of the L2 inflectional and derivational systems? How many times do learners need to encounter unfamiliar L2 derivations to recognize and recall their meanings?

Word lists

There are several reasons why word lists are closely linked with the topic of lexical units. First, the lexical unit varies between word lists. The Academic Word List (Coxhead, 2000), Nation’s (2006) British National Corpus word lists, and Nation’s (2012) British National Corpus/Corpus of Contemporary American English word lists are all made up of word families. There are also several lemma-based word lists. Brezina and Gablasova’s (2015) New General Service List and Gardner and Davies’s (2014) Academic Vocabulary List were both developed using lemmas as the unit of counting words. There are also several lists with multiple lexical units. There are flemma and word family versions of the Academic Spoken Word List (Dang et al., 2017), and word type and lemma-based versions of the Essential Word List (Dang & Webb, 2016b). There is also a version of Gardner and Davies’s (2014) Academic Vocabulary List that consists of word families to go along with the lemma-based version from which it was originally developed. Second, word lists are used as the source of lexical frequency information in lexical profiling studies (for a review of these studies see Nurmukhamedov & Webb, 2019). This has led to studies recommending vocabulary learning targets indicative of listening (Dang & Webb, 2014; Van Zeeland & Schmitt, 2013), reading (e.g., Nation, 2006; Webb & Macalister, 2013), and viewing comprehension (e.g., Webb & Rodgers, 2009). Because the word lists used in lexical profiling studies have used word families as the lexical unit, all these vocabulary learning targets have consisted of learning certain numbers of word families. If the word lists used in lexical profiling studies used a different lexical unit, the targets might be slightly different. Third, word lists are also used to source items according to their frequencies in tests such as the Vocabulary Levels Test (Nation, 1983; Schmitt et al., 2001; Webb et al., 2017) and the Vocabulary Size Test (Coxhead et al., 2015; Nation & Beglar, 2007).

The lexical unit of the items that make up a word list may affect its validity in two ways. First, smaller lexical units should provide greater transparency about the relative value of the words in a list. This is because there is less ambiguity about the words that provide value within the lexical unit. For example, the frequencies of the different members of the word family for the headword replace in Mark Davies’s (2008–) Corpus of Contemporary American English are as follows: replace 32215, replaced 29221, replacement 16774, replacing 10873, replaces 3251, replacements 2334, irreplaceable 1020, replaceable 561, replacer 92, and replacers 13. The variation in frequencies among the different members makes the value of each item less transparent. If replace is in a list made up of word types its value is clear. If replace is in a list of lemmas, the value of its items is less transparent because the frequencies of the members range from 3251 to 32215 occurrences. If replace is in a list of word families the value of each item within the family is much more opaque with eight members being relatively frequent and two members being infrequent. Second, word lists are typically created in relation to the amount of lexical coverage that they provide in corpora. This is sensible because the greater the lexical coverage that a list provides, the greater its potential value to learners. However, because larger lexical units are made up of a greater number of word types than smaller units, lists that use larger lexical units are likely to account for more coverage. For example, the 1,000 most frequent word types, flemmas, and word families accounted for 76.46%, 80.97%, and 82.95% coverage of a 14-million-word corpus (Nation, 2016). These differences in coverage make it challenging to evaluate the value of word lists made up of different lexical units (Dang & Webb, 2016a).
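The transparency point can be made concrete with a short Python sketch that aggregates the replace frequencies quoted above at three levels. The membership of the verb lemma (the headword plus its verbal inflections) is an assumption made for illustration, and the variable names are hypothetical.

```python
# Frequencies for the "replace" word family as quoted in the text.
FREQ = {
    "replace": 32215, "replaced": 29221, "replacement": 16774,
    "replacing": 10873, "replaces": 3251, "replacements": 2334,
    "irreplaceable": 1020, "replaceable": 561, "replacer": 92,
    "replacers": 13,
}

# Assumption: the verb lemma is the headword plus its verbal inflections.
VERB_LEMMA = ["replace", "replaced", "replacing", "replaces"]

type_freq = FREQ["replace"]                     # word type: one form only
lemma_freq = sum(FREQ[w] for w in VERB_LEMMA)   # lemma: headword + inflections
family_freq = sum(FREQ.values())                # family: all ten members

# Share of each unit's total frequency contributed by the headword form.
for label, total in [("word type", type_freq),
                     ("lemma", lemma_freq),
                     ("word family", family_freq)]:
    share = 100 * type_freq / total
    print(f"{label}: total {total}, headword share {share:.1f}%")
```

The larger the unit, the smaller the share of its total frequency that any single member accounts for, which is one way of seeing why the value of an individual item becomes harder to read off a family-based list.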

It is important to note that although word lists are primarily created as resources to aid the learning of vocabulary, much of the discussion in relation to the appropriacy of lexical units in word lists is focused on research rather than pedagogy. The presentation of words in larger lexical units would appear to have value for teaching and learning because it allows learners and teachers to quickly find and study a word, its inflections, and derivations, and this may lead to more efficient gains in lexical knowledge. However, there is no empirical support for this assumption, and therefore it would be useful to examine this in future research.

Assessing vocabulary knowledge

The advantage of using larger lexical units such as word families in tests is that, by measuring knowledge of the form-meaning connections of morphologically unrelated words (e.g., play, take, keep rather than play, plays, playful), tests tap into L2 learning of distinct words without tapping into knowledge of the morphological system. The advantage of using smaller lexical units in tests of form-meaning connection is that by assessing knowledge of both morphologically related and unrelated words, a test should provide a more precise measurement of lexical knowledge (Kremmel, 2016). However, the smaller the lexical unit, the larger the number of words that would require assessment. For example, Nation (2016) reported that the most frequent 1000 word families were made up of 3,281 lemmas and 6,838 word types. Measuring a much greater sample of items requires a much greater number of test items. If we were to follow the ratio of 30 items per 1000 words used in earlier versions of the Vocabulary Levels Test (Schmitt et al., 2001; Webb et al., 2017), we would go from a 30-item test to measure knowledge of 1000 word families to a 98-item test to measure knowledge of the 3,281 lemmas, and a 205-item test to measure knowledge of the 6,838 word types. Thus, it would probably make little sense to measure vocabulary size or levels with smaller lexical units. However, there might be great value in developing and validating tests of form-meaning connection designed to evaluate the vocabulary knowledge of beginning L2 learners who are still in the process of learning word parts. It would be useful to create tests measuring knowledge of the most frequent 800 lemmas in Dang and Webb’s (2016b) Essential Word List, which accounts for 75% of spoken and written English, or the 2,494 lemmas in Brezina and Gablasova’s (2015) New General Service List (accounting for 80–82% of the corpora from which it was derived) for a more ambitious evaluation of beginner vocabulary knowledge.
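The test-length arithmetic in this section can be checked with a few lines of Python. This is a sketch under stated assumptions: it applies the ratio of 30 items per 1000 units to the counts attributed to Nation (2016), using the 6,838 figure for word types, and rounds to the nearest whole item. The helper items_needed is hypothetical.

```python
ITEMS_PER_1000 = 30  # sampling ratio cited for earlier Vocabulary Levels Tests

# Unit counts for the most frequent 1000 word families, as reported in the text.
UNIT_COUNTS = {
    "word families": 1000,
    "lemmas": 3281,
    "word types": 6838,
}


def items_needed(n_units, ratio=ITEMS_PER_1000):
    """Number of test items required at `ratio` items per 1000 units."""
    return round(n_units * ratio / 1000)


test_lengths = {unit: items_needed(n) for unit, n in UNIT_COUNTS.items()}
for unit, n_items in test_lengths.items():
    print(f"{unit}: {n_items} items")
```

With these inputs the sketch gives 30, 98, and 205 items for word families, lemmas, and word types respectively.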

Tests such as the Vocabulary Levels Test (Nation, 1983; Schmitt et al., 2001; Webb et al., 2017) and the Vocabulary Size Test (Coxhead et al., 2015; Nation & Beglar, 2007) were developed to provide a reliable measure of receptive knowledge of the form-meaning connections of words across different word frequency levels. These tests use word families as the lexical unit. This means that the tests include one item for each family (e.g., admire) that is assessed without measuring knowledge of its other family members (admires, admired, admiring, admirable, admirably, admiration, admirer, admirers, admiringly). Although tests that have used word families as the lexical unit were not developed to evaluate knowledge of other family members, there might be the assumption that these tests indicate knowledge of not only the item included in the test, but of all members of a word family for each item. Two earlier studies indicate that this is unlikely to be correct, at least using receptive recall test formats with minimal or no contextual information provided to cue responses (McLean, 2018; Ward & Chuenjundaeng, 2009). The degree to which knowledge of headwords indicates knowledge of other family members using recognition formats such as multiple-choice or matching is yet to be examined in research, but would be a useful follow-up to these studies. There is also a need to examine the degree to which factors such as test format, contextual cues, item (headword, inflection, and derivation) frequency, receptive vocabulary knowledge, and proficiency affect the degree to which learners are able to demonstrate receptive and productive knowledge of L2 headwords, inflections, and derivations. However, perhaps of greatest value would be the development of tests designed to measure derivational knowledge of words at different frequencies. Receptive and productive tests of derivational knowledge could be used together with tests measuring knowledge of form-meaning connection and word parts to assess L2 learner vocabulary knowledge more accurately.

Lexical coverage and profiling

Lexical coverage refers to the percentage of known words encountered in input. Research indicates that 95% lexical coverage can provide adequate reading comprehension (Laufer, 1989), but that 98% coverage may be optimal (Hu & Nation, 2000; Schmitt et al., 2011). Research also indicates that 90% lexical coverage may be sufficient for listening (Van Zeeland & Schmitt, 2013) and viewing comprehension (Durbahn et al., 2020). However, as lexical coverage increases beyond 90%, comprehension is likely to improve. Taken together, studies of lexical coverage indicate that the more words that are known in L2 input, the more likely that L2 input will be understood.
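As a minimal sketch of the definition, the Python function below computes lexical coverage as the percentage of running words in a text that appear in a set of known words. The tokenizer, the toy text, and the name lexical_coverage are assumptions made for illustration; the sketch counts word types, so a learner credited with a word family would need every family member added to the known set.

```python
import re


def lexical_coverage(text, known_words):
    """Percentage of running words (tokens) in `text` found in `known_words`."""
    # Simplifying assumption: a token is any run of letters or apostrophes.
    tokens = re.findall(r"[a-z']+", text.lower())
    if not tokens:
        return 0.0
    known = sum(1 for token in tokens if token in known_words)
    return 100 * known / len(tokens)


# Toy text of 20 running words in which only "perch" is unknown.
TEXT = ("The cat sat on the mat and the dog sat on the rug "
        "and the bird sat on the perch")
KNOWN = {"the", "cat", "sat", "on", "mat", "and", "dog", "rug", "bird"}

coverage = lexical_coverage(TEXT, KNOWN)
print(f"{coverage:.1f}% coverage")
```

Here one unknown word in twenty running words gives 95% coverage, the figure associated above with adequate reading comprehension.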

The extent to which inflected and derivative forms affect comprehension in studies of lexical coverage has not been examined. Recent studies present contrasting arguments about how the lexical unit may affect lexical coverage. Brown (2018) found that 13.4% of the members of the most frequent 5000 word families in Nation’s (2006) British National Corpus word lists were derivations. This led him to suggest that this proportion of derivations may reduce lexical coverage of written text (a larger percentage of words would be unknown) and, in turn, inhibit reading comprehension. Brown et al. (2020) also argue that if L2 learner knowledge is evaluated using tests that use word families as the unit of counting, lexical coverage and comprehension of L2 input may be overestimated if learners cannot understand derivatives. In contrast, Laufer and Cobb (2020) conducted a corpus-driven study of several written text types and found that relatively few derivations were encountered in the texts and a large proportion of those that were encountered included the highest frequency affixes. This led them to suggest that lexical coverage is unlikely to be affected by the use of word families as the lexical unit.

It is important to note that studies of lexical coverage tend to use carefully controlled research designs which involve replacing varying proportions of lower frequency words encountered in a text with pseudowords to provide an accurate estimate of lexical coverage (e.g., Hu & Nation, 2000; Van Zeeland & Schmitt, 2013). These studies include derivations as running words in the texts and so it would appear that lexical coverage findings are based to some degree on word families as the lexical unit. However, the degree to which the proportion of derived and inflected forms within a text affects both comprehension and lexical coverage thresholds remains to be examined and would be a useful direction for further research.

Lexical profiling research indicates vocabulary learning targets that may be sufficient for reading, listening, and viewing comprehension. For example, lexical profiling studies indicate that knowledge of the most frequent 3000 word families may be sufficient to understand television (Webb & Rodgers, 2009), the most frequent 4000 word families may allow comprehension of academic lectures (Dang & Webb, 2014), and the most frequent 8000–9000 word families may be sufficient to understand most forms of written text (Nation, 2006). There tends to be an assumption in lexical profiling studies that learners who have achieved these learning targets are likely to have learned the inflectional and derivational systems. However, the extent to which learners have morphological knowledge in relation to different vocabulary levels remains to be explored. If learners are unable to understand derivative and inflected forms of headwords encountered during reading, listening, and viewing, then these learning targets might be too low. The only studies that have examined comprehension with learners at differing vocabulary levels have involved comprehension of television. However, both Rodgers (2013) and Durbahn et al. (2020) found that learners who knew fewer than the 3000 word family vocabulary learning target (Webb & Rodgers, 2009) were able to understand different TV programs. Further research investigating the degree to which learners with varying L2 vocabulary levels can understand different types of L2 input is needed.

Conclusion

Recently, discussions of lexical units have presented flemmas and word families (McLean, 2018) and lemmas and word families (Brown et al., 2020) as dichotomous options of which one is more appropriate than the other. It is useful to question and investigate the appropriacy of lexical units. However, it would be surprising if one lexical unit makes the most sense for all learners and all aspects of L2 research and pedagogy. With little L2 research conducted on the appropriacy of the different lexical units, researchers should be cautious not to overgeneralize findings. This article has argued that the selection of a lexical unit should depend on several factors. These include learner variables such as vocabulary size, morphological knowledge, and proficiency, the purpose of the lexical unit (research, pedagogy), and the type of use (vocabulary learning, measuring vocabulary knowledge, developing word lists and vocabulary tests, lexical coverage and profiling). This article has also tried to highlight several of the areas in which future research on lexical units is warranted.

Footnotes

¹ In their seminal article on word families, Bauer and Nation (1993) suggest seven levels of word families. However, because there is no empirical evidence to support progressions in knowledge of word families across the levels, it may be most appropriate to simply define word families as including both the inflections and derivations of a headword. This would make the definition more transparent and more easily applied to research and pedagogy.

References

Bauer, L., & Nation, I. S. P. (1993). Word families. International Journal of Lexicography, 6, 253–279.

Bertram, R., Laine, M., & Virkkala, M. M. (2000). The role of derivational morphology in vocabulary acquisition: Get by with a little help from my morpheme friends. Scandinavian Journal of Psychology, 41, 287–296.

Brezina, V., & Gablasova, D. (2015). Is there a core general vocabulary? Introducing the new general service list. Applied Linguistics, 36, 1–22.

Brown, D. (2018). Examining the word family through word lists. Vocabulary Learning and Instruction, 7, 51–65.

Brown, D., Stoeckel, T., Mclean, S., & Stewart, J. (2020). The most appropriate lexical unit for L2 vocabulary research and pedagogy: A brief review of the evidence. Applied Linguistics. Advance online publication. https://doi.org/10.1093/applin/amaa061

Coxhead, A. (2000). A new academic word list. TESOL Quarterly, 34, 213–238.

Coxhead, A., Nation, P., & Sim, D. (2015). Measuring the vocabulary size of native speakers of English in New Zealand secondary schools. New Zealand Journal of Educational Studies, 50, 121–135.

Dang, T. N. Y., Coxhead, A., & Webb, S. (2017). The Academic Spoken Word List. Language Learning, 67, 959–997.

Dang, T. N. Y., & Webb, S. (2014). The lexical profile of academic spoken English. English for Specific Purposes, 33, 66–76.

Dang, T. N. Y., & Webb, S. (2016a). Evaluating lists of high frequency vocabulary. ITL–International Journal of Applied Linguistics, 167, 132–158.

Dang, T. N. Y., & Webb, S. (2016b). Making an essential word list for beginners. In Nation, I. S. P., Making and using word lists for language learning and testing (pp. 153–167, 188–195). John Benjamins.

Davies, M. (2008–). The Corpus of Contemporary American English (COCA). https://www.english-corpora.org/coca/

Durbahn, M., Rodgers, M., & Peters, E. (2020). The relationship between vocabulary and viewing comprehension. System, 88, 102166.

Gardner, D., & Davies, M. (2014). A new academic vocabulary list. Applied Linguistics, 35, 305–327.

Hu, M., & Nation, I. S. P. (2000). Vocabulary density and reading comprehension. Reading in a Foreign Language, 13, 403–430.

Iwaizumi, E., & Webb, S. (2021). Measuring L1 and L2 productive derivational knowledge: How many derivatives can L1 and L2 learners with differing vocabulary levels produce? TESOL Quarterly. Advance online publication. https://doi.org/10.1002/tesq.3035

Kremmel, B. (2016). Word families and frequency bands in vocabulary tests: Challenging conventions. TESOL Quarterly, 50, 976–987.

Laufer, B. (1989). What percentage of text lexis is essential for comprehension? In Lauren, C. & Nordman, M. (Eds.), Special language: From humans thinking to thinking machines (pp. 316–323). Multilingual Matters.

Laufer, B., & Cobb, T. (2020). How much knowledge of derived words is needed for reading? Applied Linguistics, 41, 971–998.

McLean, S. (2018). Evidence for the adoption of the flemma as an appropriate word counting unit. Applied Linguistics, 39, 823–845.

Nagy, W., Anderson, R. C., Schommer, M., Scott, J. A., & Stallman, A. C. (1989). Morphological families in the internal lexicon. Reading Research Quarterly, 24, 262–282.

Nation, I. S. P. (1983). Testing and teaching vocabulary. Guidelines, 5, 12–25.

Nation, I. S. P. (2006). How large a vocabulary is needed for reading and listening? The Canadian Modern Language Review, 63, 59–82.

Nation, I. S. P. (2012). The BNC/COCA word family lists. http://www.victoria.ac.nz/lals/about/staff/paul-nation

Nation, I. S. P. (2016). Making and using word lists for language learning and testing. John Benjamins Publishing Company.

Nation, P., & Beglar, D. (2007). A vocabulary size test. The Language Teacher, 31, 9–13.

Nation, I. S. P., & Webb, S. (2011). Researching and analyzing vocabulary. Heinle.

Nurmukhamedov, U., & Webb, S. (2019). Research timeline: Lexical coverage and profiling. Language Teaching, 52, 188–200.

Pellicer-Sánchez, A., & Schmitt, N. (2010). Incidental vocabulary acquisition from an authentic novel: Do things fall apart? Reading in a Foreign Language, 22, 31–55.

Reynolds, B. L. (2013). Comments on Stuart Webb and John Macalister’s “Is text written for children useful for L2 extensive reading?” TESOL Quarterly, 47, 849–852.

Reynolds, B. L. (2015). The effects of word form variation and frequency on second language incidental vocabulary acquisition through reading. Applied Linguistics Review, 6, 467–497.

Reynolds, B. L., & Wible, D. (2014). Frequency in incidental vocabulary acquisition research: An undefined concept and some consequences. TESOL Quarterly, 48, 843–861.

Rodgers, M. P. (2013). English language learning through viewing television: An investigation of comprehension, incidental vocabulary acquisition, lexical coverage, attitudes, and captions. Unpublished PhD Thesis, Victoria University of Wellington, Wellington, New Zealand.

Rott, S. (1999). The effect of exposure frequency on intermediate language learners’ incidental vocabulary acquisition and retention through reading. Studies in Second Language Acquisition, 21, 589–619.

Schmitt, N., Jiang, X., & Grabe, W. (2011). The percentage of words known in a text and reading comprehension. The Modern Language Journal, 95, 26–43.

Schmitt, N., Schmitt, D., & Clapham, C. (2001). Developing and exploring the behaviour of two new versions of the Vocabulary Levels Test. Language Testing, 18, 55–88.

Schmitt, N., & Zimmerman, C. B. (2002). Derivative word forms: What do learners know? TESOL Quarterly, 36, 145–171.

Van Zeeland, H., & Schmitt, N. (2013). Lexical coverage in L1 and L2 listening comprehension: The same or different from reading comprehension? Applied Linguistics, 34, 457–479.

Vilkaitė-Lozdienė, L., & Schmitt, N. (2020). Frequency as a guide for vocabulary usefulness: High-, mid-, and low-frequency words. In Webb, S. (Ed.), The Routledge Handbook of Vocabulary Studies (pp. 81–96). Routledge.

Ward, J., & Chuenjundaeng, J. (2009). Suffix knowledge: Acquisition and applications. System, 37, 461–469.

Webb, S. (2007). The effects of repetition on vocabulary knowledge. Applied Linguistics, 28, 46–65.

Webb, S., & Macalister, J. (2013). Is text written for children appropriate for L2 extensive reading? TESOL Quarterly, 47, 300–322.

Webb, S., & Rodgers, M. P. H. (2009). The vocabulary demands of television programs. Language Learning, 59, 335–366.

Webb, S., Sasao, Y., & Ballance, O. (2017). The updated Vocabulary Levels Test: Developing and validating two new forms of the VLT. ITL–International Journal of Applied Linguistics, 168, 34–70.

Wysocki, K., & Jenkins, J. R. (1987). Deriving word meanings through morphological generalization. Reading Research Quarterly, 22, 66–81.
