The main issue the present study attempts to address is how people acquire and generalize new morphological systems. This is a critical aspect of language competence: the ability to produce an unlimited number of expressions by combining a limited number of linguistic units (Diessel, Reference Diessel2004; Pinker & Jackendoff, Reference Pinker and Jackendoff2005). This creativity is reflected in the process of generalization across individual exemplars to which people are exposed that evidently emerges in morphological learning (Tamminen, Davis, Merkx, & Rastle, Reference Tamminen, Davis, Merkx and Rastle2012; Tamminen, Davis, & Rastle, Reference Tamminen, Davis and Rastle2015). For example, the vast majority of new English words (e.g., unreasonable) are created by combining a limited set of stems (e.g., reason) with affixes (e.g., un–, –able) based on certain rules (Algeo, Reference Algeo1991). The hallmark of this productivity lies in the capability to generalize newly learned morphemes (e.g., un–, –able) to create novel words (e.g., ungoogleable) in appropriate grammatical categories (e.g., “This is an ungoogleable question”).
It is generally accepted that children acquire their first language unconsciously. It has long been a subject of debate, however, whether adults learn a second language (L2) without awareness or intention. Krashen (Reference Krashen1981, Reference Krashen1985) claimed that language acquisition takes place through unconscious, implicit processing of input and that explicit learning plays a small role in the development of implicit knowledge. Schmidt (Reference Schmidt1990, Reference Schmidt and Schmidt1995, Reference Schmidt and Robinson2001) proposed that noticing is sufficient, and in some cases necessary, for language learning to take place, though incidental learning is possible when attention is focused on what is to be learned. Schmidt also proposed two levels of awareness regarding conscious and unconscious language learning: the level of noticing, where learners register some aspects of input, and the level of understanding, where learners recognize rules or regularities underlying the structure of the input. Explicit knowledge has been argued to foster and facilitate the development of implicit knowledge (Ellis, Reference Ellis2005, Reference Ellis and Rebuschat2015; Suzuki & DeKeyser, Reference Suzuki and DeKeyser2017), and conscious learning may lead to unconscious knowledge through proceduralization and automatization of explicitly learned knowledge (DeKeyser, Reference DeKeyser1997, Reference DeKeyser, VanPatten and Williams2015). Automatized explicit knowledge and implicit knowledge can be distinguished by the awareness criterion (Suzuki, Reference Suzuki2017; Suzuki & DeKeyser, Reference Suzuki and DeKeyser2017). Although both types of knowledge can be accessed rapidly, automatized explicit knowledge involves awareness about linguistic forms, while implicit knowledge does not.
The definitions of terms used in this article in regard to the process and product of learning are stated here. Implicit learning is a process whereby people acquire knowledge without the intention to learn and without awareness of the knowledge they have acquired, while explicit learning is a process in which people are aware that they are acquiring knowledge. In this context, incidental learning refers to learning without any intention to learn, but it does not necessarily exclude the awareness of what has been learned. Implicit knowledge refers to the product of learning (knowledge) people are unaware and unconscious of, whereas explicit knowledge signifies the conscious knowledge people are aware of and often able to verbalize (Grey, Williams, & Rebuschat, Reference Grey, Williams and Rebuschat2014; Hulstijn, Reference Hulstijn2005; Leow, Reference Leow2018; Leow & Zamora, Reference Leow, Zamora, Loewen and Sato2017; Rebuschat, Hamrick, Riestenberg, Sachs, & Ziegler, Reference Rebuschat, Hamrick, Riestenberg, Sachs and Ziegler2015; Rebuschat & Williams, Reference Rebuschat and Williams2012; Williams, Reference Williams, Ritchie and Bhatia2009).
Recent studies have shifted their interest toward measuring the knowledge acquired under different learning conditions (i.e., the product of learning), rather than the process of learning as the stage of encoding information (Hama & Leow, Reference Hama and Leow2010; Rebuschat et al., Reference Rebuschat, Hamrick, Riestenberg, Sachs and Ziegler2015; Suzuki, Reference Suzuki2017; Suzuki & DeKeyser, Reference Suzuki and DeKeyser2017). This is to determine whether the knowledge acquired during exposure or leaning phases is implicit or explicit in nature (Grey et al., Reference Grey, Williams and Rebuschat2014; Rebuschat, Reference Rebuschat2013; Rebuschat & Williams, Reference Rebuschat and Williams2012; Rogers, Reference Rogers2017; Rogers, Révész, & Rebuschat, Reference Rogers, Révész and Rebuschat2016).
A growing body of research has investigated what aspects of a L2 can be learned incidentally, including form–meaning mappings (Hama & Leow, Reference Hama and Leow2010; Kerz, Wiechmann, & Riedel, Reference Kerz, Wiechmann and Riedel2017; Leung & Williams, Reference Leung and Williams2011, Reference Leung and Williams2012, Reference Leung and Williams2014; Rebuschat et al., Reference Rebuschat, Hamrick, Riestenberg, Sachs and Ziegler2015; Williams, Reference Williams2005), word order (Grey et al., Reference Grey, Williams and Rebuschat2014; Rebuschat & Williams, Reference Rebuschat and Williams2012; Ruiz, Tagarelli, & Rebuschat, Reference Ruiz, Tagarelli and Rebuschat2018; Tao & Williams, Reference Tao and Williams2018; Williams & Kuribara, Reference Williams and Kuribara2008), case marking (Brooks & Kempe, Reference Brooks and Kempe2013; Grey et al., Reference Grey, Williams and Rebuschat2014; Robinson, Reference Robinson2005; Rogers, Reference Rogers2017, Reference Rogers and Leow2019; Rogers et al., Reference Rogers, Révész and Rebuschat2016), verb meaning (Paciorek & Williams, Reference Paciorek and Williams2015), and phonological stress patterns (Chan & Leung, Reference Chan and Leung2018; Graham & Williams, Reference Graham and Williams2016). These studies have examined learning under incidental learning conditions and the conscious status of the resultant knowledge after incidental exposure (i.e., the product of learning).
To date, however, no research has been conducted on incidental learning of L2 derivational morphology and its nature of acquired knowledge. Unlike inflectional morphology, derivational morphology creates new words and adds novel meanings in different grammatical categories due to its combinatory system containing stems and suffixes (Marslen-Wilson, Reference Marslen-Wilson and Gaskel2007; Tyler & Nagy, Reference Tyler and Nagy1989). Derivational processes and derived words make a significant impact on processing and expanding vocabulary among children, adults, and L2 learners (Carlisle, Reference Carlisle2000; Jarmulowicz, Reference Jarmulowicz2006; Silva & Clahsen, Reference Silva and Clahsen2008). Previous studies, however, have not examined how derivational morphological system is learned under incidental or explicit learning conditions. Marsden, Williams, and Liu (Reference Marsden, Williams and Liu2013) conducted priming research regarding recognition memory of suffixed nonwords; however, the meanings added to the suffixes they used were functional (e.g., singular or plural, present or past) rather than derivational (causing a change of word class). Marsden et al. exposed participants to suffixed nonwords (e.g., gatot), instead of sentences, to investigate the morphological representations under the priming paradigm. What is relevant to incidental learning was their finding that the generalization task using a picture-matching test (e.g., oral and visual presentation of smafot with three pictures) showed that participants learned the meanings of suffixes (e.g., –ot in gatot) even when their attention was oriented to the stems (e.g., gat– in gatot). The following two sections provide a brief introduction to some key concepts that are widely discussed in light of awareness assessment and learning conditions with a particular focus on delayed effects of learning.
Measurement of awareness
Researchers have used different procedures to operationalize and measure the construct of awareness. Studies concerned with the stage of encoding information (the process of learning) employ concurrent measures of awareness such as think-aloud protocols in which participants are requested to say aloud whatever they are thinking while performing experimental tasks (Bowles & Leow, Reference Bowles and Leow2005; Hama & Leow, Reference Hama and Leow2010; Rebuschat et al., Reference Rebuschat, Hamrick, Riestenberg, Sachs and Ziegler2015). One limitation to think-aloud protocols is the issue of reactivity influencing learners’ cognitive processes and accuracy of task performance. In contrast, in order to measure whether participants have acquired implicit or explicit knowledge (the product of learning), nonconcurrent retrospective verbal reports are conducted after testing (Rebuschat et al., Reference Rebuschat, Hamrick, Riestenberg, Sachs and Ziegler2015; Rebuschat & Williams, Reference Rebuschat and Williams2012; Williams, Reference Williams2005). Retrospective verbal reports prompt participants to verbalize any rules or patterns they might have noticed during the experiment. Knowledge is considered unconscious or implicit if participants demonstrate the effect of learning despite being unable to verbalize the underlying rule in the exposure items. One problem with this measurement, however, is that participants may fail to recall something after a long period of exposure and testing. Another drawback is that participants may not report knowledge because they lack the confidence to do so (Hamrick & Rebuschat, Reference Hamrick, Rebuschat, Rebuschat and Williams2012; Rebuschat, Reference Rebuschat2013; Rebuschat et al., Reference Rebuschat, Hamrick, Riestenberg, Sachs and Ziegler2015; Rebuschat & Williams, Reference Rebuschat and Williams2012). It is noteworthy that both measures of using think-aloud protocols and verbal reports rely on verbalization, which does not identify the construct of awareness that is not verbalized (Rebuschat et al., Reference Rebuschat, Hamrick, Riestenberg, Sachs and Ziegler2015).
In order to refine the measurement of the conscious status of knowledge, recent studies have adopted two types of subjective measures of awareness during testing phases: confidence ratings and source attributions (Dienes, Altmann, Kwan, & Goode, Reference Dienes, Altmann, Kwan and Goode1995; Dienes & Scott, Reference Dienes and Scott2005; Hamrick & Rebuschat, Reference Hamrick, Rebuschat, Rebuschat and Williams2012; Rebuschat, Reference Rebuschat2013; Rebuschat et al., Reference Rebuschat, Hamrick, Riestenberg, Sachs and Ziegler2015; Rogers et al., Reference Rogers, Révész and Rebuschat2016). Confidence ratings ask participants to indicate their level of confidence (e.g., no confidence, somewhat confident, very confident, or absolutely certain) for each decision they make during the testing phrase. Source attributions require them to identify the source of their judgment (e.g., guess, intuition, memory, or rule knowledge). Knowledge can be considered unconscious if participants’ confidence is unrelated to their accuracy (called the zero-correlation criterion), or if a negative correlation between confidence and accuracy is observed, which would provide evidence for better performance without conscious knowledge (Rebuschat, Reference Rebuschat2013). Knowledge can also be considered implicit if they believe themselves to be guessing when their classification performance is significantly above chance (called the guessing criterion). The intuition and guess attributions are both considered to correspond to unconscious knowledge (Dienes & Scott, Reference Dienes and Scott2005; Rebuschat, Reference Rebuschat2013).
For instance, Rogers et al. (Reference Rogers, Révész and Rebuschat2016) used the two types of subjective measures. Rogers et al. (Reference Rogers, Révész and Rebuschat2016) investigated whether novel morphological case markings can be learned by adults via incidental exposure and whether the resulting knowledge is implicit or explicit in nature. The study used a semiartificial language based on English sentences with Czech case marking. The testing phase included a grammaticality judgment test, along with subjective measures of awareness. At the end of the experiment, participants also provided retrospective verbal reports. The results of grammaticality judgment tests (GJT) showed that the experimental group demonstrated significantly better learning effects than the control group for accusative cases, but not for nominative cases. The verbal reports found that all participants noticed the changing morphological inflections during the training phase, although none of them were able to verbalize the underlying rule system. Nevertheless, the analysis of the subjective measures of awareness demonstrated that participants developed both implicit and explicit knowledge; confidence ratings suggested the conscious knowledge evidenced by above chance performance when participants chose the categories somewhat confident and very confident, while the analysis of source attributions revealed the unconscious knowledge evidenced by above chance performance only based on “intuition.”
Learning conditions and delayed effects
A number of studies investigating differences between implicit and explicit instruction have demonstrated that learning under explicit-type conditions is more effective than learning under implicit-type conditions (Goo, Granena, Yilmaz, & Novella, Reference Goo, Granena, Yilmaz, Novella and Rebuschat2015; Norris & Ortega, Reference Norris and Ortega2000; Spada & Tomita, Reference Spada and Tomita2010). For example, Robinson (Reference Robinson1996) compared the learning of English grammar rules by adult learners of English as a second language under four training conditions: implicit (explained to participants as a memory test), incidental (as an exercise in reading for meaning), rule-search (as an exercise in finding rules), or instructed (where participants were asked to read through the rules). The results of the GJT (e.g., “Across the street raced Tom” judged as being grammatical, and “On Saturday night danced Charlie” judged as being ungrammatical) showed that instructed learners outperformed the other groups. Similar results were obtained by Robinson (Reference Robinson1997) in which the dative alternation in English (e.g., “John donated the piano to the church” as grammatical, and “John donated the church the piano” as ungrammatical) was learned under four different conditions: implicit, incidental, enhanced (target items were boxed for each sentence), or instructed. Instructed learners were superior to learners in other conditions, in particular in generalizing the knowledge developed through training to new items.
Andringa and Curcic (Reference Andringa and Curcic2015) investigated how a morphosyntactic rule in Spanish is explicitly or implicitly acquired by adult native speakers of Dutch. The rule is called differential object marking according to which direct objects are preceded by the preposition a (“to”) if they are animate. In the 11-min instruction, both explicitly and implicitly instructed learners were presented with the audio of 52 target trials providing exposure to the rule. The difference between the two groups was that only the explicit group received the explanation of the target rule. In the explicit instruction, 2 target trials were replaced by screens explaining the rule to the participants in Dutch. The results showed that explicitly instructed learners were more accurate than implicitly instructed ones in the GJT. They conducted source attributions that confirmed that explicitly instructed learners based their decision on the rule 71% of the time, while implicitly instructed learners did so 12% of the time. In their study, however, participants in the implicit learning condition, as well as those in the explicit group, were informed in advance of the grammaticality test that the knowledge was tested after the instruction, which makes it hard to characterize their learning as implicit or even incidental.
Denhovska and Serratrice (Reference Denhovska and Serratrice2017) examined the acquisition of the gender agreement patterns in Russian by adult native speakers of English. Participants in the incidental learning condition were visually presented with Russian sentences along with English translations. They were asked to read the Russian sentences and translations without performing any other task. Participants in the explicit condition received a metalinguistic explanation of the gender agreement in Russian and were provided with example sentences. They were asked to memorize the agreement rule and were informed that they would be tested afterward. The training time for both conditions was the same (15 min). The results showed that participants in both conditions were similarly accurate in the GJT, although learners in the explicit condition performed significantly better than those in the incidental condition in the production task where they had to fill in the blanks with appropriate inflections.
Unlike the above four studies examining grammar acquisition, Hamrick and Rebuschat (Reference Hamrick, Rebuschat, Rebuschat and Williams2012) compared the learning effects between intentional and incidental learning conditions in the statistical word learning paradigm. Each condition had 30 native speakers of English, and participants in both conditions were exposed to 57 trials in which two images (e.g., panda and glass) were displayed on the screen and two pseudowords (e.g., houger and femod) were auditorily presented. The location of the image and the order of the pseudoword were not related. The only clue to the right referent of the word was to keep track of the co-occurrences of the images and the words across trials. Participants in the intentional learning condition were instructed to learn the meanings of the words and were told that they would be tested afterward. In contrast, participants in the incidental condition were not informed about the purpose of the experiment or about being tested afterward, and were asked to indicate instead how many objects on each slide were animate. In the testing phase, participants in both groups were presented with four pictures on the screen and a spoken pseudoword, and were required to select the appropriate referent as quickly and accurately as possible. Performance on the picture-matching task was regarded as the measure of learning. Awareness was measured by means of confidence ratings and source attributions. The results showed that although there was a clear learning effect for both groups, the learning effect was larger under intentional learning conditions. The zero-correlation criterion was met in the case of the incidental group, but it was not met for the intentional group. Source attributions with three categories (guess, intuition, and memory) revealed that both groups performed significantly above chance when choosing any of the three categories, including when the intentional group members based their decisions on guessing. This suggests that some unconscious knowledge developed even under intentional learning conditions.
As a longitudinal study, Morgan-Short, Steinhauer, Sanz, and Ullman (Reference Morgan-Short, Steinhauer, Sanz and Ullman2012) examined how explicit and implicit training affected the learning of an artificial language’s grammar, which was auditorily presented to participants using the GJT and electrophysiological (event-related potential) measures. Learners in the explicit condition were provided with metalinguistic explanation of the artificial language’s rules along with 33 example sentences, while those in the implicit condition received 127 example sentences only. The initial training time was 13.5 min for both groups. After the initial training, learners in both groups received additional practice in comprehension and production over 1 to 5 days until they reached low proficiency (accuracy significantly above chance) and then high proficiency (80% accuracy or more). Performance of the behavioral judgment task was not significantly different between the groups at either low or high proficiency, although event-related potential measures revealed that only the implicitly trained learners showed brain activity typical of native speakers at a high proficiency.
With respect to the delayed effects of learning, Tamminen et al. (Reference Tamminen, Davis, Merkx and Rastle2012) and Tamminen et al. (Reference Tamminen, Davis and Rastle2015) discussed the role of consolidation in language learning and linguistic generalization (i.e., a process of integration or generalization of newly learned knowledge, which develops slowly over a period of time). In their studies, native speakers of English were taught new words consisting of a familiar stem and a novel affix (e.g., sleepafe) with a definition (e.g., “sleepafe is a participant in a study about the effects of sleep”). The training phase included a typing task and a recall task that lasted roughly 1 hr. In the testing phase, the sentence congruency task and the recognition memory task were given. Reading aloud latencies were measured in the sentence congruency task, and the accuracy as to whether the word was trained or untrained was obtained in the recognition memory task. The results showed that although the accuracy of recognition declined a week later, the semantic congruency effect (i.e., longer reading latency for incongruent sentences compared to congruent ones) was observed a week after training. This suggests that generalization takes place following a delay between learning and testing. These studies, however, were not concerned with whether participants acquired implicit or explicit knowledge.
In a study on the effects of input- and output-based instruction on the learning of Spanish object pronouns, Morgan-Short and Bowden (Reference Morgan-Short and Bowden2006) found that input-based instruction in the interpretation test made no significant change from immediate to delayed tests. Output-based instruction, however, showed a significant loss, and the control group revealed a significant gain although the two experimental groups demonstrated significant gains from the pretests.
Grey et al. (Reference Grey, Williams and Rebuschat2014) investigated incidental learning of word order and case marking in a semiartificial language called Japlish, which followed the word order and case-marking rules of Japanese but uses English vocabulary. The verb was always at the end of the sentences and nouns were case marked for subject (–ga), object (–o), and indirect object (–ni; e.g., “Stacey-ga that picture-o painted,” “Those seeds-o Ann-ga parrot-ni fed”). During the exposure phase (approximately 20 min), participants were presented audio of 128 sentences during which they decided if the sentences were semantically plausible or not. Performance was assessed with the acceptability judgment task (AJT) examining the learning of word order and the picture-matching tasks (PMT), which tested the learning of case marking that were administered immediately after exposure and 2 weeks later. On the immediate tests, there was a significant learning effect on the AJT, but not on the PMT. The delayed tests, however, showed maintenance of the learning effects for the AJT and significantly improved performance for the PMT. It was assumed that consolidation played a role when the establishment of the form–meaning connections (between –ga, –o, –ni and subject, object, indirect object, respectively) was required for the PMT. They took subjective measures of awareness only for the AJT. Confidence ratings showed their performance was above chance when they felt very confident. Source attributions revealed above chance accuracy based on intuition.
The results from the above studies comparing different learning conditions have generally shown that learning under explicit conditions is superior to incidental conditions. Very few studies, however, have compared how the incidental and explicit learning of grammatical knowledge develops through a relatively short period of exposure, together with the measurement of awareness using confidence ratings and source attributions (see Andringa & Curcic, Reference Andringa and Curcic2015; Hamrick & Rebuschat, Reference Hamrick, Rebuschat, Rebuschat and Williams2012). Only a small number of studies, as shown above, were conducted to discover the difference in learning between immediate and delayed testing. Further, no previous research has compared the delayed effects of the learning of grammatical rules between incidental and explicit learning conditions. The present study addresses this gap by examining the learning of the L2 derivational system and its durability under different learning conditions.
The present study
The present research extended the study by Rogers et al. (Reference Rogers, Révész and Rebuschat2016) in several ways: to focus on derivational morphology, include an explicit learning group, and examine the delayed effects after 1 week. As noted earlier, derivational morphology is a key factor in developing the ability to generalize novel words and grammar (Marslen-Wilson, Reference Marslen-Wilson and Gaskel2007; Tamminen et al., Reference Tamminen, Davis, Merkx and Rastle2012, Reference Tamminen, Davis and Rastle2015; Tyler & Nagy, Reference Tyler and Nagy1989). The comparison between incidental and explicit learning also has pedagogical implications (Andringa & Curcic, Reference Andringa and Curcic2015; Ellis, Reference Ellis and Rebuschat2015; Hulstijn, Reference Hulstijn2005; Leow, Reference Leow2018), considering that in the current study the stimulus materials were based on real English suffixes and that participants were intermediate learners of English. Furthermore, it is worthwhile to examine the delayed effects of learningFootnote 1 or consolidation under incidental and explicit learning conditions (Grey et al., Reference Grey, Williams and Rebuschat2014; Morgan-Short & Bowden, Reference Morgan-Short and Bowden2006; Morgan-Short et al., Reference Morgan-Short, Steinhauer, Sanz and Ullman2012; Tamminen et al., Reference Tamminen, Davis, Merkx and Rastle2012, Reference Tamminen, Davis and Rastle2015), especially when the research is concerned with productivity and generalization of newly learned grammatical patterns.
In Rogers, Révész, and Rebuschat (Reference Rogers, Révész, Rebuschat and Rebuschat2015), participants were asked in the third experiment to repeat sentences and target foreign words aloud during exposure to promote awareness and induce a learning effect. This is because in the first and second experiments, participants did not demonstrate learning effects without repeating target foreign words (repeating sentences aloud was included in the second experiment). The current study excluded repeating sentences and target words for the incidental learning group due to the possibility of participants noticing the rule hidden in the exposure sentences. Considering that the present study’s experimental sentences were real English sentences with familiar words and one nonword keeping the basic word order, adding the task that directs participants’ attention to the target nonword would have likely made them more aware of the pattern in the target nonword.
Several studies regarding the learning of grammatical rules have measured reaction times (RTs). Although RT data are often less directly linked to the main research question, they serve as complementary measures for exploring the differences between participant groups or different structures (Jiang, Reference Jiang2012). RT data can also measure the speed and processing load to reach correct grammatical judgments, and can shed light upon the nature of linguistic knowledge that individuals possess (Gass & Mackey, Reference Gass and Mackey2007). Robinson (Reference Robinson1996) measured RTs in addition to learners’ accuracy of GJT and reported no significant difference between learners on implicit or instructed learning conditions. In Robinson (Reference Robinson1997), however, incidental learners were significantly slower than instructed learners in making correct grammaticality judgments on untrained sentences, and it was assumed that incidental learners needed additional time to find rules. Andringa and Curcic (Reference Andringa and Curcic2015) also used RT measures besides accuracy data and concluded, based on no significant difference between the two groups, that explicit knowledge of the target grammatical structure does not provide explicit learners with any speed advantage in deciding on correct answers.
The results on RT measures so far have thus been mixed. It can be assumed that there might be a trade-off between the strategies (conscious or unconscious) employed by both types of learners. If participants in the incidental learning condition attempt to look for rules to decide on the grammaticality of testing sentences, they tend to be slow in responding. However, if they use the implicit knowledge at least partially developed during exposure, they may have faster access to knowledge (Suzuki, Reference Suzuki2017; Suzuki & DeKeyser, Reference Suzuki and DeKeyser2017). Likewise, participants in the explicit learning condition can be very quick in accessing explicit rules taught in learning phases, but at the same time, applying the conscious rules to grammatical judgment may take time. The present study also attempts to address this issue by measuring RTs for different learning groups, which was not included in the study by Rogers et al. (Reference Rogers, Révész and Rebuschat2016), though self-paced reading tasks with RTs were used in some studies investigating the learning of case marking such as Rogers (Reference Rogers and Leow2019) and VanPatten and Smith (Reference VanPatten and Smith2018).
With a view to validate awareness assessment measures, the current experiment also included retrospective verbal reports at the end of the experiment, as well as two types of subjective measures of awareness during testing phases: confidence ratings and source attributions (Dienes et al., Reference Dienes, Altmann, Kwan and Goode1995; Dienes & Scott, Reference Dienes and Scott2005; Rebuschat, Reference Rebuschat2013; Rebuschat et al., Reference Rebuschat, Hamrick, Riestenberg, Sachs and Ziegler2015).
The aim of the current study is thus to compare incidental and explicit auditory learning of L2 derivational suffixes by measuring the accuracy and the RT of a GJT administered to Japanese learners of English at an intermediate level. A further aim was to explore whether the resultant knowledge was implicit or explicit in nature through the analysis of subjective measures of awareness during the testing phases, in addition to postexperimental verbal reports. The study also examined the delayed effects of learning by testing participants immediately after learning and 1 week later. In particular, the research aims to address the following four research questions:
1. To what extent are adult learners able to acquire L2 derivational morphology incidentally through auditory exposure?
2. To what extent is their performance comparable to that of explicit learners?
3. What is the nature of the acquired knowledge: implicit or explicit?
4. Are there any delayed effects of learning demonstrated by the different learning groups?
Method
Participants
For the current study, 60 Japanese university students were recruited as participants (43 females, 17 males) with an age range from 18 to 21 (M = 19.80, SD = 1.09). All participants were native speakers of Japanese and had studied English as a foreign language for more than 6 years.Footnote 2 Their scores of English proficiency tests administered in Japan were 400–470 at TOEFL ITP, 440–600 at TOEIC L&R, or Eiken Grade Pre-2 and Grade 2. Participants were approximately equivalent to A2/B1 levels in the Common European Framework of Reference for Languages based on the conversion table provided by the Ministry of Education, Culture, Sports, Science and Technology, Japan (2015).
Participants were paid for their participation. They filled out the consent forms on the first day (Day 1), at which time they were reminded that they had to participate again a week later (Day 8). The consent form also explained that the task was to perform an action after listening to a sentence (via headphones connected to the computer) and provided no information about the purpose of the study. Participants drew lots to be randomly assigned to three groups, with 20 participants in each group: incidental learning group, explicit learning group, and the control group.
Stimuli
The stimuli were 144 sentences including nonwords attached with 12 suffixes based on real English suffixes forming three grammatical categories (Noun: –mənt, –nəs, –ʃən [ʒən], –əti; Adjective: –fəl, –ɪʃ, –əl, –ɪk; Verb: –eɪt, –ən, –aɪz, –ɪfaɪ). The suffixes were selected from 32 major English suffixes reported in Harwood and Wright (Reference Harwood and Wright1956). The stems of the nonwords consisted of two syllables: consonant plus vowel plus consonant plus /ɪ/, /i/, /ɪs/, /ɪz/, /is/, /iz/, or /əs/. The consonant plus vowel plus consonant parts of the nonwords were constructed with reference to Noble (Reference Noble1961), ensuring a variation of vowels and consonants. The participants’ first language (Japanese) has similar derivational suffixes (Shibatani, Reference Shibatani1990; Tsujimura, Reference Tsujimura2007). For example, when the suffix –sa is attached to an adjective “yasashii” (kind), the derived form “yasashisa” (kindness) serves as a noun.
These nonwords were inserted into sentence frames designed for the three word classes. Noun frames were “She shows some __ at home,” “He shows his __ in the library,” “The teacher likes his __ at school,” and “The doctor likes to show his __.” Adjective frames were “She is very __ at home,” “He is very __ in the library,” “The teacher is very __ at school,” and “The doctor is a very __ person.” Verb frames were “She tries to __ her friend at home,” “He wants to __ his book,” “The teacher tries to __ at school,” and “The doctor wants to __ his patient.” The sentence frames mentioned above were thought to be appropriate for intermediate learners of English to understand while listening without written forms. All the stimuli are presented in Appendix A. Examples of sentences with three grammatical categories are shown in Table 1.
Forty-eight out of those 144 sentences were used to create an exposure list for the incidental group. Another 48 sentences were used as grammatical sentences for the testing phase. The remaining 48 sentences were made ungrammatical by inserting the nonwords into inappropriate categories in the sentences (e.g., “The doctor is a very bəfɪsnəs person”). Half of the testing sentences (24 sentences) were completely matched except for the suffixes between grammatical and ungrammatical sentences: “The doctor is a very bəfisfəl person” versus “The doctor is a very bəfɪsnəs person.” The other half (the remaining 24 sentences) had different stems and suffixes to prevent participants from paying too much attention to the suffix itself: “The teacher likes his səfɪʒən at school” versus “The teacher likes his hənɪsɪfaɪ at school.” Two versions of sentence lists (each including different 48 sentences) were prepared for the two testing phases (Day 1 and Day 8) and counterbalanced across participants.
These stimulus sentences were converted into audio files using text-to-speech software (Globalvoice English 3, HOYA). Half of them were created with an American male voice and the other half with a British female voice and were counterbalanced across grammatical categories for each learning and testing set. Postexperimental interviews revealed that no participant noticed the sound was synthetic speech. There are also several studies including Hirai and O’ki (Reference Hirai and O’ki2011) and Matsuda (Reference Matsuda2017) where the synthetic speech generated via the same software did not result in comprehension problems for L2 learners compared with natural human speech, and was found to be effective for language learning (particularly for nonproficient Japanese learners of English).
Procedure
The experiment consisted of two phases: the learning phase and the testing phase. In the learning phase, the incidental learning group listened to 48 grammatical sentences twice (96 sentences in total). After they listened to each sentence, participants were requested to choose one of the two pictures on the computer screen that they thought matched the content of the sentence. Whether their choice was correct or not was indicated by a different sound: a chime or a beep. The provision of this kind of feedback presumably prompted the semantic processing of the sentences while avoiding raising awareness as to what they were learning. Furthermore, their choice could be made just on the basis of the English words in the sentences such as “at home,” “in the library,” “at school,” and “doctor,” which was intended to prevent participants paying much attention to the novel words and the suffixes. The order of the presentation of the sentences was randomized for each participant, and the learning period lasted about 15 min. Example sentences and pictures are illustrated in Figure 1 (all the sentences and the pictures used in the learning phase for the incidental group are presented in Appendix B).
The explicit learning group received an explanation in Japanese (using PowerPoint slides) that English has three types of suffixes that form nouns, adjectives, and verbs. Participants then listened to auditory examples of 12 suffixes and sentences including the suffixes (three grammatical categories × four suffixes) with explanations written on PowerPoint slides. No suffix or sentence was presented in the written mode. The explanation was repeated three times for about 15 min, almost the same amount of time for the incidental learning group. Participants were not allowed to go back to the previous slides. All the slides presented in the learning phase for the explicit condition are shown in Appendix C.
Meanwhile, the control group had no learning phase. The stimulus presentation was controlled by SuperLab 5.0 software with a response pad (RB-740) (Cedrus Corporation).
Testing phase
The three groups took the GJT twice: immediately after the learning phase (Day 1) and a week later (Day 8). Participants listened to one of the two versions of 48 sentences including new nonwords with the suffixes, and judged whether each sentence was grammatically correct or incorrect as quickly and accurately as possible by pressing designated buttons on the response pad. Participants had to complete this task without being given any information about which aspect of the sentence was grammatical or not. It was assumed, however, that the use of basic English word order and the highly familiar words used in the experimental sentences (except for the suffixed nonwords) prompted similar expectations among the three groups regarding a grammaticality judgment (Hamrick & Sacks, Reference Hamrick and Sacks2018). The sentences were presented in randomized order for each participant. Half of the sentences were grammatically correct (e.g., “The doctor likes to show his bəfɪzmənt”), and the other half were grammatically incorrect (e.g., “The teacher tries to səfɪsɪk at school”). There was no feedback in the testing phase. The correct response rates and RTs were obtained during the testing phase.
After each trial, participants were asked to indicate on a 4-point scale how confident they were in their decision (no confidence, somewhat confident, very confident, or absolutely certain) and what the basis of their judgment was (guess, intuition, memory, or rule). Explanations about what the English terms in the 4-point scale meant were provided in Japanese. No participant seemed to be confused about the differentiation between the terms. Even the distinction between guessing and intuition was relatively clear to participants. The phenomenology of intuition was labeled as “chokkan” in Japanese to mean knowing that a judgment is correct, but not knowing why, while guessing was labeled as “atezuppou” in Japanese to mean not knowing either (Dienes & Scott, Reference Dienes and Scott2005).
Different stimulus sentences were used on Day 1 and Day 8. After the testing phases on Day 8, participants filled out a debriefing questionnaire that asked whether they might have noticed any rules or patterns in the sentences they had heard, and if so, when they noticed them (during the training, the first testing phase, or the second testing phase). This was followed by an oral interview that elaborated on the questionnaire. Table 2 summarizes the overall design of the experiment.
Results
GJT
Accuracy
The accuracies for Day 1 were 56.4% (SD = 6.70) for the incidental group, 56.8% (SD = 4.88) for the explicit group, and 51.5% (SD = 5.48) for the control group. The accuracies for Day 8 were 50.8% (SD = 8.11) for the incidental group, 56.4% (SD = 7.81) for the explicit group, and 52.4% (SD = 5.44) for the control group. The accuracies were analyzed by a logit mixed-effects model with crossed random effects for subject and item (Linck & Cunnings, Reference Linck and Cunnings2015) using the lme4 package (version 1.1-18-1) of R (version 3.5.1; R Core Team, 2018). The dependent variable was a GJT performance specified by a binary outcome. The model included day (Day 1 and Day 8) and group (incidental, explicit, and control) as fixed effects. Because this maximal model failed to converge, the model was simplified by removing by-subject random slopes for day, which produced a stable model. The results are reported in Table 3. There were the significant fixed effect of group and the marginally nonsignificant fixed effect of day, but their interaction was not significant.
Note: Formula: acc~Day*Group + (1|Subject) + (Group|Item); family = binomial.
* p < .05.
Learning was assessed by A-prime (A’) scores on the GJT.Footnote 3 The A’ scores were calculated based on the proportions of hits (correct acceptance of a grammatical sentence) and false alarms (incorrect acceptance of an ungrammatical sentence). They provide a more sophisticated measure than simply reporting accuracy scores for the GJT according to signal detection theory (Grier, Reference Grier1971; Linebarger, Schwartz, & Saffran, Reference Linebarger, Schwartz and Saffran1983; Macmillan & Creelman, Reference Macmillan and Creelman2005; Snodgrass, Levy-Berger, & Haydon, Reference Snodgrass, Levy-Berger and Haydon1985). An A’ score of .50 was taken to mean a chance performance. Table 4 shows A’ scores for three groups on Days 1 and 8. The analysis of the scores was conducted using a linear mixed-effects model with only the subject as a random effect. The fixed effects were day and group. A fixed effect was considered significant if the absolute value of the t statistic was greater than or equal to 2.0 (Gelman & Hill, Reference Gelman and Hill2007; Linck & Cunnings, Reference Linck and Cunnings2015). The results are summarized in Table 5.
Note: Formula: aprime~Day*Group + (1|Subject). *|t| > 2.0, indicating a significant effect (Gelman & Hill, Reference Gelman and Hill2007).
The main effects of day and group and their interaction were significant. To interpret this interaction, Bonferroni pairwise comparisons were performed. The analysis produced the following results. The incidental group and the explicit group performed significantly better than the control group (p = .028, d = .79, 95% confidence interval; CI [.007, .158], and p = .015, d = .97, 95% CI [.014, .165], respectively) on Day 1. On Day 8 the explicit group performed significantly better than the incidental group (p = .041, d = .75, 95% CI [.002, .157]) and the control group (p = .048, d = .87, 95% CI [.001, .155]). Regarding the effect of time, the performance of only the incidental group significantly declined from Day 1 to Day 8 (p = .016, d = .65, 95% CI [.014, .130]), while the explicit and the control groups did not change their performance significantly after a week interval (p = .974, d = .01 and p = .663, d = .13, respectively).
RT
The analysis was based on correct responses to the testing items only. Participants’ RTs were measured from the end of the sentences until the participant pressed the button. Outlying RTs, at cutoff limits of ±2.5 SD from each participant’s mean, were removed (2.01% of the data were removed). Table 6 presents RTs for three groups on Days 1 and 8. The analysis of RT was conducted using a linear mixed-effects model with crossed random effects for subject and item with day and group as the fixed effects. A fixed effect was considered significant if the t value was above 2.0 (Gelman & Hill, Reference Gelman and Hill2007; Linck & Cunnings, Reference Linck and Cunnings2015). The results were summarized in Table 7. Only the effect of day was found to be significant, whereas the effects of group and the interaction between day and group were not found to be significant.
Note: Formula: RT~Day*Group + (Day|Subject) + (Group|Item). *|t| > 2.0, indicating a significant effect (Gelman & Hill, Reference Gelman and Hill2007).
Retrospective verbal reports
The reports were composed of what participants wrote on the debriefing questionnaires and also their oral interviews in which participants were prompted to describe whatever they noticed during or after the experiments. No one in the control group could verbalize or describe anything related to the English derivational system. Participants made comments such as “The same (or similar) sentences were heard,” “The same words were used in the sentences,” “The same subjects (he, she, doctor, teacher) were repeated,” “The word order was the same,” and “The voices were balanced between male and female.”
In the incidental group, 4 out of the 20 participants reported that they had decided the grammaticality based on the endings of words during the testing phases, although none of them could verbalize any correct rules or appropriate patterns included in the training and testing sentences. This means they might have possessed awareness at the level of noticing, but did not demonstrate awareness at the level of understanding (Schmidt, Reference Schmidt1990, Reference Schmidt and Schmidt1995, Reference Schmidt and Robinson2001).Footnote 4 Their accuracies on the GJT were 56.3%, 56.3%, 62.5%, and 64.6%. They were all better than the average of the control group (51.5%) and the same or above their group mean (56.4%). This indicates that awareness, even at the level of noticing, facilitates learning (Rogers, Reference Rogers2017; Rogers et al., Reference Rogers, Révész and Rebuschat2016; Schmidt, Reference Schmidt1990, Reference Schmidt and Schmidt1995, Reference Schmidt and Robinson2001). These verbal reports should be viewed with caution, however, given the 1-week time difference between the learning phase and the administration of the verbal reports. Other comments included “The same (or similar) sentences were heard,” “The same words are used in the sentences,” “The same subjects and places were used,” “The voices were balanced between male and female,” “I was focusing on familiar words,” “I was focusing on verbs,” “There were one or two unfamiliar words,” and “Place names were often at the ends of the sentences.”
In contrast, most of the participants in the explicit group mentioned something regarding the derivational system or the word endings. Examples included “I paid attention to word endings” and “I thought about word classes of unfamiliar words.” Other comments included “The same (or similar) sentences were heard,” “The word order was the same most of the time,” “The voices were balanced between male and female,” “I was thinking about word classes,” “Familiar words were used,” “Many adjectives were used,” and “I was focusing on word order.”
Generally, participants did not report much in the debriefing session, arguably because they lacked the confidence in judging the sentence grammaticality correctly as evidenced by the very low confidence level in the ratings reported below. Meanwhile, no participant mentioned the similarity of the derivational suffixation between English and Japanese. The majority of the participants’ comments seem to reflect the simple sentence structures containing familiar words used in the experimental materials.
Subjective measures of awareness
Whether the knowledge acquired during the learning phases is implicit or explicit was assessed using two types of subjective measure of awareness: confidence ratings and source attributions. Binominal tests were used to determine whether performance was above chance (Jackson, Reference Jackson2018; Marsden et al., Reference Marsden, Williams and Liu2013).
Confidence ratings
Table 8 reveals that the incidental group performed significantly above chance when they reported to be somewhat confident and to have no confidence only on Day 1. Yet the explicit group performed significantly above chance when they were somewhat or very confident on both Days 1 and 8. A logit mixed-effects model was used to compare the relationship between accuracy and confidence for the incidental group at Day 1 and the explicit group at Days 1 and 8, after confidence ratings were regrouped into two categories: less confidence (1 and 2) and more confidence (3 and 4; Rogers, Reference Rogers2017). Accuracy was specified as a binary outcome with confidence level as a fixed effect. The results show that in all three cases confidence and accuracy was significantly related (p < .001). This negative correlation (a higher accuracy with less confidence) did not satisfy the zero-correlation criterion (Dienes & Scott, Reference Dienes and Scott2005), but may suggest better performance with unconscious knowledge in the opposite way (Rebuschat, Reference Rebuschat2013).
Source attributions
As can be seen from Table 9, learners in the incidental condition performed significantly above chance when they based their decisions on guessing, intuition, and rule knowledge only on Day 1. An above chance performance when choosing “Guess” met the guessing criterion for implicit knowledge, indicating those learners developed some unconscious knowledge. On the other hand, learners in the explicit condition performed significantly above chance when based on memory and rule knowledge on Day 1, indicating their knowledge was largely conscious. However, they performed above chance when basing decisions on intuition, memory, and rule knowledge on Day 8. Their above-chance performance when choosing “Intuition” implies that they developed at least some unconscious knowledge (Dienes & Scott, Reference Dienes and Scott2005; Rebuschat, Reference Rebuschat2013).
Discussion
The present study investigated the effects of incidental and explicit learning conditions of the English derivational system by Japanese learners of English and the nature of the knowledge they acquired immediately after learning as well as a week later. The findings of the present research with regard to the four research questions are discussed in this section.
Research Question 1: To what extent are adult learners able to acquire L2 derivational morphology incidentally through auditory exposure?
The incidental learners exhibited significant learning effects, at least immediately after the brief auditory exposure to the sentences in the learning phase. Because new nonwords were used in the test items, the participants presumably assumed that it was something about the nonword that was crucial to their decision. Therefore, unlike the training task, attention was drawn to the nonwords (possibly the suffixes) in this case, to the extent that they could be segmented without learners’ awareness. In addition, the exposure sentences were composed of familiar words (except for the targeted nonwords) in the basic English word order, as many participants mentioned in their retrospective verbal reports. This may have provided salience to nonwords, including the target structure (Ellis, Reference Ellis2016; Endress, Scholl, & Mehler, Reference Endress, Scholl and Mehler2005; Romberg & Saffran, Reference Romberg and Saffran2013; Schmidt, Reference Schmidt1990). Furthermore, as noted in the Stimuli section, the participants’ first language (Japanese) also has a similar derivational system to English. Implicit learning of a certain system in a L2 is more likely to occur if that system is present in their first language (Leung & Williams, Reference Leung and Williams2014).
With respect to the limited amount of learning demonstrated in the current research (i.e., 56.4% and 56.8% for incidental and explicit learners, respectively, immediately after learning, and 56.4% for explicit learners after a week), the current study does not differ much from similar studies reporting the accuracy of incidental learning. Grey et al. (Reference Grey, Williams and Rebuschat2014) reported the average accuracy of 57.5% for the AJT and 56.3% for the delayed PMT. In Rogers et al. (Reference Rogers, Révész and Rebuschat2016) the average accuracy of the GJT was 55.4%. The similar amount of learning in the present study was notable considering that participants were found to be less confident and had comparatively higher degrees of unconscious knowledge compared to participants of the two previous studies. Thus, language acquisition may arguably take place like this at least in the initial stages of learning: a little above chance performance accumulates over time and thus develops a more fine-tuned learning system with a higher accuracy.
Research Question 2: To what extent is their performance comparable to that of explicit learners?
Considering the immediate measurements of learning, performance of the incidental learners was on par with that of the explicit learners. It was not the case, however, after a 1-week delay where incidental learners declined to a chance-level performance, while explicit learners still remained reliable in that they performed significantly better than the incidental and the control groups. The plausible reason for the difference regarding the delayed effects will be discussed in the response to the fourth research questions below.
One might wonder why the explicit learners were far from perfect (56.8% accuracy). There are three aspects of their learning conditions that seem relevant to this question. First, learners in the explicit condition did not receive any trials to which they had to respond intentionally. Andringa and Curcic (Reference Andringa and Curcic2015), for instance, provided explicitly instructed learners with 56 trials in which they performed PMT. They reached the accuracy of 83%. Explicit learners in the current study just saw the rule explanation on the computer screen and listened to the suffixes and sentences including suffixed nonwords. Second, the number of sentences they heard was small: 36 sentences compared to 96 sentences for the incidental condition despite the same amount of time spent on learning. Third, learning via the aural mode rather than written input could have been cognitively challenging throughout the course of learning for participants (Hama & Leow, Reference Hama and Leow2010; Leow, Reference Leow1995), in particular for Japanese learners of English who are arguably less proficient at understanding spoken English because they are, for the most part, taught English by reading and writing (Watanabe, Reference Watanabe1988). The final possibility appears to be able to account for the generally low performance of all the participants in the present study.
Research Question 3: What is the nature of the acquired knowledge: Implicit or explicit?
Incidental learners were found to develop implicit knowledge given their lack of confidence and their judgment based on guess and intuition. The negative correlation between confidence and accuracy caused by a large number of choices of the “less confidence” category may be indicative of better performance with less confidence (Rebuschat, Reference Rebuschat2013). At the same time, their performance at above chance levels based on rule knowledge suggests they also developed some explicit knowledge, although none of the participants were able to describe the derivational rule in the retrospective verbal reports. Considering that participants in the control group did not possess this kind of explicit knowledge, the explicit knowledge developed by incidental learners was arguably created through incidental exposure in the learning phase (Grey et al., Reference Grey, Williams and Rebuschat2014; Rebuschat et al., Reference Rebuschat, Hamrick, Riestenberg, Sachs and Ziegler2015; Rebuschat & Williams, Reference Rebuschat and Williams2012; Rogers et al., Reference Rogers, Révész and Rebuschat2016). Furthermore, it was found that the nature and durability of the resultant explicit knowledge seems to differ substantially between the two types of learners, given that the explicit knowledge immediately developed by incidental learners diminished afterward, but the knowledge possessed by explicit learners persisted throughout the 1-week experiments. The reason for the learners’ difference in the robustness of explicit knowledge is not clear, but it is worth noting that only the explicit learners seemed to demonstrate the consolidation effects or to engage in conscious hypothesis testing, as will be mentioned later in the discussion of Research Question 4.
Explicit learners were found to possess conscious knowledge both in the immediate and the delayed tests. Interestingly enough, explicit learners also developed unconscious knowledge 1 week later evidenced by their reliance on intuition (DeKeyser, Reference DeKeyser, VanPatten and Williams2015; Ellis, Reference Ellis2005). Hamrick and Rebuschat (Reference Hamrick, Rebuschat, Rebuschat and Williams2012) showed, too, that learners in an intentional learning condition developed some unconscious knowledge even immediately after exposure, evidenced by their above chance performance based on intuition.
It should be noted here that L2 learners’ intuition is different in nature from native speakers’ intuition, which develops to the level of automaticity after many years of exposure. A limited amount of exposure does not enable L2 learners to develop the intuition that native speakers possess (Leow & Hama, Reference Leow and Hama2013; Rogers, Reference Rogers2017). In light of this, Rogers (Reference Rogers2017) interpreted intuition as indicative of low levels of awareness (i.e., explicit knowledge). However, these measures were developed in the context of artificial grammar learning where there is also low levels of exposure. Therefore, in the present results, based on Dienes and Scott (Reference Dienes and Scott2005) and Rebuschat (Reference Rebuschat2013), the intuition attribution was considered to belong to unconscious knowledge.
The two types of learners were found to not be significantly different in the speed with which they accessed the knowledge they had acquired. The results of RT measures only showed that both types of learners responded faster a week later than the first day. As there was not a significant effect of the group or its interaction with time, it is not easy to explain why no significant differences in RT was detected between incidental and explicit conditions. Studies have suggested that incidental learners can be quick to access their unconscious knowledge when they possess implicit knowledge or can be slow if they attempt to find rules, whereas explicit learners may be quick to rely on their conscious knowledge explicitly explained beforehand, even though they do not possess implicit knowledge deployed to make rapid access possible (Robinson, Reference Robinson1997; Suzuki & DeKeyser, Reference Suzuki and DeKeyser2017). In the present study, given that only the explicit learners showed learning effects a week later, their judgment (which was found to be above chance) based on intuition, in addition to memory and rule knowledge on Day 8, might have contributed to their faster access to their knowledge.Footnote 5 Decisions based on intuition reflecting implicit knowledge are likely to enable rapid access to unconscious knowledge (Suzuki & DeKeyser, Reference Suzuki and DeKeyser2017).
Another question is concerned with the relationship between the grammatical cues and suffixes the incidental group learned immediately after exposure. It seems that participants in the incidental group learned nonadjacent dependencies between the grammatical cues and suffixes over an intervening nonword (Gomez, Reference Gomez2002; Mintz, Reference Mintz2002, Reference Mintz2003; Romberg & Saffran, Reference Romberg and Saffran2013; St. Clair, Monaghan, & Christiansen, Reference St. Clair, Monaghan and Christiansen2010). For example, in the learning phase, they may have simply been learning the invariant relationships between “his” and “nəs,” “very” and “fəl,” and “to” and “aɪz.” In the testing phase, they judged the sentence including the association, which broke this invariance such as the one between “his” and “aɪz” as ungrammatical above chance. Whether the participants in the present study did or did not learn the semantic functions of the suffixes cannot be known. Marsden et al. (Reference Marsden, Williams and Liu2013) demonstrated, however, that after brief exposure to suffixed nonwords (e.g., sifedec presented orally and visually), surface forms of suffixes were learned without adding meanings to stems or suffixes. Furthermore, their subsequent experiments showed that when the meaning was given to either the stem or the suffix, the meaning of the other could be learned partly incidentally, which may indicate the learning of form–meaning connections without attention to target structures. Whether the nonadjacent dependency learning seen in the current study leads to the learning of grammatical categories is an interesting but difficult question to tackle (Mintz, Reference Mintz2002, Reference Mintz2003; St. Clair et al., Reference St. Clair, Monaghan and Christiansen2010). Given that the participants in the present experiment were intermediate-level learners of English (as detailed in the Method section), it seems highly likely that they already possessed a sense of grammatical categories in English. They might have known that words with different natures (i.e., something to distinguish between nouns, adjectives, and verbs) follow the different grammatical units (“his” and “some” vs. “very” vs. “wants to” and “tries to”).
Research Question 4: Are there any delayed effects of learning demonstrated by the different learning groups?
To reiterate some points mentioned above, incidental learners were found to not retain their learning for a week. This indicates that they may need additional exposure, for example, on the day after the immediate learning or at least another time within the week. Nevertheless, the learning gained by explicit learners was found to endure, and part of the resulting knowledge became unconscious later on. These results corroborate previous observations that explicit types of instruction are more efficient and the effects are more durable than implicit types (Andringa & Curcic, Reference Andringa and Curcic2015; Ellis, Reference Ellis and Rebuschat2015; Norris & Ortega, Reference Norris and Ortega2000; Spada & Tomita, Reference Spada and Tomita2010). Although these previous studies were not concerned with whether implicitly instructed participants were really unaware of what they had learned, the current research provides further evidence that explicit learning helps to develop more reliable knowledge.
The finding that the explicit knowledge gained by explicit learners became partly unconscious at the time of delayed testing is in line with the interface position, which postulates that explicit knowledge, with a certain degree of proceduralization and automatization, impacts and facilitates implicit knowledge (DeKeyser Reference DeKeyser, VanPatten and Williams2015; Suzuki & DeKeyser, Reference Suzuki and DeKeyser2017). This finding may also be accounted for by consolidation effects (Grey et al., Reference Grey, Williams and Rebuschat2014, Tamminen et al., Reference Tamminen, Davis, Merkx and Rastle2012, Reference Tamminen, Davis and Rastle2015). The question arises as to whether consolidation took place in the testing phase for learners in explicit learning conditions. In order to explore this issue, the test phase (48 sentences presented) on Day 1 was divided into two sessions: the first half of the test (24 sentences) and the second half of the test (24 sentences).Footnote 6 The accuracy of the incidental group was 56.5% for the first session and 56.3% for the second session, with no significant difference between the sessions, t (19) = 0.068, p = .946, d = .02, 95% CI [6.20%, 6.61%]. The accuracy of the explicit group was 53.5% for the first session and 60.0% for the second session, with no significant difference between the sessions, either, t (19) = 1.656, p = .114, d = .65, 95% CI [1.70%, 14.6%]. However, participants in the explicit condition revealed a medium effect size (d = .65; Plonsky & Oswald, Reference Plonsky and Oswald2014), in contrast with the very small effect size (d = .02) shown by participants in the incidental condition. There is some indication that participants in the explicit condition were improving their performance during the test phase; here they might have been engaging in conscious hypothesis testing in the testing phase (Grey et al., Reference Grey, Williams and Rebuschat2014; Tamminen et al., Reference Tamminen, Davis, Merkx and Rastle2012, Reference Tamminen, Davis and Rastle2015). This might partially explain the reason for the explicit learners’ learning durability. The medium size effect for the explicit group diminished on Day 8, t (19) = 0.066, p = .948, d = .02, 95% CI [6.35%, 6.77%], with their accuracy between the first half (56.5%) and the second half (56.3%). It seems that after the consolidation of their explicit knowledge on Day 1, learners in the explicit condition developed the stable knowledge of the derivational rule that was employed throughout the course of the test. Given the explicit learners’ potential awareness raising during the test phase, one would expect the reliance on explicit knowledge to increase from Day 1 to Day 8.Footnote 7 To examine this issue, proportions of the correct responses for explicit learners across the four source categories (guess, intuition, memory, and rule) were compared between Day 1 and Day 8. None of the four categories revealed a significant difference, although the responses for the guess category approached that level (28.1% on Day 1 and 33.9% on Day 8); t (19) = 1.95, p = .066, d = .24, 95% CI [0.43%, 12.0%]. This indicates that participants in the explicit learning conditions did not change the extent of the reliance on explicit knowledge, even though they might have improved their performance in the testing phase on Day 1.
Morgan-short et al. (Reference Morgan-Short, Steinhauer, Sanz and Ullman2012) demonstrated that in terms of neural measures, implicit training, as opposed to explicit training, leads to brain processing typical of native speakers, both on the first day and 1 to 5 days after instruction. Morgan-short et al. used almost the same amount of time in implicit and explicit training in the initial instruction of their experiment as the present research (13.5 min), and supplemented it by a considerable amount of practice. Performance of learners of both implicit training and explicit training were found to not differ in terms of behavioral measures at both times. This took place presumably because the participants in both groups in their study received extensive training over 1 to 5 days, and learners in the implicit condition were most likely aware of the grammar rules they were being trained on.
Grey et al. (Reference Grey, Williams and Rebuschat2014) found the maintenance of learning effects of word order 2 weeks later, even without additional exposure to the stimulus sentences. The durability of the learning shown in their study might be associated with the observation that 68% of their learners were able to state the word order rule for simple sentences on the debriefing questionnaires, and that only the learners who reported to be very confident in the confidence ratings performed the AJT significantly above chance immediately after exposure. Regarding case marking, 38% of Grey et al.’s participants correctly stated case-marking rule for –ga and –0, and 35% for –ni. The significant difference of case-marking learning between those conscious learners and those who did not report case-marking rules was not found with immediate testing, but did emerge at delayed testing. In contrast, no one in the incidental group in the current study stated the correct rule of the derivational suffixes in the debriefing session and the above chance level performance was limited to the learners who reported having no confidence or being somewhat confident. This comparison suggests that reportable knowledge and confidence may be key factors in maintaining or improving learning effects.
The finding in the current research that incidental learning resulted in similar learning effects to explicit learning, at least immediately after learning, may suggest the potential of implicit types of learning in acquiring L2 derivational systems. Because the test items were not included in the learning items, this made it possible to test learners’ abilities to generalize the acquired knowledge to new items they encountered. Whereas explicit learners generalize newly learned rules persistently, presumably through a consolidation process, even incidental learners can develop their ability to generalize their acquired knowledge immediately after exposure to input. How to make it durable is a crucial issue to be explored in future work. From a totally different perspective, the present results may lead to another interpretation. Neither of the types of learning is particularly effective (no more than 60% of average accuracy) in the absence of meaningful input and/or practice opportunities. A long period of training with the large amount of practice is needed to make skills develop (DeKeyser, Reference DeKeyser1997, Reference DeKeyser, VanPatten and Williams2015).
There are several other issues that should be addressed in future research. First, the GJT used in the study did not tap into the production process of language. Although the implicit form of knowledge seems difficult to exploit in generation tasks (e.g., Jimenez, Mendez, & Cleeremans, Reference Jimenez, Mendez and Cleeremans1996), it would be interesting to investigate whether the knowledge acquired under incidental or explicit conditions can transfer to production ability. Second, the present study did not consider individual differences, such as working memory, procedural memory, and personality (e.g., Denhovska & Serratrice, Reference Denhovska and Serratrice2017; Denhovska, Serratrice, & Payne, Reference Denhovska, Serratrice and Payne2016; Jackson, Reference Jackson2018; Tagarelli, Ruiz, Vega, & Rebuschat, Reference Tagarelli, Ruiz, Vega and Rebuschat2016), which may influence different aspects of incidental and explicit learning. Finally, the current experiment did not examine the impact of prior knowledge about the target rule participants may have possessed. Instead, it compared the results of learning groups with those of the control group who had no learning phase. It is plausible to see the exposure materials based largely on a real L2 that participants are learning as a pedagogical advantage that secures ecologically valid forms of learning. An interesting avenue for future work would be to investigate how the derivational system of a completely new language can be learned under an incidental or explicit learning condition.Footnote 8
Conclusion
The current study has demonstrated that L2 derivational morphology can be auditorily learned, either incidentally or explicitly, immediately after exposure, and that learning can be maintained for a longer time only when learned explicitly. Incidental learners were found to develop primarily unconscious knowledge, whereas explicit learners were found to rely principally on conscious knowledge, part of which becomes unconscious later on.
New evidence presented in this study for incidental learning of derivational morphology and the consolidation effects gained through explicit knowledge adds a theoretical contribution to the growing literature on incidental and explicit learning of L2 grammar. The findings of this study utilizing a real L2 system as experimental stimuli may have direct pedagogical implications for L2 learning. Explicit instruction of grammatical rules may be more effective and durable than incidental learning, at least only with a one-time exercise. Incidental learning, albeit with immediate effects, seems to require extensive exposure over time to be effective.
Acknowledgments
I thank Associate Editor Dr. Annie Tremblay and three anonymous reviewers for their helpful comments on earlier versions of the manuscript. I am grateful to Dr. John Williams for his advice on the experimental procedure. This research was supported by a grant from the Japan Society for the Promotion of Science (No. 15K02768).
Supplementary Material
To view the supplementary material for this article, please visit https://doi.org/10.1017/S0142716419000304