Hostname: page-component-745bb68f8f-hvd4g Total loading time: 0 Render date: 2025-02-07T05:58:15.184Z Has data issue: false hasContentIssue false

LEARNING CONDITION, LINGUISTIC COMPLEXITY, AND FIRST LANGUAGE TRANSFER IN SEMIARTIFICIAL LANGUAGE LEARNING

A CONCEPTUAL REPLICATION AND EXTENSION OF TAGARELLI ET AL. (2016)

Published online by Cambridge University Press:  22 March 2021

Jianwu Gao
Affiliation:
Capital Normal University
Shuang Ma*
Affiliation:
Capital Normal University
*
*Correspondence concerning this article should be addressed to Shuang Ma, Department of English Language and Literature, Capital Normal University, Haidian District, Beijing100089, China. E-mail: s.ma5384@cnu.edu.cn
Rights & Permissions [Opens in a new window]

Abstract

This study explored the interaction between learning conditions, linguistic complexity, and first language (L1) syntactic transfer in semiartificial grammar learning by conceptually replicating and extending Tagarelli et al. (2016). We changed the L1 background, elicited production data during debriefing, and added a binary mixed-effects logistic regression analysis to compare variability at learner and item levels with group-level variation on exposure condition, linguistic complexity, and their interaction. Our results replicated those of the original study regarding the comparative efficacy of explicit instruction; however, we also found a condition × complexity interaction absent in the original study. Debriefing sentence-production data suggest that the changed L1-L2 typological distance may have leveled off the advantage of explicit instruction in the learning of the complex V2-VF structure. Finally, our mixed-effects modeling analysis revealed that variability at learner and item levels accounted for a larger proportion of the variance of the outcomes than all the predictors combined.

Type
Research Article
Open Practices
Open materials
Copyright
© The Author(s), 2021. Published by Cambridge University Press

INTRODUCTION

The debate over the effect of incidental and explicit learning conditions has been one of the central subjects in the field of Second Language Acquisition (SLA) (Ishikawa, Reference Ishikawa2019; Leow, Reference Leow2019; Rebuschat, Reference Rebuschat2015). Studies have generally showed an advantage for explicit over implicit learning conditions (for meta-analyses see, e.g., Goo et al., Reference Goo, Granena, Yilmaz, Novella and Rebuschat2015; Norris & Ortega, Reference Norris and Ortega2000; Spada & Tomita, Reference Spada and Tomita2010; for a different view see Ellis et al., Reference Ellis, Loewen, Elder, Erlam, Philp and Reinders2009); however, recent meta-analytical research (e.g., Kang et al., Reference Kang, Sok and Han2018) and psycholinguistic research (e.g., Morgan-Short et al., Reference Morgan-Short, Steinhauer, Sanz and Ullman2012) have enriched, challenged, or even reversed the conclusions of previous studies, thereby keeping the debate alive.

One of the key issues in the debate is whether linguistic complexity interacts with learning conditions (e.g., Andringa et al., Reference Andringa, de Glopper and Hacquebord2011; DeKeyser, Reference DeKeyser1995, Reference DeKeyser, Doughty and Williams1998, Reference DeKeyser2016; Housen et al., Reference Housen, Pierrard, Van Daele, Housen and Pierrard2005; Reber et al., Reference Reber, Kassin, Lewis and Cantor1980; Tagarelli et al., Reference Tagarelli, Ruiz, Moreno Vega and Rebuschat2016; see Spada & Tomita, Reference Spada and Tomita2010 for a meta-analysis). So far, a few studies have incorporated both explicit and implicit conditions and targeted both simple and complex structures, and their findings are inconclusive. Some report that the two conditions are equally effective regardless of linguistic complexity (e.g., Housen et al., Reference Housen, Pierrard, Van Daele, Housen and Pierrard2005); others find that while explicit instruction leads to more gains than implicit/incidental exposure in the learning of simple rules, the two conditions do not produce statistically different results when the learning target is complex (e.g., DeKeyser, Reference DeKeyser1995, Reference DeKeyser, Doughty and Williams1998; Williams & Evans, Reference Williams, Evans, Doughty and Williams1998). Still others point at an advantage of explicit instruction over implicit/incidental exposure regardless of complexity (e.g., Robinson, Reference Robinson1996; Tagarelli et al., Reference Tagarelli, Ruiz, Moreno Vega and Rebuschat2016; see Spada & Tomita, Reference Spada and Tomita2010 for a meta-analysis). Finally, very few studies revealed that other factors may also come into play, such as the type of the target structures (de Graaff, Reference de Graaff1997), the salience of the target structures (Tagarelli et al., Reference Tagarelli, Ruiz, Moreno Vega and Rebuschat2016), and learners’ first language (L1) background (e.g., Andringa et al., Reference Andringa, de Glopper and Hacquebord2011).

The inconclusiveness of the findings is partly attributable to the complex nature of the definition of linguistic complexity (Bulté & Housen, Reference Bulté, Housen, Housen, Kuiken and Vedder2012; Housen et al., Reference Housen, Kuiken, Vedder, Housen, Kuiken and Vedder2012; Housen & Simoens, Reference Housen and Simoens2016; Hulstijn & de Graaff, Reference Hulstijn and de Graaff1994). Hulstijn and de Graaff (Reference Hulstijn and de Graaff1994), arguably the first systematic investigation into this subject, proposed that linguistic complexity should be primarily determined by functional complexity, or the “number (and/or the type) of criteria to be applied to arrive at the correct form” (p. 103). Housen et al. (Reference Housen, Kuiken, Vedder, Housen, Kuiken and Vedder2012, p. 4) provided a more encompassing definition that includes the intrinsic “formal” or “semantic-functional” properties of second language (L2) elements and the properties of the “(sub-)systems” of the L2 elements. Importantly, there is a gap between the objective, feature-related complexity of the structures in the contexts of explicit instruction and the structures perceived and abstracted by learners in the incidental condition. DeKeyser (Reference DeKeyser, Doughty and Williams1998, p. 44), for example, pointed out that average learners are unlikely to induce implicitly any but the most semantically transparent and formally simple rules. Rebuschat (Reference Rebuschat2008, experiment 3) also reported that under the incidental condition, a complex rule may be partially and inaccurately perceived as a simple rule in learners’ interlanguage system, resulting in (apparent) learning gains. Some researchers have hence called for further studies on the exact nature and the relative weight of the various factors that contribute to objective, feature-related complexity and subjective, learner-related difficultyFootnote 1 (e.g., Housen & Simoens, Reference Housen and Simoens2016).

Among the few experimental studies of the relation between linguistic complexity and learning condition, Tagarelli et al. (Reference Tagarelli, Ruiz, Moreno Vega and Rebuschat2016) is the first to offer initial evidence that cognitive salience impacted the learning of one complex syntactic rule but not that of the other. The researchers used a semiartificial language composed of German syntax and English lexicon, a system developed by Rebuschat (Reference Rebuschat2008) and subsequently used in multiple recent studies (e.g., Kim & Godfroid, Reference Kim and Godfroid2019; Miller & Godfroid, Reference Miller and Godfroid2020; Rebuschat & Williams, Reference Rebuschat and Williams2012). Fifty-one native speakers of English with no background in German were randomly assigned to the incidental group or the instructed group. During the training phase, the participants in both groups were exposed to 120 sentences containing three patterns of varying linguistic complexity and were asked to judge the plausibility of the sentences and also orally repeat them. Before the plausibility and sentence repetition tasks, the participants in the instructed group were also presented with the verb-placement rules and were asked to write down two sentences that followed each rule. During the testing phase, both groups completed an untimed grammaticality judgment task (GJT) of 60 new sentences. For each sentence, participants were asked to judge the sentence’s grammaticality, indicate how confident they were in the accuracy of their judgment, and report the source of their judgment. The results showed that exposure condition was a significant predictor of two syntactic patterns (V2 and V2-VF), but not of the most complex structure (VF-V1). The debriefing questionnaire data showed that the VF-V1 rule was highly salient to many participants, which according to the researchers might have compensated for its complexity and led to better learning results than expected. The incidental group’s d’ scores for this structure approached significance (d’ = 0.28, p = .07), which contrasted with the same group’s performance for the V2-VF structure (d’ = 0.04, p > .05). Meanwhile, the VF-V1 rule was the only structure for which the learning outcomes were not predicted by exposure condition.

The interesting results reported by Tagarelli et al. (Reference Tagarelli, Ruiz, Moreno Vega and Rebuschat2016) suggested that in addition to linguistic complexity and learning condition, learner-related difficulty may also affect learning outcomes in more complicated ways than expected. Specifically, certain target structures (e.g., VF-V1 with its contiguous verb placement) may give rise to greater cognitive salience and less perceived difficulty, thus leveling out the advantage of explicit instruction.

Tagarelli et al.’s (Reference Tagarelli, Ruiz, Moreno Vega and Rebuschat2016) interesting results promoted us to explore another shared learner-related feature apart from cognitive salience, namely, learners’ L1 experience. Learners may be affected by their L1 syntax and perceive the difficulty of different L2 structures differently, which may moderateFootnote 2 the effect of linguistic complexity and exposure condition in semiartificial learning. As the following review will reveal, the use of lexicon from learners’ L1 in semiartificial grammar learning is a double-edged sword: on the one hand, it frees learners from the burden of learning new words while enabling researchers to control for exposure and simulate natural language processing, thereby giving studies in this paradigm an advantage over those in natural language instruction and artificial grammar learning; on the other hand, however, it could give rise to learners’ reliance on L1 syntactic knowledge, thus increasing the variability of results. It is therefore necessary to examine the generalizability of relevant findings in the semiartificial grammar learning paradigm by conducting partial/conceptual replications in diverse L1 contexts. In the context of this study, it is interesting to investigate whether Tagarelli and colleagues’ findings about exposure condition and linguistic complexity would be replicated in Chinese with a typologically different syntax. The present study contributes to this gap by using the same procedure as that of Tagarelli and colleagues but adopting the Chinese translations of the training and test stimuli, with a population of native speakers of Chinese.Footnote 3

In addition to recruiting L1-Chinese participants and translating the stimuli into Chinese, we also introduced another important change to the design of the initial study, namely, eliciting production data during the debriefing phase. As one anonymous reviewer pointed out, language production is barely investigated in L2 research that employs (semi)artificial languages. To our knowledge, Ruiz et al. (Reference Ruiz, Tagarelli and Rebuschat2018) is the only study so far that elicited sentence production data alongside grammaticality judgment data. In that study, the participants in both the incidental and intentional conditions predominantly produced simple sentences on the production task (incidental: 86%; intentional: 92%), and the two groups did not perform differently accuracy-wise in sentence production or grammaticality judgment. Importantly, the explicit condition that Ruiz and colleagues investigated was intentional rule-search while ours was instruction. By eliciting production data during the debriefing phase, we may compare our results with Ruiz and colleagues’, thereby offering new insights into the relation between language production and exposure condition, a rarely investigated area in previous (semi)artificial grammar learning research.

LITERATURE REVIEW

L1 TRANSFER IN SEMIARTIFICIAL GRAMMAR LEARNING

Abundant psycholinguistic and SLA research has investigated the crucial role of prior L1 knowledge in adult L2 learners’ perception of natural language instruction (e.g., Andringa et al., Reference Andringa, de Glopper and Hacquebord2011; Doughty & Williams, Reference Doughty, Williams, Doughty and Williams1998; Ellis, Reference Ellis2006; Kellerman, Reference Kellerman1978; VanPatten, Reference Van Patten and VanPatten2003). In the acquisition of L2 syntax, the whole of the L1 grammar constitutes the L2 initial state (Hartsuiker & Bernolet, Reference Hartsuiker and Bernolet2017; Neeleman & Weerman, Reference Neeleman and Weerman1997; Schwartz & Sprouse, Reference Schwartz, Sprouse, Hoekstra and Schwartz1994); after learner’s exposure to L2 input, L1 background affects the perceived difficulty of target structures (e.g., Andringa et al., Reference Andringa, de Glopper and Hacquebord2011; MacWhinney et al., Reference MacWhinney, Bates and Kliegl1984) and influences the sequence of L2 acquisition order (e.g., Della Putta, Reference Della Putta2016; Ellis, Reference Ellis2006). Studies also show that L1 syntactic knowledge continues to be activated in highly proficient late bilinguals’ processing of L2 constructions (Hwang et al., Reference Hwang, Shin and Hartsuiker2018; Luke et al., Reference Luke, Liu, Wai, Wan and Tan2002).

Despite the attention L1 transfer has received in the studies of natural language instruction, it remains understudied in semiartificial language learning (Hamrick & Sachs, Reference Hamrick and Sachs2018). As an alternative paradigm to artificial grammar learning, semiartificial language learning combines novel structures with the lexical information from the participants’ L1 (Williams, Reference Williams and Hinkel2005). It is argued that this paradigm can both control for exposure and engage participants in meaningful tasks, thereby preserving the strengths of artificial grammar learning while better simulating the cognitive processes in learning novel structures in natural languages (Tagarelli et al., Reference Tagarelli, Ruiz, Moreno Vega and Rebuschat2016; for an example of early artificial grammar learning research see Reber et al., Reference Reber, Kassin, Lewis and Cantor1980). However, as researchers pointed out, semiartificial language systems draw on multiple phonological/semantic cues from the participants’ L1, which may create challenges that have not been sufficiently addressed (Hamrick & Sachs, Reference Hamrick and Sachs2018) or lead to increased saliency that could influence and potentially alter the cognitive processes underlying natural language learning (e.g., Godfroid, Reference Godfroid2016; Leow, Reference Leow2019; Maie & DeKeyser, Reference Maie and DeKeyser2020).

So far two studies have addressed the possible impact of L1 background on the learning of semiartificial structures with different degrees of complexity. Using a semiartificial language (“Japlish”) that combines Japanese word order with English lexicon, Williams and Kuribara (Reference Williams and Kuribara2008) investigated how incidental exposure affects English native speakers’ learning of the SOV word order (the “canonical” order in Japlish) and of the “scrambled” sentence patterns (the “marked” order in Japlish). They found that (a) the no-exposure group showed an above-chance acceptance of the canonical SOV word order after very limited exposure; (b) the exposure group showed a statistical advantage over the no-exposure control on two trained sentence patterns (one canonical and the other scrambled) but did not show any advantage on other sentence patterns, be they trained or new, canonical, or scrambled (the exposure group even showed significantly lower acceptance rates on trained short scrambling), and (c) only a subset of the incidental exposure group showed signs of acquiring scrambling, but these participants did not perform above chance in rejecting structures that are grammatical in English syntax but illegitimate in Japlish. As a follow-up of Williams and Kuribara (Reference Williams and Kuribara2008), Williams (Reference Williams2010, experiment 1) repeated the procedures in the original study but doubled the amount of exposure. The results were largely consistent with those of Williams and Kuribara (Reference Williams and Kuribara2008), except that the learning gains for the trained scrambles were shown in the increased-exposure group only.

The results of the two studies show that L1 syntactic knowledge may play a vital role in the learning of semiartificial language structures. It should be noted, nevertheless, that neither Williams and Kuribara (Reference Williams and Kuribara2008) nor Williams (Reference Williams2010) incorporated explicit instruction; it is not clear whether learners’ L1 experience would moderate the intake of explicit instruction in the same way as it does that of incidental exposure. However, studies such as Tagarelli et al. (Reference Tagarelli, Ruiz, Moreno Vega and Rebuschat2016) that did compare explicit instruction with incidental exposure only included learners’ L1 experience as background information without addressing the possible role of L1 transfer in semiartificial grammar learning. By systematically changing the L1 background from English to a typologically different language—in our context, Chinese—we may investigate the generalizability of Tagarelli and colleagues’ findings about the relation between linguistic complexity, exposure condition, and learning outcomes, thereby shedding new lights on the neglected area of L1 transfer in semiartificial grammar learning.

In the following section, we first reanalyze the rules of the semiartificial language system and the stimuli used by Tagarelli et al. (Reference Tagarelli, Ruiz, Moreno Vega and Rebuschat2016) by comparing German, Chinese, and English word order. On the basis of the initial analysis, we predict the outcomes of this conceptual replication.

RULES AND STIMULI IN TAGARELLI et al. (Reference Tagarelli, Ruiz, Moreno Vega and Rebuschat2016) REVISITED: THE WORD ORDER ISSUE

The following are the three target rules and the sample sentences included in Tagarelli et al. (Reference Tagarelli, Ruiz, Moreno Vega and Rebuschat2016):

  1. 1. Simple: The finite verb is placed in the second phrasal position of main clauses that are not preceded by a subordinate clause (V2).

    e.g., Yesterday scribbled David a long letter to his family.

    Adjunct–Verb–Subject–Object–Adjunct

  2. 2. Complex 1: The finite verb is placed in the final position of all subordinate clauses (V2-VF).

    e.g., In the afternoon acknowledged David that her children to England moved .

    Adjunct–Verb 1–Subject 1–Subject 2–Adjunct–Verb2

  3. 3. Complex 2: When a subordinate clause precedes a main clause, the finite verb is placed in the first position of the main clause and the final position of the subordinate clause (VF-V1).

    e.g., When his wife in the afternoon the office left , prepared Jim dinner for the entire family.

    Subject 1–Adjunct–Object 1–Verb 1–Verb 2–Subject 2–Object 2–Adjunct

The German word order as exemplified in these example sentences departs from the typical Chinese and English word order in a number of ways. Drawing from the relevant findings of typological, corpus, and syntactic studies, we summarized the relevant features of German, English, and Chinese word order in S1 in the Supplementary Materials. On the basis of the literature-based analysis, we differentiated the grammaticality of major clause types of the training and test stimuli in German, Chinese and English as follows (Table 1):

TABLE 1. Grammaticality of sentence patterns in training and test stimuli in German/English/Chinese

Note: S = Subject; V = Verb; O = Object; A = Adjunct.

a The word order of the main clause (VA) is ungrammatical in Chinese, but the lack of syntactic markers in Chinese and the aural mode make it possible for participants to misparse the adjunct as part of the subordinate clause, thereby making the stimuli sound grammatical in Chinese. From the initial analysis of the stimuli we may draw two observations. First, Chinese is the only language in the world that places the adjunct before the verb (Dryer & Gensler, Reference Dryer, Gensler, Dryer and Haspelmath2005), which results in its proximity to the VF part of the complex, less salient V2-VF rule. At the same time, as Chinese is also an SV language, many Chinese sentences have their verbs placed in the third position or even later (e.g., SAV; SAOV), which may make it more difficult for L1-Chinese learners to acquire the simple V2 rule. Second, the sentences grouped under the same rule (e.g., VF-V1, *VF-V2, and *V1) or even those grouped into the same sentence pattern (AVSOA and SVAAA under V2) may differ from each other in terms of their closeness to Chinese/English syntax, thus possibly leading to different perceived difficultyFootnote 4 for Chinese and English native speakers. These observations suggest that there would possibly be increased variability at the learner and item levels in the changed L1 background, which could be concealed by an analysis on the aggregated data (e.g., the mixed-effects repeated measures ANOVA that the original study adopted). It is therefore both important and necessary to contextualize the findings about group-level variation (i.e., the fixed effects) against random variability (i.e., the random effects) to gain a better understanding of the generalizability of such findings in the changed L1 background, specifically, L1-Chinese.

Based on this initial analysis, we put forward the following research questions:

RQ1: To what extent can Tagarelli et al.’s (Reference Tagarelli, Ruiz, Moreno Vega and Rebuschat2016) findings about the relation between exposure condition, linguistic complexity, and grammar learning be replicated and generalized with a different L1 background, specifically, L1-Chinese?

RQ2: How does the group-level variation based on exposure condition and linguistic complexity (the fixed effects) compare with the variability at individual learner and item levels (the random effects)?

Our predictions of the possible outcomes are as follows:

  1. 1a. Based on the findings of previous experimental and meta-analytical research (e.g., Andringa et al., Reference Andringa, de Glopper and Hacquebord2011; Goo et al., Reference Goo, Granena, Yilmaz, Novella and Rebuschat2015), we predict a main effect for exposure condition, with learners receiving explicit instruction outperforming those receiving incidental exposure;

  2. 1b. Based on the findings of previous typological, syntactic, psycholinguistic and corpus studies (e.g., Dryer & Gensler, Reference Dryer, Gensler, Dryer and Haspelmath2005; Rankin, Reference Rankin2014; Williams, Reference Williams2010), we predict a condition × complexity interaction with the L1-Chinese background, with L1-Chinese learners perceiving the simple V2 rule as more difficult and the complex V2-VF rule as less difficult; and

  3. 2. On the same basis, we predict a considerable variability at individual learner and item levels compared with the group-level variation based on exposure condition, linguistic complexity, and their interaction (if any).

METHOD

PARTICIPANTS

The data were collected at a public university in China. Participants were 84 Chinese native speakers with no background in German or any other V2 languages. The participants were randomly assigned to an incidental exposure condition (n = 41, 35 female, Mage  = 19.10) or an instructed condition (n = 43, 41 female, Mage  = 19.23). An a priori power analysis using G*Power 3.1 (Faul et al., Reference Faul, Erdfelder, Buchner and Lang2009) showed that the sample size is well-powered (for the power analysis see S2-1).

The mean age and gender distribution of our participants are similar to those of the participants in the original study (for further participant information see S2-2). Consistent with the original study, the participants in our study received course credit in return.

DESIGN

The study design, procedure, and instruments were exactly as reported by Tagarelli et al. (Reference Tagarelli, Ruiz, Moreno Vega and Rebuschat2016), except that (a) the training and test stimuli were translated into Chinese, and (b) we incorporated an additional question in the debriefing questionnaire to elicit the participants’ production data (see S4). As with the original study, the design included a session of artificial language training and testing and a session of cognitive tasks (the operationalization and the results of the cognitive tasks will be reported elsewhere). In the following, we will briefly describe the design, with special emphasis on the two aspects of the design in which our study differed from the original study.

Pretraining Instruction, Training, and Testing

In the three sessions of pretraining instruction, training, and testing, we adopted the same procedures as in the original study.

The pretraining instruction was administered to the instructed group only. During this session, the participants were told that the scrambled sentences they would hear followed three rules of word order. They were then presented visually with the three rules illustrated with two sentence examples, respectively. After each rule was presented, the participants were required to produce two sentences according to the rule, but they received no feedback. All the participants in the instructed group finished the pretraining session within 15 minutes.

During the training phase, both the incidental and the instructed groups were auditorily presented with the Chinese translations of the 120 stimuli used in Tagarelli et al. (Reference Tagarelli, Ruiz, Moreno Vega and Rebuschat2016) in random order. Neither group was told that there would be a testing phase. The participants in the incidental group were told that they were tested on their ability to judge the plausibility of sentences with scrambled word order to keep their attention away from the linguistic targets. In contrast, the participants in the instructed group were told to both judge the plausibility of the scrambled sentences and reflect over the rules they had learned.

The training stimuli were evenly distributed in plausibility and across the three rules, with 20 plausible and 20 implausible sentences for each syntactic pattern (V2; V2-VF; VF-V1). After hearing each sentence, participants were asked to indicate its plausibility by clicking on a “Plausible” or “Implausible” button on the screen and orally repeat the sentence. Feedback was immediately provided on each of the plausibility judgments. As with the original study, participants would have to process the entire auditory string before judging its plausibility. See the following stimuli for example:

Plausible:

小乔 重申了 在今天搬运工 他的家具 划坏了

George repeated today that the movers his furniture scratched.

Implausible:

小乔 切开了 在几个月前 菜谱 用桃子

George laminated a few months ago the recipe cards with peaches.

During the GJT phase, the participants were either told (incidental group) or reminded (instructed group) that the “scrambled” word order of the sentences in the training phase followed a certain rule system, and that they should judge the grammaticality of the new sentences based on the ones they had heard in the training. Identical to the original study, participants first completed four practice trials (wherein no feedback was provided) and were then auditorily presented with 60 new sentences (30 grammatical and 30 ungrammatical) in random order. The testing set included 10 grammatical sentences for each rule (V2, V2-VF, and VF-V1) and 5 sentences for each ungrammatical pattern (*V3, *V4, *V1-VF, *VF, *VF-V2, and *V1). For each sentence, participants were asked to click on the appropriate buttons on the screen to judge the sentence’s grammaticality (grammatical/ungrammatical), indicate how confident they were in the accuracy of their judgment (not at all confident, somewhat confident, quite confident, and extremely confident), and report the source of their judgment (guess, intuition, memory, and rule).Footnote 5

Translation of the Training and Test Stimuli

We translated all the training and test stimuli into Chinese (see S9). Care was taken so that the translated stimuli follow the exact word order of the original stimuli. To prevent barriers to comprehension, all the common English names were substituted with common Chinese names and the lexical items that require certain culture-specific knowledge were replaced with general ones.Footnote 6 See the following list of examples:

V2: Chris entertained today his colleagues with an interesting performance. (SVAOA)

小李 招待了 在今天 他同事 一场演出。(SVAOA)

V2-VF: Paul speculated yesterday that the suspect from prison escaped . (SVA-SAV)

小罗 怀疑 在昨天嫌疑犯 从监狱 逃脱了 。(SVA-SAV)

VF-V1: When his wife in the afternoon the office building left , prepared Jim dinner for the entire family. (SAOV-VSOA)

小金的妻子 在下午 办公楼 离开了 准备了 小金 晚餐 为全家人。(SAOV-VSOA)

The translations were proofread by two professional Chinese-L1 translators and one English-L1, Chinese-L2 bilingual, who agreed that every translated stimulus was accurate in terms of both lexical information and word order.

Debriefing

After all the participants completed the GJT, they were given a debriefing questionnaire in paper form. All participants completed the debriefing questionnaire within 20 minutes. In the questionnaire, we asked the participants to describe the rules they had inferred from the practice as Tagarelli et al. (Reference Tagarelli, Ruiz, Moreno Vega and Rebuschat2016) did; additionally, we also listed 10 expressions (see S4) and invited the participants to create as many grammatical sentences as they could with any number of these expressions, according to the rules they inferred from the training. The list of expressions included four NPs (Xiaoliang, Xiaoliang’s aunt, the oven, and bread), three verbs (fixed, bought, and said), two adjuncts (in a hurry and at eight last night), and one adverbial subordinator (since). The production data enabled us to triangulate the findings about the GJT data without demanding a high level of metalinguistic awareness on the part of the participants.

Cognitive Tasks

Following the original study, we arranged a separate session to test on the participants’ working memory (WM) and procedural learning ability. The specific operationalization and results of the cognitive tasks will be reported elsewhere.

OUTCOME MEASURES AND ANALYSIS

Following Tagarelli et al. (Reference Tagarelli, Ruiz, Moreno Vega and Rebuschat2016), our outcome measures included the accuracy rates and the d’ scores on the GJT (with the d’ scores controlling for response bias and reflecting the participants’ sensitivity), the subjective measures of awareness (i.e., source attributions and confidence ratings, to be reported elsewhere), and the outcomes of the cognitive tasks (which will be reported elsewhere). In addition to these measures, we also included the sentence production data to triangulate the findings about the GJT. The sentences were first manually coded by both researchers according to syntactic complexity and verb positioning. For instance, “Xiaoliang at eight last night repaired the oven” was coded V3, as it was a simple sentence with the verb appearing as the third element. “At eight last night Xiaoliang said Xiaoliang’s aunt the oven had repaired” was coded VFVF because it was a complex sentence with the verb in the main and subordinate clauses appearing as the last element. The two researchers then rated the grammaticality of each coded sequence based on the semiartificial grammar. The interrater reliability for both the coding and the grammaticality ratings was 100%.

Our RQ1 was whether Tagarelli and colleagues’ findings about exposure condition, linguistic complexity, and grammar learning would be replicated and generalized in a different L1 background, specifically, L1-Chinese. To address RQ1, we performed (a) a robust independent-samples t-test to compare the groups’ overall performance in the GJT,Footnote 7 and (b) a robust 2 × 3 between-within subjects ANOVA on trimmed means (20% trimming level) with Condition (Incidental vs. Instructed) as the between-subject variable and Complexity (Simple, Complex 1, and Complex 2) as the within-subject variable to examine the main effects and possible interactions.Footnote 8

Our RQ2 was how the individual-level variability in the raw data compared with the group-level variation based on condition and linguistic complexity. To address RQ2, we performed a mixed-effects modeling analysis to estimate both fixed and random effects (i.e., with a random slope and intercept for Subject and a random intercept for Item) based on the raw accuracy data and compared the results with those on the mean-based aggregated data.

The effect sizes of Cohen’s d were interpreted on the basis of Plonsky and Oswald (Reference Plonsky and Oswald2014), namely, d = 0.60, 1.00, and 1.40 corresponding to small, medium, and large effect sizes. The explanatory measure of effect size ξ was interpreted based on Mair and Wilcox (Reference Mair and Wilcox2020, p. 10), namely, ξ̂ = 0.10, 0.30, and 0.50 corresponding to small, medium, and large effect sizes.

RESULTS

RQ1: RELATIONSHIP BETWEEN EXPOSURE CONDITION AND SYNTACTIC COMPLEXITY

As with the original study, we adopted the outcome measures of accuracy rates and d’ scores. See Table 2 for descriptive statistics on the two outcome measures and Figure 1 for a graph of the mean differences on the d’ scores for group and complexity.

TABLE 2. Accuracy (%) and d’ scores on the GJT for each group overall and according to complexity

Note: Significance from chance: + p = .055; * p < .05; ** p < .01; *** p < .001.

FIGURE 1. Effects of group and complexity on GJT performance.

As shown in Table 2, the 95% confidence intervals (CIs) for the incidental group’s overall d’ scores spanned 0 (0 indicating no discrimination ability), whereas those for the instructed group’s overall d’ scores did not. These results indicated that only the instructed condition performed above chance overall. Moreover, there was no overlapping in CIs between the two groups overall and for two of the three target structures, indicating an advantage of explicit instruction at large. A robust independent samples t-test on trimmed means (20% trimming level) showed that the instructed group performed statistically better than the incidental group overall, t(40.39) = 2.37, p = .023, with a large explanatory measure of effect size, ξ = 0.43 [0.08, 0.71]. The effect size of the between-group difference in overall GJT performance in our study (g[CI] = 0.87 [0.42, 1.32]) is consistent with the calculated effect size in Tagarelli et al. (Reference Tagarelli, Ruiz, Moreno Vega and Rebuschat2016) (g[CI] = 0.86 [0.28, 1.44]).

The structure-specific data in our study exhibited largely similar patterns to those in the original study regarding the role of exposure condition, but the different slopes of the two groups’ performance across the three structures (Figure 1) suggested a potential exposure condition × complexity interaction. To ascertain whether such an interaction existed, we performed a robust 2 × 3 between-within subjects ANOVA (20% trimming level) with Condition as the between-subject variable (Incidental vs. Instructed) and Complexity as the within-subject variable (Simple, Complex 1, and Complex 2). The results showed that the interaction between Condition and Complexity was statistically significant, F(2, 29.55) = 4.16, p = .026, and so was the main effect of Condition, F(1, 31.32) = 9.73, p = .004. The effect of Complexity was nonsignificant, F(2, 29.55) = 1.49, p = .242. We performed post hoc pairwise comparisons on trimmed means (20% trimming level), with the 95% CIs and p-values adjusted for multiple testing (p = .05/6 = .008). See Table 3 for the results:

TABLE 3. Post hoc pairwise comparisons on trimmed means (20% trimming level)

As shown in Table 3, before the p-value adjustment the instructed group outperformed the incidental group on both the Simple and Complex 2 structures, exhibiting a similar pattern as observed in the 95% CIs in Figure 1. We may also observe that compared with the instructed group, the incidental group scored significantly higher on the Complex 1 structure than on the Simple structure, which lent support to our prediction that the perceived difficulty of the structures would vary in the changed L1.

However, after the adjustment, the two groups differed significantly only on the Simple structure, with the instructed group outperforming the incidental, ψ̂ = −1.01 [−1.73, −0.30], p = .007. The difference between the incidental and instructed groups also remained statistically significant, ψ̂ = −1.97 [−3.00, −0.94], p < .001. These results supported our initial prediction about the interaction effect between condition and complexity.

We now summarize our findings about RQ1 in Table 4:

TABLE 4. Summary of findings on RQ1: Original study and replication contrasted

The findings of the GJT data analysis were both corroborated and enriched by an examination of the sentence production data. Six participants (incidental: 2, instructed: 4) left this part blank. The other participants produced a varied number of sentences from 1 or 2 sentences to as many as 16 sentences. See Table 5 for an overview of the sentence production data:

TABLE 5. Overview of the sentence-production data

a Patterns comprising 5+ instances are included as labeled types. For patterns with below five tokens, see S7-1.

b This pattern put the main clause before the subordinate clause.

From Table 5 we may draw the following observations. First, the instructed group exhibited substantially greater learning outcomes than the incidental group (compare the proportion of targetlike sentences produced by both groups), thus supporting Prediction 1a about the advantage of instruction over incidental exposure; second, the untarget-like sentences produced by the incidental group tended to place the verbs toward the end, suggesting the impact of L1 transfer on this group. Moreover, this group produced more ungrammatical V2 sentences (*V3: 23; *V4: 18) than grammatical ones (14), thus supporting Prediction 1b about the condition × complexity interaction. Third, the untarget-like sentences produced by the instructed group put the verbs in the first, second, or last places, all of which were contained in the rules taught to this group. These instances of rule overgeneralization indicated that the participants at the early stage of development failed to associate verb placement with syntactic complexity and clause sequence.

To sum up, our hypotheses regarding the effect of exposure condition, linguistic complexity, and the interaction between the two variables in the L1-Chinese background were supported.

RQ2: FIXED VS. RANDOM EFFECTS

So far we have explored the group-level patterns in the aggregated data following Tagarelli et al. (Reference Tagarelli, Ruiz, Moreno Vega and Rebuschat2016)’s methods of analysis. However, collapsing data could conceal the variability at individual learner and item levels. For the L1-Chinese learners in our study, the problem with random variability could be particularly prominent, as the test stimuli included ungrammatical sentence patterns (*V3, *V4, *VF, *VF-V2) that could be grammatical in Chinese due to the language’s preference of verb postponement. Given these particularities, it is both important and necessary to contextualize our discussions of the fixed effects (i.e., exposure condition, linguistic exposure, and their interaction) against a more accurate measurement of the random effects (i.e., variability at the individual learner and item levels). One method to realize this goal is applying mixed-effects modeling on raw, nested data, whose technique has been adopted by an increasing number of SLA studies (Dörnyei, Reference Dörnyei2009; Hui, Reference Hui2020; Mukarami, Reference Mukarami2016; Tao & Williams, Reference Tao and Williams2018; see Linck & Cunnings, Reference Linck and Cunnings2015 for the utility of the method in SLA research).

We ran binary mixed-effects logistic regressions using the lme4 package in R (Bates et al., Reference Bates, Maechler, Bolker and Walker2015). For the fixed effects we included Condition (Incidental and Instructed), Complexity (Simple, Complex 1, and Complex 2), and Grammaticality (Grammatical and Ungrammatical), with Grammaticality added because difficulty of GJTs may differ depending on whether the item sentences are grammatical or ungrammatical (e.g., R. Ellis, Reference Ellis2005).Footnote 9 For the random effects, we included random intercepts for items and random slopes and random intercepts for participants.Footnote 10 Regarding model selection, we mainly adopted James et al.’s (Reference James, Witten, Hastie and Tibshirani2013) method by building the unconditional model first (i.e., a model that included the random-effects variables only), gradually adding predictors that decreased the model’s Akaike information criterion (AIC) the most, and considering interaction terms only when the model already included the main effects constituting the interactions. The model comparisons are summarized in Table 6.

TABLE 6. Summary of binary mixed-effects logistic regressions model comparisons

As shown in Table 6, AIC categorically decreased until Model 5, and likelihood ratio tests suggested a steady improvement until Model 5. Considering that complexity may possibly improve the model after other variables are entered, we included complexity into the fixed-effects variables on the basis of Model 5. However, the addition of this parameter did not reduce the AIC scores significantly (Model 6).

To ensure the predictor variables were not collinear with each other, we diagnosed their corresponding VIF values that were obtained by the vif() function in the car package (Fox et al., Reference Fox, Weisberg, Price, Adler, Bates and Baud-Bovy2019). The results for Model 4 were acceptable (condition: vif = 1.00, grammaticality: vif = 1.00), while those for Model 5 were not (condition: vif = 1.85, grammaticality: vif = 3.41, grammaticality × condition: vif = 4.24).

Although Model 5 was a significant improvement on Model 4, we selected Model 4 as the final model. First, the decrease of AIC from Model 4 to Model 5 was fairly small (2.21) and the p value of the added parameter (.040) was not as low as the p values of other parameters; second, Model 4 is less complex than Model 5 and AIC tends to prefer more complex models in general (Held & Bové, Reference Held and Bové2014); and third, similar decisions were made by previous L2 research (e.g., Mukarami, Reference Mukarami2016).

See Table 7 for a summary of Model 4 (for the purpose of comparison, see S8 for a summary of Model 5).

TABLE 7. Summary of Model 4

a Both theoretical and observation-level (delta) values were reported for reproducibility.

From Table 7 we may draw the following observations. First, although Model 4 explained a medium proportion of the variance (theoretical: 28%; observation-level: 24%), most of the variance was attributable to random effects (theoretical: 23%; observation-level: 20%). The fixed effects (condition and grammaticality), significant predictors as they were, only accounted for a small proportion of the variance of the raw accuracy data (theoretical: 5%; observation level: 4%). Second, consistent with the findings about the aggregated data, linguistic complexity was not a significant predictor. Third, the analysis on the raw data differed from the group-level analysis in that the best-fit model did not include any interaction effect. Last but not least, the analysis on the raw data revealed that the by-learner random slope for grammaticality accounted for large variances of the participants’ accuracy rate on the logit scale (note the corresponding large SD in the model summary), indicating that the individual participants responded to grammatical(/ungrammatical) items very differently from each other. This finding is consistent with the wide dispersion of the individual d-prime scores in Figure 1 and points at the necessity to further investigate the random variability at individual participant (and item) levels.

With these observations we now summarize the findings about RQ2 in Table 8.

TABLE 8. Summary of findings about RQ2: original study and this study contrasted

SUMMARY OF FINDINGS AND DISCUSSION

Despite the extensive attention accorded to L1 transfer in natural language instruction, the factor is understudied in semiartificial language learning. In this study, we conceptually replicated Tagarelli et al. (Reference Tagarelli, Ruiz, Moreno Vega and Rebuschat2016) by recruiting L1-Chinese participants, translating the stimuli to Chinese, and eliciting language production data during debriefing, to investigate how L1 transfer moderates the interaction between linguistic complexity and exposure condition in semiartificial grammar learning. We also extended the initial study by using a binary mixed-effects logistic regression analysis to contextualize our findings about the fixed effects against the random effects. Our major findings are as follows: (a) in the changed L1 background explicit instruction continued to exhibit an advantage over incidental exposure overall and on the simple structure; (b) the analysis on the GJT data showed that linguistic complexity was not a significant predictor in the L1-Chinese background; (c) the analysis on sentence production data confirmed that the instructed group performed better than the incidental group, with negative L1 transfer affecting the incidental group more than the instructed group; (d) the mixed-effects modeling analysis supported the findings of the group-level analysis regarding the advantage of the explicit over the incidental condition and the nonsignificance of linguistic complexity; however, it did not find the condition × complexity interaction effect; and (e) the mixed-effects models revealed that the variation across individual participants and items accounted for a bigger portion of the outcome variance than the fixed effects combined.

Our interpretation of the results addresses three separate questions. First, explicit instruction still exhibited an overall advantage over incidental exposure even when the L1 background changed to a typologically different language. This finding lends support to the conclusion of the original study about the comparative efficacy of explicit instruction and corroborates with the findings of numerous experimental studies comparing the explicit with the incidental conditions (e.g., de Graaff, Reference de Graaff1997; Robinson, Reference Robinson1996) and meta-analytical research (e.g., Goo et al., Reference Goo, Granena, Yilmaz, Novella and Rebuschat2015; Spada & Tomita, Reference Spada and Tomita2010). Meanwhile, our results contrast with the findings of Morgan-Short et al. (Reference Morgan-Short, Steinhauer, Sanz and Ullman2012) and of Kang et al. (Reference Kang, Sok and Han2018). The contrast may be attributable to two factors, namely, the length and intensity of the treatment and the type of measures. First, the participants in our study were exposed to one training session consisting of 120 stimuli items before they took the judgment task. The limited length and intensity of exposure may have placed the incidental group at a disadvantage, for learning under implicit conditions is thought to take longer than under explicit conditions (N. C. Ellis, Reference Ellis2005). Second, researchers have argued that GJTs, even timed ones, tap into (automatized) explicit L2 knowledge rather than implicit L2 knowledge (e.g., Maie, Reference Maie2020; Suzuki, Reference Suzuki2017; Vafaae et al., Reference Vafaee, Suzuki and Kachinske2017) because they direct learners’ attention away from meaningful communication to formal accuracy (e.g., Suzuki & DeKeyser, Reference Suzuki and DeKeyser2017). Regarding the sentence production data, our form-focusing instruction about conforming to the rules of the language system may have heightened participants’ awareness, and as a result prompted the participants to use their explicit knowledge as well. Importantly, we did not include an analysis or discussion of the subjective measures of awareness in this study due to space limitations; a follow-up examination of these measures will undoubtedly offer valuable insights into the particular types of knowledge to which the different exposure conditions gave rise. We therefore call for more research that (a) investigates higher-proficiency learners and (b) includes measures of both explicit and implicit knowledge to present a fuller picture of the knowledge gained through the different conditions.

Second, our analysis of the group-level variation revealed no main effects of complexity, which departed from the findings of the original study in the L1-English context. The contrastive findings between our study and Tagarelli et al. (Reference Tagarelli, Ruiz, Moreno Vega and Rebuschat2016) help to determine the limits of generalizability of the conclusion of the original study about the role of complexity in L2 learning. As our analysis of sentence production data showed, L1 syntax transfer affected the incidental group more than the instructed group: on the one hand, the preference of Chinese syntax for postponed verb placement interfered with the incidental group’s intake of the V2 rule, leading to the group’s at-chance performance on this simple structure. On the other hand, some of the stimuli for the complex V2-VF sentence pattern are grammatical in Chinese, which could have contributed to the incidental group’s relatively high retention rate of this structure. These findings indicate that varying L1-L2 proximity could differentially affect the perceived L2 difficulty of simple/complex structures under different learning conditions, thereby rendering the effect of linguistic complexity less evident (for the instructed condition) or even offsetting the latter (for the incidental condition). This finding is corroborated by previous studies on natural language instruction that revealed the role of L1 background in perceived L2 difficulty (e.g., Della Putta, Reference Della Putta2016; González & Hernández, Reference González and Hernández2018). We hence call for more attention to the role of L1 transfer in discussions of the generalizability of findings regarding linguistic complexity, particularly in the field of semiartificial grammar learning.

Third, our analysis of the variability at individual learner and item levels confirmed the main effect of exposure condition, but negated the condition × complexity interaction found in the analysis on group-level variation. Moreover, the analysis revealed that the fixed effects, significant predictors as they were, accounted for a relatively small portion of the outcome variance compared with the random effects. These findings draw attention to the importance of accounting for learner- and item-level individual variation and point at the limitation of the group-level analysis methods adopted by the original study. On the basis of our results and echoing calls of researchers such as DeKeyser (Reference DeKeyser2012), we call for more interaction research and recommend that researchers compare the fixed effects with the random effects to contextualize the findings.

Importantly, although Tao and Williams (Reference Tao and Williams2018) also applied mixed-effects modeling to the analysis of raw data, they did it with a view to controlling the random effects without comparing the fixed effects with the random effects regarding the amount of variance of learning outcomes they explained (see Mukarami, Reference Mukarami2016 for a contrastive practice, namely, interpreting the random effects by plotting and summarizing). We hence call for more systematic investigations into the variability of the data at individual levels while controlling for random effects in discussions of the significance of the predictors.

CONCLUSION

In this study, we conceptually replicated and extended Tagarelli et al. (Reference Tagarelli, Ruiz, Moreno Vega and Rebuschat2016) and examined whether exposure condition affected the learnability of simple/complex structures in similar ways when the L1 background is changed to Chinese, a language typologically different from English (the L1 in the original study) and German (the target syntax). Our results showed that the effect of exposure condition remained a significant predictor of the outcome variance despite the changed L1 background, with explicit instruction resulting in better overall performance than incidental exposure. This lends support to the conclusion of the original study about the comparative efficacy of the explicit condition over the incidental one. Meanwhile, our results showed that the changed L1-L2 proximity could have interfered with the incidental group’s intake of the simple and complex structures, rendering the effect of complexity nonsignificant. This finding limits the generalizability of the conclusion of the original study about the effect of linguistic complexity. Last, the analysis on the raw data supplemented the group-level analysis in important ways and revealed that the fixed effects played a smaller part than the random effects, thus bringing attention to the importance of individual variation in discussions of fixed effects.

In this study, we did not include a control group. However, as Leow (Reference Leow2019) and Williams (Reference Williams2010) have pointed out, learners with no exposure do not necessarily perform at chance. Future research may expand on our findings by incorporating both L1-Chinese and L1-English control groups to further explore the role of learners’ L1 experience, thereby both redressing the limitations of reliance on chance performance and providing further evidence of the role of L1 transfer in semiartificial grammar learning. We also call for the adoption of measures of both implicit and explicit knowledge to present a fuller picture of the learning processes and outcomes under the incidental/explicit conditions.

Supplementary Materials

To view supplementary material for this article, please visit http://dx.doi.org/10.1017/S0272263120000686.

Footnotes

The experiment in this article earned an Open Materials badge for transparent practices. The materials are available at https://www.iris-database.org/iris/app/home/detail?id=york%3a938554&ref=search

We would like to thank Bronson Hui and Ryo Maie from Michigan State University for their detailed, insightful feedback on the drafts of this article. We thank Dr. Kaitlyn Tagarelli from Georgetown University and Dr. Simón Ruiz from University of Tuebingen for kindly providing the instruments that the original study adopted. We are grateful to Prof. Charlene Polio and Prof. Aline Godfroid from Michigan State University for their advice on the research design and data analysis. We thank the two anonymous reviewers and the handling editor Prof. Gregory Keating for their informative feedback. We thank our research assistants, Zizhen Wang and Yiwei Ma, and all the colleagues and students who participated in or supported the study. Finally, we would like to thank Kehao Zhang for technical support. All remaining errors are our own. This research was sponsored by China Scholarship Council.

1 Complexity is sometimes termed as “feature-related L2 difficulty,” which is subsumed under the cover term “L2 difficulty” along with context-related L2 difficulty (arising from varying learning conditions) and learner-related (cognitive, socio-affective) L2 difficulty (see, e.g., Housen & Simoens, Reference Housen and Simoens2016). Meanwhile, difficulty is sometimes termed as “relative complexity” and subsumed under “L2 complexity” along with absolute (linguistic, discourse-interactional, and propositional) complexity (see, e.g., Bulté & Housen, Reference Bulté, Housen, Housen, Kuiken and Vedder2012). Despite the diverse names they are given, the two concepts dichotomized in this study, namely, (objective) feature-related complexity and (subjective) learner-related difficulty, are clear.

2 L1 background here could be a moderator because it could have affected the strength of the effect of exposure condition on the learning of structures of varying complexity. For a distinction between a moderator and a mediator, see Baron and Kenny (Reference Baron and Kenny1986).

3 We categorized this study as a conceptual replication and extension of Tagarelli et al. (Reference Tagarelli, Ruiz, Moreno Vega and Rebuschat2016) because we systematically changed one parameter in the design of the initial study (by operationalizing the experiment in Chinese on L1-Chinese participants), elicited production data during debriefing, and added a new research question. For recommendations on the nomenclature, see Marsden et al. (Reference Marsden, Morgan-Short, Thompson and Abugaber2018).

4 Worthy of note is that the link between language distance and learning outcomes is not as straightforward as one would expect. On the one hand, studies show that learners with an L1 grammar similar to that of the L2 would be privileged in L2 development, whereas dissimilar L1-L2 grammatical properties do not support positive transfer (e.g., Della Putta, Reference Della Putta2016; Rankin, Reference Rankin2014); however, recent studies also show that language distance only affects processing time but not learners’ perception of accuracy of L2 syntactic strings (e.g., Booth et al., Reference Booth, Clenton and Van Herwegen2018).

5 Following the original study, the participants were presented with a specific explanation of each of these options before the testing phase. See S3 for these definitions.

6 Specifically, we replaced “president of England” in Practice Item 33 into “英格兰的酋长” (“tribal chief of England”); we also replaced “preached” in Practice Item 34 into “投降了” (“surrendered”).

7 Because the assumptions of normality and homogeneity of variance were not met (Shapiro–Wilk Test of Normality: W = 0.95, p < .001; Levene’s Test for Equality of Variances: F = 17.48, p < .001), we normalized the overall d’ scores using the Yeo–Johnson method. The corrected data were normalized (Shapiro–Wilk Test of Normality: W = 0.98, p = .119) but were still heteroskedastic (Levene’s Test for Equality of Variances: F = 9.86, p = .002). Consequently, we adopted the robust independent samples t-test on trimmed means (20% trimming level), which according to Yuen (Reference Yuen1974) allows for the presence of unequal variances. To compare the test results with those of a nonparametric test (in our study, the Wilcoxon rank sum test) on the untransformed data, see S5.

8 The structure-specific data on the d’ scores violated multiple assumptions of ANOVA (see S6 for the tests of assumptions). Consequently, we adopted the robust repeated measures ANOVA on trimmed means (20% trimming level) because it is more robust and allows for heteroscedasticity (Larson-Hall, Reference Larson-Hall2016; Wilcox & Schönbrodt, Reference Wilcox and Schönbrodt2014).

9 The binary categorical variables were centered to avoid multicollinearity. The reference level for Grammaticality was “Ungrammatical” and that for Condition was “Incidental.” For the variable Complexity we applied the sum to zero contrasts coding with the level “Simple” as the reference level.

10 Identical to the original study, we did not prepare two versions for each item, so it was impossible to include by-item slope for Grammaticality.

References

REFERENCES

Andringa, S. J., de Glopper, C. M., & Hacquebord, H. I. (2011). The effect of explicit and implicit instruction on free written response task performance. Language Learning, 61, 868903. https://doi.org/10.1111/j.1467-9922.2010.00623.x.CrossRefGoogle Scholar
Baron, R. M., & Kenny, D. A. (1986). The moderator-mediator variable distinction in social psychological research: Conceptual, strategic, and statistical considerations. Journal of Personality and Social Psychology, 51, 11731182. https://doi.org/10.1037/0022-3514.51.6.1173.CrossRefGoogle ScholarPubMed
Bates, D., Maechler, M., Bolker, B., & Walker, S. (2015). Fitting linear mixed-effects models using lme4. Journal of Statistical Software, 67, 148. https://doi.org/10.18637/jss.v067.i01.CrossRefGoogle Scholar
Booth, P., Clenton, J., & Van Herwegen, J. (2018). L1–L2 semantic and syntactic processing: The influence of language proximity. System, 78, 5464. https://doi.org/10.1016/j.system.2018.07.011.CrossRefGoogle Scholar
Bulté, B., & Housen, A. (2012). Defining and operationalizing L2 complexity. In Housen, A., Kuiken, F., & Vedder, I. (Eds.), Dimensions of L2 performance and proficiency: Complexity, accuracy and fluency in SLA (pp. 2146). John Benjamins.CrossRefGoogle Scholar
de Graaff, R. (1997). The Experanto experiment: Effects of explicit instruction on second language acquisition. Studies in Second Language Acquisition, 19, 249276.CrossRefGoogle Scholar
DeKeyser, R. M. (1995). Learning second language grammar rules: An experiment with a miniature linguistic system. Studies in Second Language Acquisition, 17, 379410. https://doi.org/0.1017/S027226310001425X.CrossRefGoogle Scholar
DeKeyser, R. M. (1998). Beyond focus on form: Cognitive perspectives on learning and practicing second language grammar. In Doughty, C., & Williams, J. (Eds.), Focus on form in classroom language acquisition (pp. 4263). Cambridge University Press.Google Scholar
DeKeyser, R. M. (2012). Interactions between individual differences, treatments, and structures in SLA. Language Learning, 62, 189200. https://doi.org/10.1111/j.1467-9922.2012.00712.x.CrossRefGoogle Scholar
DeKeyser, R. M. (2016). Of moving targets and chameleons. Studies in Second Language Acquisition, 38, 353363. https://doi.org/10.1017/S0272263116000024.CrossRefGoogle Scholar
Della Putta, P. (2016). The effects of textual enhancement on the acquisition of two nonparallel grammatical features by Spanish-speaking learners of Italian. Studies in Second Language Acquisition, 38, 217238. https://doi.org/10.1017/S0272263116000073.CrossRefGoogle Scholar
Dörnyei, Z. (2009). The psychology of second language acquisition. Oxford University Press.Google Scholar
Doughty, C., & Williams, J. (1998). Pedagogical choices in focus on form. In Doughty, C. & Williams, J. (Eds.), Focus on form in classroom second language acquisition (pp. 197261). Cambridge University Press.Google Scholar
Dryer, M. S., & Gensler, O. D. (2005). Order of object, oblique, and verb. In Dryer, M. S., & Haspelmath, M. (Eds.), The world atlas of language structures (pp. 342345). Oxford University Press.Google Scholar
Ellis, N. C. (2005). At the interface: Dynamic interactions of explicit and implicit language knowledge. Studies in Second Language Acquisition, 27, 305352. https://doi.org/10.1017/S027226310505014X.CrossRefGoogle Scholar
Ellis, N. C. (2006). Selective attention and transfer phenomena in L2 acquisition: Contingency, cue competition, salience, interference, overshadowing, blocking, and perceptual learning. Applied Linguistics, 27, 164194. https://doi.org/10.1093/applin/aml015.CrossRefGoogle Scholar
Ellis, R. (2005). Measuring implicit and explicit knowledge of a second language: A psychometric study. Studies in Second Language Acquisition, 27, 141172. https://doi.org/10.1017/S0272263105050096.CrossRefGoogle Scholar
Ellis, R., Loewen, S., Elder, C., Erlam, R., Philp, J., & Reinders, H. (2009). Implicit and explicit knowledge in second language learning, testing, and teaching. Multilingual Matters.Google Scholar
Faul, F., Erdfelder, E., Buchner, A., & Lang, A.-G. (2009). Statistical power analyses using G*Power 3.1: Tests for correlation and regression analyses. Behavior Research Methods, 41, 11491160.CrossRefGoogle ScholarPubMed
Fox, J., Weisberg, S., Price, B., Adler, D., Bates, D., Baud-Bovy, G., … R-Core. (2019). Car package version 3.0-5. https://r-forge.r-project.org/projects/car/ Google Scholar
Godfroid, A. (2016). The effects of implicit instruction on implicit and explicit knowledge development. Studies in Second Language Acquisition, 38, 177215. https://doi.org/10.1017/S0272263115000388.CrossRefGoogle Scholar
González, P., & Hernández, L. Q. (2018). Inherent aspect and L1 transfer in the L2 acquisition of Spanish grammatical aspect. Modern Language Journal, 102, 611625. https://doi.org/10.1111/modl.12502.CrossRefGoogle Scholar
Goo, J., Granena, G., Yilmaz, Y., & Novella, M. (2015). Implicit and explicit instruction in L2 learning: Norris & Ortega (2000) revisited and updated. In Rebuschat, P. (Ed.), Implicit and explicit learning of languages (pp. 443483). John Benjamins.CrossRefGoogle Scholar
Hamrick, P., & Sachs, R. (2018). Establishing evidence of learning in experiments employing artificial linguistic systems. Studies in Second Language Acquisition, 40, 153169. https://doi.org/10.1017/S0272263116000474.CrossRefGoogle Scholar
Hartsuiker, R. J., & Bernolet, S. (2017). The development of shared syntax in second language learning. Bilingualism: Language and Cognition, 20, 219234. https://doi.org/10.1017/S1366728915000164.CrossRefGoogle Scholar
Held, L., & Bové, D. S. (2014). Applied statistical inference: Likelihood and Bayes. Springer.CrossRefGoogle Scholar
Housen, A., Kuiken, F., & Vedder, I. (2012). Complexity, accuracy and fluency: Definitions, measurement and research. In Housen, A., Kuiken, F., & Vedder, I. (Eds.), Dimensions of L2 performance and proficiency (pp. 120). John Benjamins.CrossRefGoogle Scholar
Housen, A., Pierrard, M., & Van Daele, S. (2005). Rule complexity and the efficacy of explicit grammar instruction. In Housen, A. & Pierrard, M. (Eds.), Investigations of instructed second language acquisition (pp. 235269). Mouton de Gruyter.CrossRefGoogle Scholar
Housen, A., & Simoens, H. (2016). Introduction: Cognitive perspectives on difficulty and complexity in L2 acquisition. Studies in Second Language Acquisition, 38, 163175. https://doi.org/10.1017/S0272263116000176.CrossRefGoogle Scholar
Hui, B. (2020). Processing variability in intentional and incidental word learning: An extension of Solovyeva and Dekeyser (2018). Studies in Second Language Acquisition, 42, 327357. https://doi.org/10.1017/S0272263119000603.CrossRefGoogle Scholar
Hulstijn, J. H., & de Graaff, R. (1994). Under what conditions does explicit knowledge of a second language facilitate the acquisition of implicit knowledge? A research proposal. AILA Review, 11, 97112.Google Scholar
Hwang, H., Shin, J.-A., & Hartsuiker, R. J. (2018). Late bilinguals share syntax unsparingly between L1 and L2: Evidence from crosslinguistically similar and different constructions. Language Learning, 68, 177205. https://doi.org/10.1111/lang.12272.CrossRefGoogle Scholar
Ishikawa, K. (2019). Incidental and explicit learning of L2 derivational morphology and the nature of acquired knowledge. Applied Psycholinguistics, 40, 13771404. https://doi.org/10.1017/S0142716419000304.CrossRefGoogle Scholar
James, G., Witten, D., Hastie, T., & Tibshirani, R. (2013). An introduction to statistical learning: With applications in R. Springer.CrossRefGoogle Scholar
Kang, E. Y., Sok, S., & Han, Z. (2018). Thirty-five years of ISLA on form-focused instruction: A meta-analysis. Language Teaching Research, 23, 428523. https://doi.org/10.1177/1362168818776671.CrossRefGoogle Scholar
Kellerman, E. (1978). Giving learners a break: Native language intuitions as a source of predictions about transferability. Working Papers on Bilingualism, 15 (pp. 5992). Ontario Institute for Studies in Education.Google Scholar
Kim, K. M., & Godfroid, A. (2019). Should we listen or read? Modality effects in implicit and explicit knowledge. Modern Language Journal, 103, 648664. https://doi.org/10.1111/modl.12583.Google Scholar
Larson-Hall, J. (2016). A guide to doing statistics in second language research using SPSS and R (2nd ed.). Routledge.Google Scholar
Leow, R. P. (2019). ISLA: How implicit or how explicit should it be? Theoretical, empirical, and pedagogical/curricular issues. Language Teaching Research, 23, 476493. https://doi.org/10.1177/1362168818776674.CrossRefGoogle Scholar
Linck, J. A., & Cunnings, I. (2015). The utility and application of mixed-effects models in second language research. Language Learning, 65, 185207. https://doi.org/10.1111/lang.12117.CrossRefGoogle Scholar
Luke, K.-K., Liu, H. L., Wai, Y. Y., Wan, Y. L., & Tan, L. H. (2002). Functional anatomy of syntactic and semantic processing in language comprehension. Human Brain Mapping, 16, 133145. https://doi.org/10.1002/hbm.10029.CrossRefGoogle ScholarPubMed
MacWhinney, B., Bates, E., & Kliegl, R. (1984). Cue validity and sentence interpretation in English, German, and Italian. Journal of Verbal Learning and Verbal Behavior, 23, 127150.CrossRefGoogle Scholar
Maie, R. (2020, March). Eyeing acceptability judgment task: Automaticity moderates different L2 knowledge and processing. Paper presented at the American Association for Applied Linguistics, Denver, CO.Google Scholar
Maie, R., & DeKeyser, R. M. (2020). Conflicting evidence of explicit and implicit knowledge from objective and subjective measures. Studies in Second Language Acquisition, 42, 359382. https://doi.org/10.1017/S0272263119000615.CrossRefGoogle Scholar
Mair, P., & Wilcox, R. R. (2020). Robust statistical methods in, R using the WRS2 package. Behavior Research Methods, 52, 464488. https://doi.org/10.3758/s13428-019-01246-w.CrossRefGoogle ScholarPubMed
Marsden, E., Morgan-Short, K., Thompson, S., & Abugaber, D. (2018). Replication in second language research: Narrative and systematic reviews and recommendations for the field. Language Learning, 68, 321391. https://doi.org/10.1111/lang.12286.CrossRefGoogle Scholar
Miller, Z. F., & Godfroid, A. (2020). Emotions in incidental language learning: An individual differences approach. Studies in Second Language Acquisition, 42, 115141. https://doi.org/10.1017/S027226311900041X.CrossRefGoogle Scholar
Morgan-Short, K., Steinhauer, K., Sanz, C., & Ullman, M. T. (2012). Explicit and implicit second language training differentially affect the achievement of native-like brain activation patterns. Journal of Cognitive Neuroscience, 24, 933947. https://doi.org/10.1162/jocn_a_00119.CrossRefGoogle ScholarPubMed
Mukarami, A. (2016). Modeling systematicity and individuality in nonlinear second language development: The case of English grammatical morphemes. Language Learning, 66, 834871. https://doi.org/10.1111/lang.12166.Google Scholar
Neeleman, A., & Weerman, F. (1997). L1 and L2 word order acquisition. Language Acquisition, 6, 125170.CrossRefGoogle Scholar
Norris, J. M., & Ortega, L. (2000). Effectiveness of L2 instruction: A research synthesis and quantitative meta-analysis. Language Learning, 50, 417528. https://doi.org/10.1111/0023-8333.00136.CrossRefGoogle Scholar
Plonsky, L., & Oswald, F. L. (2014). How big is “big”? Interpreting effect sizes in L2 research. Language Learning, 64, 878912. https://doi.org/10.1111/lang.12079.CrossRefGoogle Scholar
Rankin, T. (2014). Variational learning in L2: The transfer of L1 syntax and parsing strategies in the interpretation of wh-questions by L1 German learners of L2 English. Linguistic Approaches to Bilingualism, 4, 432461. https://doi.org/10.1075/lab.4.4.02ran.CrossRefGoogle Scholar
Reber, A. S., Kassin, S. M., Lewis, S., & Cantor, G. W. (1980). On the relationship between implicit and explicit modes in the learning of a complex rule structure. Journal of Experimental Psychology: Human Learning and Memory, 6, 492502.Google Scholar
Rebuschat, P. (2008). Implicit learning of natural language syntax (Unpublished doctoral dissertation). University of Cambridge, Cambridge.Google Scholar
Rebuschat, P. (Ed.). (2015). Implicit and explicit learning of languages. John Benjamins.CrossRefGoogle Scholar
Rebuschat, P., & Williams, J. N. (2012). Implicit and explicit knowledge in second language acquisition. Applied Psycholinguistics, 33, 829856. https://doi.org/10.1017/S0142716411000580.CrossRefGoogle Scholar
Robinson, P. (1996). Learning simple and complex second language rules under implicit, incidental, rule-search and instructed conditions. Studies in Second Language Acquisition, 18, 2768.CrossRefGoogle Scholar
Ruiz, S., Tagarelli, K. M., & Rebuschat, P. (2018). Simultaneous acquisition of words and syntax: Effects of exposure condition and declarative memory. Frontiers in Psychology, 9, 1168. https://doi.org/10.3389/fpsyg.2018.01168.CrossRefGoogle ScholarPubMed
Schwartz, B. D., & Sprouse, R. A. (1994). Word order and nominative case in non-native language acquisition: A longitudinal study of (L1 Turkish) German interlanguage. In Hoekstra, T. & Schwartz, B. D. (Eds.), Language acquisition studies in generative grammar: Papers in honor of Kenneth Wexler from the 1991 Glow Workshop (pp. 317368). John Benjamins.CrossRefGoogle Scholar
Spada, N., & Tomita, Y. (2010). Interactions between type of instruction and type of language feature: A meta analysis. Language Learning, 60, 263308. https://doi.org/10.1111/j.1467-9922.2010.00562.x.CrossRefGoogle Scholar
Suzuki, Y. (2017). Validity of new measures of implicit knowledge: Distinguishing implicit knowledge from automatized explicit knowledge. Applied Psycholinguistics, 38, 12291261. https://doi.org/10.1017/S014271641700011X.CrossRefGoogle Scholar
Suzuki, Y., & DeKeyser, R. (2017). The interface of explicit and implicit knowledge in a second language: Insights from individual differences in cognitive aptitude. Language Learning, 67, 747790. https://doi.org/10.1111/lang.12241.CrossRefGoogle Scholar
Tagarelli, K., Ruiz, S., Moreno Vega, J. L., & Rebuschat, P. (2016). Variability in second language learning: The roles of individual differences, learning conditions, and linguistic complexity. Studies in Second Language Acquisition, 38, 293316. https://doi.org/10.1017/S0272263116000036.CrossRefGoogle Scholar
Tao, Y., & Williams, J. N. (2018). Generalization of syntactic knowledge in semiartificial language learning. Language Learning, 68, 10011031. https://doi.org/10.1111/lang.12299.CrossRefGoogle Scholar
Vafaee, P., Suzuki, Y., & Kachinske, I. (2017). Validating grammaticality judgment tests: Evidence from two new psycholinguistic measures. Studies in Second Language Acquisition, 39, 5995. https://doi.org/10.1017/S0272263116000097.CrossRefGoogle Scholar
Van Patten, B. (2003). Input processing in SLA. In VanPatten, B. (Ed.), Processing instruction: Theory, research, and commentary (pp. 532). Lawrence Erlbaum Associates.Google Scholar
Wilcox, R. R., & Schönbrodt, F. D. (2014). The WRS package for robust statistics in R (version 0.24) [Computer software]. http://r-forge.r-project.org/projects/wrs/ Google Scholar
Williams, J. N. (2005). Form-focused instruction. In Hinkel, E. (Ed.), Handbook on research in second language teaching and learning (pp. 673691). Lawrence Erlbaum Associates.Google Scholar
Williams, J. N. (2010). Initial incidental acquisition of word order regularities: Is it just sequence learning? Language Learning, 60, 221244. https://doi.org/10.1111/j.1467-9922.2010.00607.x.CrossRefGoogle Scholar
Williams, J. N., & Evans, J. (1998). What kind of focus and on which forms? In Doughty, C. & Williams, J. (Eds.), Focus on form in classroom second language acquisition (pp. 139155). Cambridge University Press.Google Scholar
Williams, J. N., & Kuribara, C. (2008). Comparing a nativist and emergentist approach to the initial stage of SLA: An investigation of Japanese scrambling. Lingua, 118, 522553. https://doi.org/10.1016/j.lingua.2007.03.003.CrossRefGoogle Scholar
Yuen, K. K. (1974). The two sample trimmed t for unequal population variances. Biometrika, 61, 165170.CrossRefGoogle Scholar
Figure 0

TABLE 1. Grammaticality of sentence patterns in training and test stimuli in German/English/Chinese

Figure 1

TABLE 2. Accuracy (%) and d’ scores on the GJT for each group overall and according to complexity

Figure 2

FIGURE 1. Effects of group and complexity on GJT performance.

Figure 3

TABLE 3. Post hoc pairwise comparisons on trimmed means (20% trimming level)

Figure 4

TABLE 4. Summary of findings on RQ1: Original study and replication contrasted

Figure 5

TABLE 5. Overview of the sentence-production data

Figure 6

TABLE 6. Summary of binary mixed-effects logistic regressions model comparisons

Figure 7

TABLE 7. Summary of Model 4

Figure 8

TABLE 8. Summary of findings about RQ2: original study and this study contrasted

Supplementary material: File

Gao and Ma supplementary material

Gao and Ma supplementary material

Download Gao and Ma supplementary material(File)
File 201.1 KB