Hostname: page-component-745bb68f8f-s22k5 Total loading time: 0 Render date: 2025-02-11T01:31:34.190Z Has data issue: false hasContentIssue false

Infant statistical-learning ability is related to real-time language processing*

Published online by Cambridge University Press:  19 July 2017

JILL LANY*
Affiliation:
Department of Psychology, University of Notre Dame
AMBER SHOAIB
Affiliation:
Department of Psychology, University of Notre Dame
ABBIE THOMPSON
Affiliation:
Department of Psychology, University of Notre Dame
KATHARINE GRAF ESTES
Affiliation:
Department of Psychology, University of California-Davis
*
Address for correspondence: Jill Lany, Department of Psychology, Haggar Hall, Notre Dame, IN 46556. e-mail: jlany@nd.edu
Rights & Permissions [Opens in a new window]

Abstract

Infants are adept at learning statistical regularities in artificial language materials, suggesting that the ability to learn statistical structure may support language development. Indeed, infants who perform better on statistical learning tasks tend to be more advanced in parental reports of infants’ language skills. Work with adults suggests that one way statistical learning ability affects language proficiency is by facilitating real-time language processing. Here we tested whether 15-month-olds’ ability to learn sequential statistical structure in artificial language materials is related to their ability to encode and interpret native-language speech. Specifically, we tested their ability to learn sequential structure among syllables (Experiment 1) and words (Experiment 2), as well as their ability to encode familiar English words in sentences. The results suggest that infants' ability to learn sequential structure among syllables is related to their lexical-processing efficiency, providing continuity with findings from children and adults, though effects were modest.

Type
Articles
Copyright
Copyright © Cambridge University Press 2017 

INTRODUCTION

Spoken language is a part of virtually all hearing children's daily experience, and they clearly learn what they hear. The words that are most prevalent in children's language input are learned relatively early (Huttenlocher, Haight, Bryk, Seltzer & Lyons, Reference Huttenlocher, Haight, Bryk, Seltzer and Lyons1991), and most of children's early multiword utterances are imitations of adult productions (Lieven, Behrens, Speares & Tomasello, Reference Lieven, Behrens, Speares and Tomasello2003). However, beyond learning specific words and phrases from the speech they hear, infants may also be learning about the statistical patterns of their language. There are many instances of sequential statistical structure in spoken language, such as the fact that syllables that reliably co-occur are likely to belong to the same word, while syllables that rarely co-occur are more likely to span word boundaries (e.g. Swingley, Reference Swingley2005). Beyond syllable-level statistics, words are ordered in highly predictable ways in many languages. For example, in English, determiners tend to co-occur with nouns, while pronouns and auxiliaries tend to co-occur with verbs (e.g. Mintz, Newport & Bever, Reference Mintz, Newport and Bever2002).

Even very young infants can learn novel sequential statistics in artificial languages with just a few minutes of exposure (see Lany & Saffran, Reference Lany and Saffran2013, for a review). Much work has focused on infants’ ability to learn a form of sequential structure referred to as a transitional probability, or TP. The TP between two elements, X and Y, is computed by dividing the frequency of XY by the frequency of X, yielding the probability that if X occurs, Y will also occur. Transitional probabilities between syllables tend to be higher within words than between them, and thus provide information that is relevant to segmenting word forms within fluent speech. Infants can track TPs between syllables in synthesized and naturally produced speech, suggesting that it may be one mechanism by which word forms are segmented (e.g. Aslin, Saffran & Newport, Reference Aslin, Saffran and Newport1998; Pelucci, Hay & Saffran, Reference Pelucci, Hay and Saffran2009; Saffran, Aslin & Newport, Reference Saffran, Aslin and Newport1996). Furthermore, infants map sequences with high internal TPs to referents, but do not do so for sequences that are equally frequent but have low internal TPs (Graf Estes, Reference Graf Estes2012; Graf Estes, Evans, Alibali & Saffran, Reference Graf Estes, Evans., Alibali and Saffran2007; Hay, Pelucci, Graf Estes & Saffran, Reference Hay, Pelucci, Graf Estes and Saffran2011). The fact that infants selectively treat high TP sequences as labels suggests that sequential learning can yield word-like representations, and that such learning is potentially relevant to lexical development.

Infants can also track sequential dependencies at higher levels of language structure, such as the likelihood that words or word categories will co-occur (e.g. determiner–noun and auxiliary–verb co-occurrence relationships). Many studies have used an aX bY artificial language, in which as are followed by Xs and bs by Ys but not vice versa, to study this sequential structure. Infants learn these co-occurrence relationships best when Xs and Ys can be distinguished by salient perceptual cues, such as differences in their phonological characteristics (Gerken, Wilson & Lewis, Reference Gerken, Wilson and Lewis2005; Gómez & Lakusta, Reference Gómez and Lakusta2004; Lany & Gómez, Reference Lany and Gómez2008). By 12 months of age infants can learn the aX bY structure, even when it is probabilistic (Gómez & Lakusta, Reference Gómez and Lakusta2004). Moreover, by 22 months of age, infants can use the correlated phonological and distributional cues in the aX bY language to help them form robust word–referent associations, and to guide their interpretation of novel words when reference is ambiguous (Lany & Saffran, Reference Lany and Saffran2010). Specifically, infants were given auditory experience with an aX bY language as described above (the Experimental Group), or with a version of the language in which distributional and phonological cues did not reliably cue category membership (the Control Group). Both groups were then trained on pairings between the languages’ word-categories and pictures of animals and vehicles (e.g. aX-phrases labeled animals, and bY-phrases labeled vehicles), and tested on trained and novel picture–word pairings. Only Experimental infants learned the associations between words and pictures they were trained on and generalized the pattern to novel pairings, suggesting that statistical cues were extremely important to word learning. These findings parallel those from studies of native language development, which show that children use distributional cues to help them establish whether words refer to objects or actions by 24 months of age (Waxman, Lidz, Braun & Lavin, Reference Waxman, Lidz, Braun and Lavin2009).

These studies suggest that infants are sensitive to sequential statistical information that has potential to be highly informative for acquiring their native language. A handful of studies have directly examined whether infants' ability to detect novel sequential statistics presented in the auditory modality is related to native language development, typically their scores on the MacArthur-Bates Communicative Development Inventory (MCDI), which is a normed parent-report measure of vocabulary size. Some researchers have approached this question by testing whether infants' ability to learn phonotactic patterns that are not allowed in their native language is related to their native language proficiency. For example, 19-month-old infants with larger vocabularies were less likely to learn phonotactic patterns that are illegal in English (i.e. words beginning with the sound sequences tl, ps, fw, and shn) than infants who knew fewer English words (Graf Estes, Gluck & Grimm, Reference Graf Estes, Gluck and Grimm2016). Infants with larger vocabularies are also less likely to learn words with illegal phonotactic patterns as object labels (Graf Estes, Edwards & Saffran, Reference Graf Estes, Edwards and Saffran2011). These findings suggest that infants who know more words in their native language tend to be more entrenched in their native language phonotactics, resistant to learning patterns that do not conform to them.

Another line of work has investigated the ability to learn statistical regularities in non-linguistic materials, thus avoiding the problem of native language interference. For example, Shafto, Conway, Field, and Houston (Reference Shafto, Conway, Field and Houston2012) tested whether 8·5-month-olds’ ability to learn a visuospatial sequence was related to concurrent and later parent-report measures of language development. Infants were presented with triplets of shapes, with each triplet occurring in a consistently ordered spatial configuration. Infants’ visual sequence learning was related to both gesture use and receptive vocabulary size, though the effects were modest and somewhat inconsistent across the ages tested (see also Ellis, Robledo Gonzalez & Deak, Reference Ellis, Robledo Gonzalez and Deak2013).

In a related study, Kidd and Arciuli (Reference Kidd and Arciuli2016) tested whether six- to eight-year-olds' ability to learn non-linguistic visual sequences was related to their performance on a sentence comprehension task. The statistical learning task tapped children's ability to learn high TP triplets within a series of cartoon-like aliens. The sentence comprehension task involved picking the picture described by a sentence (i.e. “Which chicken is being kissed by the mouse?”). Children who exhibited better visual sequence learning also demonstrated better comprehension of the most challenging sentences, even when controlling for measures of their working memory capacity and non-verbal IQ. These findings provide compelling evidence that statistical learning ability is related to language proficiency in childhood. The work is also notable in its use of a behavioral test to measure specific aspects of native-language proficiency, rather than more general language outcome measures.

Other studies have approached the task of investigating relations between statistical learning and language development by using artificial languages that contain statistical regularities that are novel, but that do not directly violate native language structures. For example, Lany tested whether infants who are better able to capitalize on statistical regularities in an aX bY language, which is modeled after determiner–noun and auxiliary–verb co-occurrence relations in English, are also more advanced in their native language development (Lany, Reference Lany2014; Lany & Safffran, Reference Lany and Saffran2011). By 22 months of age infants use the statistical structure in the language to help them learn word–referent mappings (Lany & Saffran, Reference Lany and Saffran2010). Importantly, infants who are better able to use probabilistic sequential regularities to learn word–referent mappings in an artificial language are more likely to have begun combining words regularly in their own speech (Lany, Reference Lany2014). In addition, even though infants benefit from both distributional and phonological cues in these tasks, infants with larger vocabularies are more likely to rely on word-order cues to generalize (i.e. using the determiner-like word to help establish reference), while infants with smaller vocabularies are more likely to use the phonological properties of the words to do so (Lany & Saffran, Reference Lany and Saffran2011).

One advantage to using artificial languages that are constructed to parallel natural language is that it allows researchers to test connections between native language development and learning statistical regularities that clearly tap processes relevant to language learning. For example, by studying how infants use statistical regularities to learn words, the studies described above may shed light on developmental differences in the accessibility of different kinds of cues in word learning tasks (Lany, Reference Lany2014; Lany & Saffran, Reference Lany and Saffran2011). We note that there is convergence in results across tasks, as the relations observed in studies using artificial languages modeled after natural languages are consistent with those from work examining relations between tasks assessing statistical learning in the visual modality across infancy and childhood (Kidd & Arciuli, Reference Kidd and Arciuli2016; Shafto et al., Reference Shafto, Conway, Field and Houston2012), as well as with the relations between infants’ knowledge of native language statistics and vocabulary development (Graf Estes et al., Reference Graf Estes, Gluck and Grimm2016).

Altogether, the studies reviewed above provide empirical support for the hypothesis that infants’ ability to learn sequential structure plays a role in language development. Specifically, they suggest that performance on tasks assessing sequential statistical learning are related to parent-report measures of language development, such as vocabulary size and grammatical development. They are also consistent with evidence that sensitivity to sequential statistical structure is related to real-time language processing in adults. For example, Farmer, Christiansen, and Monaghan (Reference Farmer, Christiansen and Monaghan2006) tested how quickly adults recognize nouns as a function of their phonological properties. The target nouns occurred in sentence contexts that provided sequential cues that unambiguously suggested the next word should be a noun (e.g. “The curious boy saved the ____”). Participants were faster to recognize target nouns when they had phonological characteristics that were more typical of nouns (e.g. marble) than when their phonology was more typical of verbs (e.g. insect). Likewise, Conway, Bauernschmidt, Huang, and Pisoni (Reference Conway, Bauernschmidt, Huang and Pisoni2010) found that adults who were better able to learn novel sequential patterns in artificial grammars were more likely to capitalize on sequential structure to recognize words in native language speech.

Conway et al. (Reference Conway, Bauernschmidt, Huang and Pisoni2010) interpret these findings as evidence that individuals with better statistical learning abilities are better able to learn sequential structure in their native language, and in turn are able to capitalize on such structure to facilitate encoding and interpreting speech. However, we do not know whether statistical learning skill is related to speech processing during language acquisition. Infants' lexical-processing efficiency, or LPE, can be tested by presenting a familiar ‘target’ word (e.g. “Find the doggie!”) in conjunction with a picture array containing the target referent (i.e. a dog) and a distractor (e.g. a baby). Relative to adults, infants and children have poor LPE, recognizing words more slowly and less accurately, but LPE improves substantially across the second year of life (Fernald, Pinto, Swingley, Weinberg & McRoberts, Reference Fernald, Pinto, Swingley, Weinberg and McRoberts1998; Fernald, Swingley & Pinto, Reference Fernald, Swingley and Pinto2001). It is possible that infants’ ability to track sequential statistics facilitates gains in LPE.

Thus, in the current work, we tested the hypothesis that infants' sequential statistical learning ability is related to LPE across two experiments using well-studied artificial languages that contain sequential structure. In both experiments we assessed LPE using the task described above, which has been used extensively to study individual differences in real-time processing in infants of the ages we tested here (e.g. Fernald, Perfors & Marchman, Reference Fernald, Perfors and Marchman2006; Fernald, Zangl, Portillo & Marchman, Reference Fernald, Zangl, Portillo, Marchman, Sekerina, Fernández and Clahsen2008). In Experiment 1, we tested whether infants' performance on a word segmentation task is related to their performance on the LPE task. In Experiment 2, we tested whether infants’ ability to track word-order patterns is related to their LPE.

EXPERIMENT 1

We asked whether infants who are better able to learn sequential structure in speech are better able to recognize word forms quickly in real time. We tested statistical learning using an artificial language that was composed of a continuous stream of syllables. Within the stream there were syllable sequences with both High and Low TPs (HTP, LTP; Graf Estes et al., 2007). We tested whether infants were able to track the TPs by asking whether they discriminate between the HTP and LTP syllable sequences using the Head-Turn Preference Procedure (HTPP). In the HTTP, infants' ability to discriminate between two types of stimuli is assessed through preferential listening. Learning can be expressed both as a novelty preference and a familiarity preference, depending on many factors, such as the social relevance of the stimuli, and infants’ age and encoding ability (Houston-Price & Nakia, Reference Houston-Price and Nakia2004; Hunter & Ames, Reference Hunter and Ames1988).

The HTPP is the task most commonly used to study infant statistical learning in the auditory modality. Thus, there is a great deal of value in using this task to measure statistical learning in the current work. The HTPP has predominantly been used to test language learning and development by examining group-level performance. However, it has recently been used to test whether there is a connection between individual infants’ speech segmentation ability and standardized assessments of their language development (Newman, Ratner, Jusczyk & Jusczyk, Reference Newman, Ratner, Jusczyk and Jusczyk2006; Newman, Rowe & Ratner, Reference Newman, Rowe and Ratner2016; Singh et al., Reference Singh, Reznik and Xuehua2012; see also Graf Estes et al., Reference Graf Estes, Gluck and Grimm2016). Thus, the HTPP may also reveal meaningful individual differences in sensitivity to TPs relevant for word segmentation in the current experiment.

Furthermore, by examining whether there are individual differences in infants’ preference patterns (i.e. novelty versus familiarity, and magnitude of preference), this work may shed light on failures to replicate prior findings. In developmental research with the HTPP, researchers typically focus on tight age ranges in an effort to reduce individual variability, but age is not necessarily the best predictor of language development. Thus, including characteristics of the sample that pertain to language development (e.g. LPE) may help researchers determine whether differences in results are related to differences in language skills across samples, or whether to create groups based on age and language competence rather than on age alone.

To measure LPE, we used a task designed by Fernald and colleagues that assesses infants' ability to use spoken language to find matching visual objects (see Fernald et al., Reference Fernald, Zangl, Portillo, Marchman, Sekerina, Fernández and Clahsen2008, for an overview of the method). On a given trial, infants might view a picture of a dog and a picture of a baby, and hear “Find the baby!”. The words selected for testing are highly likely to be known by infants and toddlers at this age, and thus our participants were likely to be able to find the correct picture. Of primary interest is the speed with which they did so. We chose this task because it has been used extensively (e.g. Fernald et al., Reference Fernald, Pinto, Swingley, Weinberg and McRoberts1998; Fernald et al., Reference Fernald, Perfors and Marchman2006), and because infants with larger vocabularies at 15 months tend to process speech more rapidly by the time they are about two years of age (Fernald et al., Reference Fernald, Perfors and Marchman2006).

We tested 15-month-old infants because this is the youngest age at which Fernald recommends using the LPE task (Fernald et al., Reference Fernald, Zangl, Portillo, Marchman, Sekerina, Fernández and Clahsen2008). The artificial language we chose had not previously been used with 15-month-olds. Thus, we did not have an a priori prediction for the direction of preference that would be associated with better statistical learning. Rather, we predicted that the magnitude of infants’ preferences would be associated with their performance on the LPE task.

METHODS

Participants

Participants were thirty-eight English-learning infants between 15·0 and 16·0 months of age (21 were female; see Table 1 for additional descriptive statistics). Infants were free of hearing, vision, or language-development problems based on parental report. Infants were not eligible if they were born before 36 weeks gestation, or weighed less than 5lbs 5oz at birth. An additional sixteen infants were tested but their data were not included due to failure to contribute sufficient data on the LPE (n = 5) or statistical learning tasks (n = 5), due to fussiness or inattention, equipment failure or experimenter error (n = 4), falling asleep (n = 1), or because their scores on any of the measures were greater than 3 standard deviations from the mean (n = 1).Footnote 1 Parental consent was obtained for all participants. Participants were given a children's book or $15 for participating in the study.

Table 1. Experiment 1 descriptive statistics

note: The Words understood raw and percentile scores came from the MCDI, the SL (Statistical Learning) Task Preference score reflects their preference for listening to words over part-words (a proportion score), and the LPE Task RT reflects the speed with which they used English words to find target pictures.

Materials and procedure

Infants were first trained and tested on the statistical learning task, followed by the LPE task. A parent or caregiver accompanied the infant for the duration of the experiment, and they were instructed not to talk during that time. Both tasks took place in an 8′ × 8′ sound-attenuated room. A 60″ LCD screen was mounted on the central wall, and a 20″ monitor with a speaker behind it was mounted on each side wall, all at the height of 31″. A chair for the parent was located approximately 3 feet away from the central screen. A digital video camera was mounted flush with the lower edge of the 60″ central screen so that infants’ visual attention could be monitored and recorded to a computer hard drive when seated on their parent's lap. An experimenter administered the tasks from outside the room via a PC computer running custom software. The experimenter was blind to the experimental stimuli and monitored infant visual attention via the digital video feed.

Statistical learning task materials and procedure

The materials and procedure for the statistical learning task were the auditory stimuli used in Experiment 2 of Graf Estes et al. (Reference Graf Estes, Evans., Alibali and Saffran2007). Specifically, during the Auditory Familiarization phase, infants listened to a 5·5-minute stream of concatenated consonant–vowel syllables while playing quietly on the floor of the sound-attenuated room. The Familiarization stream was composed of eight unique syllables that were combined into four statistically defined disyllabic ‘words’. There were two counterbalanced languages: Language 1 and Language 2. Language 1 words were timay, dobu, gapi, and moku, and Language 2 words were pimo, kuga, buti, and maydo. The materials were generated by a trained female speaker who recorded sets of three-syllable sequences. The middle syllables from these sequences were spliced into a fluent speech stream (e.g. the sequences timaydo and maydobu were spliced to form the sequence maydo). This technique preserved appropriate coarticulation contexts for the target syllables, both within and across words, but resulted in no pauses or other reliable acoustic cues to word boundaries.

In each language, two of the words occurred with relatively high frequency. For example, in Language 1, timay and dobu occurred 180 times, while gapi and moku occurred 90 times. The high-frequency words occurred together in sequence 90 times (e.g. timaydobu; dobutimay), creating ‘part-word’ sequences that recurred across word boundaries 90 times (e.g. maydo; buti). At Test, we used these frequently occurring part-words, as well as the low-frequency words (e.g. gapi and moku), which occurred with equal frequency. Importantly, the test part-words had internal TPs of ·5, but the test words had internal TPs of 1·0.

Infants’ listening preferences for low-frequency words vs. part-words was assessed using the HTPP. On each test trial a word or part-word was repeated up to twenty-five times. For example, on the gapi test trial, four different tokens of gapi were repeated in a random order separated by 0·5 sec pauses. Each test trial was presented twice, for a total of eight test trials. All infants heard the same set of test trials, but the sequences that were words for an individual infant familiarized with Language 1 were part-words for infants familiarized with Language 2, and vice versa. This allowed us to control for arbitrary listening preferences.

During the test phase, the experimenter used the custom software to control the presentation of auditory stimuli and record infants’ listening times. The infant was seated on a parent's lap, and each test trial began with a flashing red circle displayed on a monitor located in front of the infant. Once the infant focused on the screen, a GIF animation of a steam train appeared on a monitor 90 degrees to the right or left, and the center circle disappeared. When the infant turned his or her head at least 30 degrees toward the side stimulus, the experimenter signaled the computer to present an auditory stimulus. The auditory stimulus continued to play until the infant looked away for at least two consecutive seconds, or until 35 seconds had elapsed. To train infants on the contingency between their looking behavior and the presentation of auditory stimuli, the test phase began with two trials that consisted of music.

Lexical-processing efficiency (LPE) task materials and procedure

After infants completed the statistical learning task they were given a short break (if needed) before we assessed their LPE. The LPE task tested infants’ ability to find the referents of several common English words (see Fernald et al., Reference Fernald, Zangl, Portillo, Marchman, Sekerina, Fernández and Clahsen2008, for an overview). On each trial, two realistic color pictures of familiar animals or objects appeared in the lower right and left corners of the central monitor. After 3 seconds, infants heard the phrase “Find the [target word]” and the trial continued for approximately 3 seconds. The phrases were naturally produced in an infant-directed register by a female speaker.

The target words were kitty, doggie, birdie, baby, car, and shoe. Infants were tested on each target word four times, with side of target presentation counterbalanced. Each picture occurred equally often as a target and a foil. After every fourth trial, infants viewed a reward trial consisting of a colorful image moving across the screen paired with encouraging remarks, such as “Good job!” and “Way to go!”. Infants' looking behavior during this task was recorded directly to a hard drive for coding and analysis.

Vocabulary size assessment

After the experiment, parents completed the MacArthur-Bates Communicative Development Inventory (MCDI): Words and Gestures. This form contains a set of lexical items that are typically learned within the first three years, and we used a count of the words that parents indicated that their child understood (receptive vocabulary).

RESULTS AND DISCUSSION

Data preparation

Because the minimum time needed to hear a word or part-word sequence was approximately 1 second, trials on which infants listened for less than a second were not included. Infants who did not contribute at least two usable test trials of each type (word and part-word) were excluded from analysis. We used a proportional preference measure to capture performance on the statistical learning task. A preference score for each infant was created by dividing her average listening time to word trials by the sum of her average listening time to word plus part-word trials. Thus, scores above ·5 indicate a stronger preference for words over part-words, and the higher the number the greater their word preference. The proportional preference measure normalizes across differences in overall listening time.Footnote 2 While both words and part-words had been heard equally often during Familiarization, the words contained more reliable statistical structure, and can therefore be thought of as relatively ‘familiar’. The part-words, because they consisted of less reliable transitions, were relatively ‘novel’.

The recordings of the LPE task were coded frame-by-frame off-line by trained observers naive to the content of each trial using iCoder software developed by Fernald et al. (Reference Fernald, Zangl, Portillo, Marchman, Sekerina, Fernández and Clahsen2008). Data from 10% of participants were randomly selected to be independently coded. Agreement between coders within a single frame was greater than 98%. Following Fernald and colleagues (2008), LPE was assessed on trials during which infants were looking to the distractor picture at label onset. In particular, the time it took to initiate a gaze shift to the correct picture was averaged across those trials to obtain a reaction time (RT) score. To increase the likelihood that shifts were related to hearing the target word, trials on which infants shifted before 367 ms had elapsed, the minimum time needed to plan and initiate an eye-movement in response to auditory information, or after 2200 ms, were not included. Using this task, Fernald has found that correlational analyses between RT calculated over all six words presented and MCDI scores do not yield different results from those using RT calculated using only words that parents reported their infant to understand (Fernald et al., Reference Fernald, Perfors and Marchman2006). Calculating RT over more trials is likely to provide a more stable estimate, and thus we included trials testing all six words. Data from infants who did not contribute at least two usable trials over which to calculate a mean RT score were excluded from analysis.

Preliminary analyses

All tests in this experiment were two-tailed and alpha levels were set to ·05. Preliminary analyses revealed no effects of artificial language version on infants' preferences for words, and no effects of sex for this measure or for the LPE and vocabulary size measures. These factors were thus not included in further analyses. A one-sample t-test indicated that infants as a whole did not show a preference for either test trial type (M = ·5, SE = ·02, t(37) = ·11, p = ·92). We will address the implications of this finding in the context of the results of the correlational analyses.

Relations between performance on the statistical learning and LPE tasks

Our primary question was whether individual differences in statistical learning are related to LPE. We addressed this question by testing how these factors covary. We found that they are negatively correlated, such that infants who showed stronger familiarity preferences had shorter RTs on the LPE task (see Table 2; Figure 1). Infants' preferences can change over the course of testing, and we found that the relation between preferences and LPE was stronger for the first test block (r(36) = –0·33, p = ·04) than in the second block (r(36) = –0·25, p = ·14). However, an ANCOVA indicated that there was no interaction between these factors (F(1,34) = 0·47, p = ·5), and thus we did not break down the preference score by block in subsequent analyses.

Fig. 1. Infants preference scores are calculated as the proportion of time spent listening to syllable sequences that had high TPs divided by total listening time.

Table 2. Experiment 1 correlations

notes: Infants' receptive vocabulary size (raw) was measured using the MCDI, SL (Statistical Learning) Task Preference by the proportion score reflecting their preference for words over part-words, and the LPE by the speed with which they used spoken words to find target pictures; * p < ·05.

Because previous work suggests that infants' LPE scores are related to age and vocabulary size (Fernald et al., Reference Fernald, Pinto, Swingley, Weinberg and McRoberts1998, Reference Fernald, Perfors and Marchman2006), we also wanted to account for variance in LPE that was related to these factors. To that end, we performed a hierarchical regression in which we tested whether statistical learning scores account for unique variance in LPE. In the Control Model, we entered age and vocabulary size as control variables, and performance on the statistical learning task was entered in the next block of the analysis to test the SL model. The dependent variable was performance on the LPE Task. The Control Model including age and vocabulary size did not predict variance in LPE (R 2 = ·11, F(2,35) = 2·15, p = ·13) (see Table 3). Critically, adding statistical learning to the model (the SL Model in Table 3) accounted for significant additional variance in LPE (ΔR 2 = ·12, ΔF (1,34) = 5·02, p = ·03) (see also Figure 1).

Table 3. Regression models

These findings suggest that infants’ performance on a test of their ability to learn statistical structure in speech was selectively related to their native language LPE. Specifically, infants with stronger preferences for word test items were faster on the LPE task, but neither preference scores nor LPE were related to age. Furthermore, infants' vocabulary size predicted their LPE, but performance on the statistical learning task accounted for additional variance.

As can be seen in Figure 1, infants who showed a familiarity preference tended to have faster LPE scores, and those with a novelty preference tended to have slower LPE scores. However, when the same artificial language was used to test TP learning in younger infants using a central fixation auditory preference procedure (Graf Estes, Reference Graf Estes2012), they showed a novelty preference at test. It is beyond the scope of the current work to determine why our findings differed from those of Graf Estes (i.e. whether the difference is related to the specific method we used, to the age of our participants, etc.). However, given that we did not predict a direction of preference a priori, we note the possibility of a type 1 error, or that we obtained this correlation by chance. Thus, Experiment 2 was designed as a conceptual replication in which we asked if sequential statistical learning and LPE are related when using different artificial-language materials that incorporate sequential statistics relevant to learning word-order patterns.

EXPERIMENT 2

The goal of Experiment 2 was to test whether infants’ ability to learn sequential statistical regularities across words is related to their LPE. To that end, we presented 15-month-olds with an artificial language containing aX bY word-order relationships. This language incorporates a sequential structure modeled after determiner–noun and auxiliary–verb co-occurrence relationships. Gómez and Lakusta (Reference Gómez and Lakusta2004) found that when 12-month-olds were familiarized to a similar aX bY language, and then tested on grammatical and ungrammatical strings, most infants showed a preference for the grammatical strings. Thus, in the current experiment we used the HTPP to assess infants’ ability to learn statistical regularities in the aX bY language, and assessed LPE as in Experiment 1. Based on the findings of Experiment 1, and the prior work with this artificial language, we predicted that infants showing a stronger preference for statistically robust structure (i.e. a stronger preference for familiar grammatical strings over ungrammatical ones) would also have better LPE.

METHODS

Participants

Participants were thirty infants between 15·0 and 15·9 months of age (16 female; see Table 3 for additional descriptive statistics). The inclusion criteria were the same as in Experiment 1. An additional fourteen infants were tested but their data were excluded for fussiness (n = 8), equipment failure and experimenter error (n = 4), or failure to contribute sufficient usable data (n = 2).Footnote 3

Materials and procedure

The entire experiment took place in the sound-attenuated booth from Experiment 1, and the general structure was parallel, such that infants were trained and tested on an artificial language-learning task, then tested on the LPE task, and finally caregivers filled out the MCDI: Words and Gestures form.

Statistical learning task

During the Familiarization phase, infants were seated on a parent's lap as the artificial language was played. There were two a-words (ong, erd) and two b-words (alt, ush), and eight each of the X- and Y-words. The X-words were disyllabic (coomo, fengle, kicey, loga, paylig, wazil, bevit, meeper), while the Y-words were monosyllabic (deech, ghope, jic, skige, vabe, tam, vot, rud). Infants were familiarized to one of two versions of the language. In Language 1, strings took the form aX and bY (e.g. erd coomo, ong kicey, ush deech, alt skige). In Language 2, the pairings were switched such that strings took the form aY and bX (e.g. erd deech, ong skige, ush coomo, alt kicey).

A key feature of this artificial language is that the X and Y categories were distinguished by correlated phonological and distributional cues. Using Language 1 to illustrate, disyllabic X-words followed a-words in phrases (e.g. ong coomo, erd coomo), whereas monosyllabic Y-words followed b-words (e.g. alt deech, ush deech). Thus, words that shared phonological properties also shared distributional properties. The presence of correlated cues is critical to grouping words into categories, and to learning their co-occurrence relationships in this aX bY language (Gómez & Lakusta, Reference Gómez and Lakusta2004; Lany & Gómez, Reference Lany and Gómez2008).

The materials were spoken by an adult female in an animated voice and digitized for editing. One token of each word was selected so that aX and bY phrases (or aY and bX phrases in Language 2) could be created by splicing tokens together separated by 0·1-sec pauses. Thus, each language contained thirty-two unique phrases (16 aX and 16 bY, or vice versa in Language 2). Each phrase occurred four times during the randomized Familiarization sequence, which lasted 3·5 minutes.

During Familiarization, infants were also trained on the contingency between their looking behavior and the appearance and disappearance of stimuli on the center and side monitors. When the infant was attentive, the experimenter used custom PC software to initiate the Familiarization stream and to control the presentation of the visual stimuli. At the same time, a flashing light was displayed on a monitor located in front of the infant. Once the infant was focused on the screen, the experimenter signaled the computer to present the steam train GIF on a monitor 90 degrees to the right or left, and the flashing light disappeared. The side stimulus was displayed until the infant looked away for more than two consecutive seconds, or more than 45 seconds had elapsed. At that point, the center stimulus reappeared and the process repeated. The auditory stream played continuously, while the front and side stimuli appeared contingent on infants’ looking behavior.

After Familiarization, infants were tested on their ability to discriminate between grammatical and ungrammatical strings. The test included four trials consisting of Language 1 strings (e.g. erd coomo, ush deech, ong fengle, alt ghope) and four trials consisting of Version 2 strings (e.g. erd deech, ush coomo, ong ghope, alt fengle). Thus, all of the words in the test trials were familiar to infants, but not all of the phrases conformed to the Familiarization language. For infants familiarized to Language 1, the Language 1 strings were Grammatical, and the Language 2 strings were Ungrammatical, and vice versa for infants familiarized to Language 2. Critically, infants could only distinguish between Language 1 and 2 strings if they had learned the co-occurrence relationships between the words.

Infants were presented with two blocks of test trials, each consisting of two Language 1 and two Language 2 trials, for a total of eight test trials. The custom software was used to control the presentation of the auditory stimuli and to record infants’ listening times, as in Experiment 1. Trials lasted until the infant looked away for more than two consecutive seconds or more than 45 seconds elapsed. The order and side of test-trial presentation were randomized.

RESULTS AND DISCUSSION

Data processing

The HTPP data were processed similarly to Experiment 1, with one exception. Because strings were approximately 1·5 seconds in duration, trials on which infants’ listening times were less than 1·5 seconds were not included in the calculation of their mean listening times. As in Experiment 1, infants who did not contribute at least two usable test trials of each type (Grammatical and Ungrammatical) were excluded from analysis. Infants' preference scores were calculated as in Experiment 1, as were their RTs on the LPE task. LPE data from 10% of participants were randomly selected and independently coded. Agreement between coders within a single frame was greater than 98%.

Preliminary analyses

Infants’ performance on the LPE and statistical-learning tasks did not differ as a function of sex or language version, and thus we collapsed across these variables in subsequent analyses. As in Experiment 1, infants did not show a significant preference for the familiar test items (M = ·51, SE = ·03; t(29) = ·2, p = ·4).

Relations between performance on the statistical learning and LPE tasks

Our main question was whether infants' statistical learning ability was related to their LPE. We used a one-tailed correlation for our initial analyses because we predicted that infants with greater familiarity preferences would also process native language speech more rapidly. Overall, infants’ preferences were negatively correlated with their LPE, but the strength of the relation was quite weak, and did not reach significance (r(29) = –0·13, p > ·05). As in Experiment 1, the relation was significant in the first block of testing (r(29) = –0·31, p = ·05), but not the second (r(25) = 0·21, p > ·05) (see Figure 2; Table 5). A two-tailed ANCOVA revealed an interaction between test block (Blocks 1 and 2) and LPE (F(1,24) = 4·24, p = ·05). Only Block 1 was related to SPE (B = –14·02, p = ·03) (Block 2: B = 4·35, p = ·39). Thus, we focused on Block 1 performance in subsequent analyses.

Fig. 2. Infants’ preference scores are calculated as the proportion of time spent listening to attested word-order patterns divided by total listening time.

To determine whether LPE was selectively related to statistical learning performance, we used a hierarchical regression in which infants’ age and receptive vocabulary size were entered in the first step, and statistical learning (Block 1) was entered in the second (see Table 4 for descriptives). The Control Model including age and vocabulary size did not explain significant variance in SPE (R 2 = ·07, F(2,27) = 0·95, p = ·4) (see Table 6). However, adding the infants’ statistical learning scores (the SL Model in Table 6) increased the model fit (ΔR 2 = ·14, ΔF(1,26) = 4·5, p = ·04).Footnote 4

Table 4. Experiment 2 descriptive statistics

note: As in Experiment 1, the Words Understood raw and percentile scores came from the MCDI, the SL (Statistical Learning) Task score reflects their preference for listening to Grammatical over Ungrammatical strings (a proportion score), and LPE RT reflects the speed with which they interpreted familiar English words.

Table 5. Experiment 2 correlations

notes: Receptive vocabulary size (raw) was measured using the MCDI, LPE by the speed with which they used English words to find target pictures, and SL (Statistical Learning) by their preference for listening to Grammatical over Ungrammatical strings in Block 1 (a proportion score); * p < ·1, ** p < ·05.

Table 6. Regression models

In sum, we found that 15-month-old infants’ LPE is related to their performance on a task assessing learning of word-level co-occurrence statistics in an artificial language. Specifically, infants’ preference for familiar grammatical strings vs. ungrammatical strings predicted their LPE, such that infants who tended to listen longer to familiar, grammatical strings processed familiar native language words more rapidly. This relation, while holding only in the first block of testing, was significant when including variance related to infants’ age and vocabulary size.

These findings parallel those from Experiment 1, in which infants with better LPE showed a preference for highly predictable syllable sequences. Thus, across two different tasks, two groups of infants showed the same pattern of performance. Infants who encoded native language speech more efficiently tended to listen longer to sequences that were consistent with the most robust statistical regularities in their Familiarization, while infants with slower LPE scores tended to listen longer to sequences that were inconsistent with that structure. In both experiments we also found relations between native language processing and statistical learning when including variance accounted for by age and vocabulary size. Nonetheless, the relations we observed are modest, and should be taken as a starting point for future research.

GENERAL DISCUSSION

We tested whether infants’ sequential statistical learning ability is related to their ability to encode and interpret native-language speech. We used two classic artificial language structures that tap infants' learning of sequential structure: one tapped statistical regularities holding across syllables, or TPs (Experiment 1), and the other tapped sequential regularities holding across words (Experiment 2). Previous work suggested that infants aged 15 months and younger are capable of learning the kinds of sequential structure found in both artificial languages (Aslin et al., Reference Aslin, Saffran and Newport1998; Gómez & Lakusta, Reference Gómez and Lakusta2004; Graf Estes, Reference Graf Estes2012; Graf Estes et al., 2007; Lany & Gómez, Reference Lany and Gómez2008). We assessed speech processing using a well-studied lexical-processing efficiency (LPE) task in which infants heard words that were likely to be familiar to them. The words were presented in simple ostensive labeling phrases that are common in speech to infants (Cameron-Faulkner, Lieven & Tomasello, Reference Cameron-Faulkner, Lieven and Tomasello2003).

In Experiment 1, we found that infants who showed stronger preferences for word-like units from an artificial language also performed better on the LPE task. In Experiment 2, we found that infants who showed stronger preferences for attested word-order patterns also had better LPE, though infants' preferences faded over the course of testing. The relations we observed were of modest strength, but there was consistency in the results. Together, these findings suggest that infants’ ability to learn novel sequential regularities is related to their ability to encode and interpret fluent speech from their native language.

While we found evidence that sequential statistical learning abilities and LPE are related, we should note that we did not predict that better statistical learning would be evidenced as a novelty preference; in fact, we did not predict a specific direction of preference. Researchers often remain uncommitted about the likely direction of preference that they will obtain in a given HTPP experiment, as both familiarity and novelty preferences are common. For example, Gómez and Lakusta (Reference Gómez and Lakusta2004) found familiarity preferences using an aX bY language similar to the one used in Experiment 2, as we did, while other studies using artificial language materials have found novelty preferences (e.g. Marcus et al., Reference Marcus, Vijayan, Rao and Vishton1999; Saffran et al., Reference Saffran, Aslin and Newport1996). Some studies have also reported both familiarity and novelty preferences across related experiments as a function of age and training or testing conditions (e.g. Gómez & Maye, Reference Gómez and Maye2005; Saffran & Thiessen, Reference Saffran and Thiessen2003).

Nonetheless, interpreting our HTPP results requires careful consideration. In two recent studies, stronger familiarity preferences on a segmentation task using natural language materials were associated with better native language development (Newman et al., Reference Newman, Ratner, Jusczyk and Jusczyk2006; Singh, Reznik & Xuehua, Reference Singh, Reznik and Xuehua2012; but see Newman et al., Reference Newman, Rowe and Ratner2016, for a different result). Our finding that stronger familiarity preferences for attested structure in artificial-language materials were associated with better LPE is consistent with these studies, and with an interpretation in which better statistical learning is related to better LPE. While it is possible that greater novelty preferences actually reflected better learning, we suggest that this is unlikely: such an interpretation would mean that infants with better statistical learning skills were less proficient with their native language, but this pattern of findings is more characteristic in studies in which the novel statistical regularities are designed to violate native language phonotactics (e.g. Graf Estes et al., Reference Graf Estes, Edwards and Saffran2011, Reference Graf Estes, Gluck and Grimm2016). In such studies, the effects tend to become more pronounced with age. However, infants older than those we tested persist in tracking the statistical regularities in the artificial languages we used here (Graf Estes et al., Reference Graf Estes, Evans., Alibali and Saffran2007; Lany & Saffran, Reference Lany and Saffran2010), and better performance has also been associated with better language development (Lany, Reference Lany2014). The interpretation that greater familiarity preferences reflect better statistical learning, and consequently that better statistical learning is associated with better LPE, is also consistent with evidence that the ability to learn statistical regularities is related to language processing skills in adults (Conway et al., Reference Conway, Bauernschmidt, Huang and Pisoni2010; Misyak & Christiansen, Reference Misyak and Christiansen2012) and school-aged children (Kidd & Arciuli, Reference Kidd and Arciuli2016; Spencer, Kaschak, Jones & Lonigan, Reference Spencer, Kaschak, Jones and Lonigan2015).

In future work it will be important to continue to investigate how statistical learning is related to real-time language processing. Interpreting relations between HTPP tasks assessing statistical learning and other measures of language learning and development would be easier if group-level direction-of-preference effects were more straightforward and if it were possible to make a priori precise direction-of-preference predictions. In principle, the HTPP is well positioned to capture both group effects and individual variability (Kemler Nelson, Jusczk, Mandel, Myers, Turn & Gerken, Reference Kemler Nelson, Jusczk, Mandel, Myers, Turn and Gerken.1995), but the parameters of the task may need to be more extensively tailored to a given set of materials at a specific age or developmental level. Thus, it will be valuable to refine tests of infant sequential statistical learning in future work.

This issue notwithstanding, our data may have implications for studies using the HTPP that yield null results. A common approach taken to facilitate interpretation of direction of preference is to test infants in a narrow age range, or to contrast performance across several narrow age ranges. The current data suggest that, even if a group of infants at a specific age show no statistically reliable preference overall, it may not mean that infants failed to distinguish between different types of trials (in this case statistically attested vs. unattested structure). Infants' age is a good predictor of language learning development at a gross level (24-month-olds are generally more advanced than 18-month-olds, and 18-month-olds more advanced than 12-month-olds), but not necessarily at a month-to-month level. For example, vocabulary size is better than age at predicting LPE and when infants will begin to combine words in their own speech (Bates, Bretherton & Snyder, Reference Bates, Bretherton and Snyder1988; Borovsky, Elman & Fernald, Reference Borovsky, Elman and Fernald2012). Thus, one potentially promising approach in studies employing the HTPP, including efforts to replicate previous work, would be to match participant samples on factors other than age, including measures of language processing. Such an approach might yield more meaningful data, and facilitate replicating previous findings.

Another caveat is that the correlational design we used cannot support inferences about causation. Our goal was to study the naturally occurring covariation between statistical learning ability and speech processing as a means of supplementing the extensive body of tightly controlled experimental work on statistical learning. However, we cannot rule out the possibility that the observed relations were driven by a third underlying factor. For example, it is possible that infants who both processed speech efficiently and readily learned statistical regularities were simply more advanced across the board. We were able to partially address this limitation by building on previous findings that there is a relation between measures of infants’ vocabulary size, and measures of their LPE (Fernald et al., Reference Fernald, Pinto, Swingley, Weinberg and McRoberts1998, Reference Fernald, Perfors and Marchman2006; Weisleder & Fernald, Reference Weisleder and Fernald2013). Specifically, we found that statistical learning ability is also related to LPE, above and beyond the variance related to vocabulary size, providing some evidence for specificity in the relation. Nonetheless, it is possible that more advanced language skills like phonological encoding drive superior performance in both tasks. Thus, it would be valuable to test whether sequential processing in both artificial- and native-language materials is related to the precision of phoneme representations, as well as other aspects of phonological development. For example, it would be informative to test the precision of infants' phoneme representations, and the speed with which they access them, in conjunction with sequential statistical learning and LPE, to determine whether these factors explain additional variance in the measures, or in the relation between them. It would also be valuable to test relations between sequential statistical learning and LPE tasks that are tailored to tap different kinds of sequential structure. While a small number of studies have assessed differences in young children's processing efficiency for predictive vs. unpredictive structures or contexts, these tasks have been used minimally, and not at all with the younger ages tested here (Borovsky, Ellis, Evans & Elman, Reference Borovsky, Ellis, Evans and Elman2016; Mani & Huettig, Reference Mani and Huettig2012). Thus, testing slightly older infants might provide important insights into this question.

Considered together, we suggest that the findings from these two experiments provide a platform on which future research on connections between sequential statistical learning and language development can build. For example, a critical step in understanding the mechanisms of language learning will be to determine how speech-processing efficiency and statistical learning ability are related. In the ‘Introduction’ we suggested that statistical learning may support on-line speech processing, and specifically that the ability to detect and use predictive structure in speech via statistical learning is what contributes to gains in the ability to encode and interpret speech in real time. However, it is also possible that speech-processing ability gates the input to statistical learning mechanisms. In fact, it is likely that encoding and statistical learning processes influence each other across development, perhaps becoming more interdependent as infants gain language experience. A potential approach to determining how these processes are related would entail examining these relationships as they unfold across development (see Arciuli & Torkildson, Reference Arciuli and Torkildsen2012), including measures of phonological processing.

In sum, previous studies have linked statistical learning with language outcomes (e.g. Graf Estes et al., Reference Graf Estes, Edwards and Saffran2011; Lany, Reference Lany2014), and the current work builds on such findings by demonstrating a link between statistical learning and a specific language skill – lexical-processing efficiency. These data suggest that LPE may be a good way to test relations between statistical learning ability and language development. Furthermore, consistent with findings from work with adults (Conway et al., Reference Conway, Bauernschmidt, Huang and Pisoni2010), they suggest that one way that good statistical learning skills may affect infants’ language development is by promoting the ability to encode speech rapidly.

Footnotes

[*]

This work was supported by funds from NSF BCS-1352443 and by Notre Dame's Institute for Scholarship in the Liberal Arts to JL.

1 Excluded infants’ receptive vocabulary size (M = 161, range 4–383) was comparable to that of the infants whose data were included (see Table 1).

2 The results were largely the same when we used a subtraction measure of listening time to words minus part-words.

3 On average, excluded participants' vocabulary size (M = 129, range 22–338), which was comparable to that of the infants included in the final dataset (see Table 3).

4 As in Experiment 1, the pattern of findings across the analyses was unchanged when we used mean listening to Grammatical strings minus listening to Ungrammatical strings, rather than the proportional score used here.

References

REFERENCES

Arciuli, J. & Torkildsen, J. von Kos (2012). Advancing our understanding of the link between statistical learning and language acquisition: the need for longitudinal data. Frontiers in Psychology 3, online: <doi:10.3389/fpsyg.2012.00324>.Google Scholar
Aslin, R. N., Saffran, J. R. & Newport, E. L. (1998). Computation of conditional probability statistics by 8-month-old infants. Psychological Science 9 321324.CrossRefGoogle Scholar
Bates, E., Bretherton, I. & Snyder, L. (1988). From first words to grammar: individual differences and dissociable mechanisms. Cambridge: Cambridge University Press.Google Scholar
Borovsky, A., Ellis, E. M., Evans, J. L., Elman, J. L. (2016). Semantic structure in vocabulary knowledge interacts with lexical and sentence processing in infancy. Child Development 87, 1893–908.Google Scholar
Borovsky, A., Elman, J. L. & Fernald, A. (2012). Knowing a lot for one's age: vocabulary skill and not age is associated with anticipatory incremental sentence interpretation in children and adults. Journal of Experimental Child Psychology 112, 417–36.Google Scholar
Cameron-Faulkner, T., Lieven, E. & Tomasello, M. (2003). A construction based analysis of child directed speech. Cognitive Science 27, 843–73.CrossRefGoogle Scholar
Conway, C. M., Bauernschmidt, A., Huang, S. S. & Pisoni, D. B. (2010). Implicit statistical learning in language processing: word predictability is the key. Cognition 114, 356–71.CrossRefGoogle ScholarPubMed
Ellis, E. A., Robledo Gonzalez, M. & Deak, G. O. (2013). Visual prediction in infancy: What is the association with later vocabulary? Language Learning and Development 10, 3650.Google Scholar
Farmer, T. A., Christiansen, M. H. & Monaghan, R. (2006). Phonological typicality influences on-line sentence comprehension. Proceedings of the National Academy of Sciences 103, 12203–08.CrossRefGoogle ScholarPubMed
Fernald, A., Perfors, A. & Marchman, V. A. (2006). Picking up speed in understanding: speech processing efficiency and vocabulary growth across the 2nd year. Developmental Psychology 42, 98116.Google Scholar
Fernald, A., Pinto, J. P., Swingley, D., Weinberg, A. & McRoberts, G. W. (1998). Rapid gains in speed of verbal processing by infants in the 2nd year. Psychological Science 9, 228–31.CrossRefGoogle Scholar
Fernald, A., Swingley, D. & Pinto, J. P. (2001). When half a word is enough: infants can recognize spoken words using partial phonetic information. Child Development 72, 1003–15.CrossRefGoogle ScholarPubMed
Fernald, A., Zangl, R., Portillo, A. & Marchman, V. A. (2008). Looking while listening: using eye movements to monitor spoken language comprehension by infants and young children. In Sekerina, I. A., Fernández, E. M. & Clahsen, H. (eds), Developmental psycholinguistics: on-line methods in children's language processing, 97135. Amsterdam: John Benjamins.CrossRefGoogle Scholar
Gerken, L. A., Wilson, R. & Lewis, W. (2005). Seventeen-month-olds can use distributional cues to form syntactic categories. Journal of Child Language 32, 249–68.Google Scholar
Gómez, R. L. & Lakusta, L. (2004). A first step in form-based category abstraction in 12-month-old infants. Developmental Science 7, 567–80.CrossRefGoogle ScholarPubMed
Gómez, R. L. & Maye, J. (2005). The developmental trajectory of nonadjacent dependency learning. Infancy 7, 183206.Google Scholar
Graf Estes, K. (2012). Infants generalize representations of statistically segmented words. Frontiers in Psychology 3. Online: <doi: 10.3389/fpsyg.2012.00447>.Google Scholar
Graf Estes, K., Edwards, J. & Saffran, J. R. (2011). Phonotactic constraints on infant word learning. Infancy 16, 180–97.CrossRefGoogle ScholarPubMed
Graf Estes, K., Evans., J. L., Alibali, M. W. & Saffran, J. R. (2007). Can infants map meaning to newly segmented words? Statistical segmentation and word learning. Psychological Science 18, 254–60.Google Scholar
Graf Estes, K., Gluck, S. C. & Grimm, K. J. (2016). Finding patterns and learning words: infant phonotactic knowledge is associated with vocabulary size. Journal of Experimental Child Psychology 146, 3449.Google Scholar
Hay, J. F., Pelucci, B., Graf Estes, K. & Saffran, J. R. (2011). Linking sounds to meanings: infant statistical learning in a natural language. Cognitive Psychology 63, 93106.Google Scholar
Houston-Price, C. & Nakia, S. (2004). Distinguishing novelty and familiarity effects in infant preference procedures. Infant and Child Development 13, 341–8.Google Scholar
Hunter, M. A. & Ames, E. W. (1988). A multifactor model of infant preferences for novel and familiar stimuli. Advances in Infancy Research 5, 6995.Google Scholar
Huttenlocher, J., Haight, W., Bryk, A., Seltzer, M. & Lyons, T. (1991). Early vocabulary growth: relation to language input and gender. Developmental Psychology 27, 236–48.CrossRefGoogle Scholar
Kemler Nelson, D. G., Jusczk, P. W., Mandel, D. R., Myers, J., Turn, A. & Gerken., L. A. (1995). The Head-Turn Preference Procedure for testing auditory perception. Infant Behavior and Development 18, 111–6.Google Scholar
Kidd, E. & Arciuli, J. (2016). Individual differences in statistical learning predict children's comprehension of syntax. Child Development 87, 184–93.Google Scholar
Lany, J. (2014). Judging words by their covers and the company they keep: probabilistic cues support word learning. Child Development 85, 1727–39.Google Scholar
Lany, J. & Gómez, R. L. (2008). Twelve-month-old infants benefit from prior experience in statistical learning. Psychological Science 19, 1247–52.Google Scholar
Lany, J. & Saffran, J. R. (2010). From statistics to meaning: infant acquisition of lexical categories. Psychological Science 21, 284–91.Google Scholar
Lany, J. & Saffran, J. R. (2011). Interactions between statistical and semantic information in infant language development. Developmental Science 14, 1207–19.Google Scholar
Lany, J. & Saffran, J. R. (2013). Statistical learning mechanisms in infancy. In J. L. R. Rubenstein & P. Rakic (eds.), Comprehensive developmental neuroscience: neural circuit development and function in the brain, vol. 3 (pp. 231-248). Amsterdam: Elsevier.Google Scholar
Lieven, E., Behrens, H., Speares, J. & Tomasello, M. (2003). Early syntactic creativity: a usage-based approach. Journal of Child Language 30, 333–70.Google Scholar
Mani, N. & Huettig, F. (2012). Prediction during language processing is a piece of cake – but only for skilled producers. Journal of Experimental Psychology: Human Perception and Performance 38, 843–7.Google Scholar
Marcus, G. F., Vijayan, S., Rao, S. B. & Vishton, P. M. (1999). Rule learning by seven-month-old infants. Science 283, 7780.Google Scholar
Mintz, T. H., Newport, E. L. & Bever, T. G. (2002). The distributional structure of grammatical categories in speech to young children. Cognitive Science 26, 393424.Google Scholar
Misyak, J. B. & Christiansen, M. H. (2012). Statistical learning and language: an individual differences study. Language Learning: A Journal of Research in Language Studies 62, 302–31.Google Scholar
Newman, R., Ratner, N. B., Jusczyk, A. M. & Jusczyk, P. W. (2006). Infants’ early ability to segment the conversational speech signal predicts later language development: a retrospective analysis. Developmental Psychology 42, 643–55.Google Scholar
Newman, R., Rowe, M.L. & Ratner, N. B (2016). Input and uptake at 7 months predicts toddler vocabulary: the role of child-directed speech and infants processing skills in language development. Journal of Child Language 43, 1158–73.Google Scholar
Pelucci, B., Hay, J. F. & Saffran, J. R. (2009). Statistical learning in a natural language by 8-month-old infants. Child Development 80, 674–85.Google Scholar
Saffran, J. R., Aslin, R. N. & Newport, E. L. (1996). Statistical learning by 8-month-old infants. Science 274, 1926–8.Google Scholar
Saffran, J. R. & Thiessen, E. D. (2003). Pattern induction by infant language learners. Developmental Psychology 39, 484–94.Google Scholar
Shafto, C. L., Conway, C. M., Field, S. L. & Houston, D. M. (2012). Visual sequence learning in infancy: domain-general and domain-specific associations with language. Infancy 17, 247–71.Google Scholar
Singh, L., Reznik, J. S. & Xuehua, L. (2012). Infant word segmentation and childhood vocabulary development: a longitudinal analysis. Developmental Science 15, 482–95.Google Scholar
Spencer, M., Kaschak, M. P., Jones, J. L. & Lonigan, C. J. (2015). Statistical learning is related to early literacy-related skills. Reading and Writing 28, 467–90.Google Scholar
Swingley, D. (2005). Statistical clustering and the contents of the infant vocabulary. Cognitive Psychology 50, 86132.Google Scholar
Waxman, S. R., Lidz, J. L., Braun, I. E. & Lavin, T. (2009). Twenty-four-month-old infants’ interpretation of novel verbs and nouns in dynamic scenes. Cognitive Psychology 59, 6795.CrossRefGoogle ScholarPubMed
Weisleder, A. & Fernald, A. (2013). Talking to children matters: early language experience strengthens processing and builds vocabulary. Psychological Science 24, 2143–52.Google Scholar
Figure 0

Table 1. Experiment 1 descriptive statistics

Figure 1

Fig. 1. Infants preference scores are calculated as the proportion of time spent listening to syllable sequences that had high TPs divided by total listening time.

Figure 2

Table 2. Experiment 1 correlations

Figure 3

Table 3. Regression models

Figure 4

Fig. 2. Infants’ preference scores are calculated as the proportion of time spent listening to attested word-order patterns divided by total listening time.

Figure 5

Table 4. Experiment 2 descriptive statistics

Figure 6

Table 5. Experiment 2 correlations

Figure 7

Table 6. Regression models