Hostname: page-component-745bb68f8f-hvd4g Total loading time: 0 Render date: 2025-02-11T10:22:43.069Z Has data issue: false hasContentIssue false

On the role of vowel duration in the New Zealand English front vowel shift

Published online by Cambridge University Press:  27 November 2009

Christian Langstrof
Affiliation:
University of Freiburg
Rights & Permissions [Opens in a new window]

Abstract

This article investigates the role of vowel duration in the front vowel system of New Zealand English (NZE), drawing on data obtained from speakers born between the 1890s and the 1930s. After providing a brief overview of the history of short vowels in NZE, a comprehensive analysis of front vowel duration in conjunction with a number of earlier results from formant frequency measurements will be presented. It will be shown that the front vowel system of NZE shows interaction between vowel duration and formant frequency. A number of implications that follow from these patterns for the front vowel system of NZE will be discussed. It will be argued that it is reasonable to divide up the class of short front vowels in NZE into a short set (consisting only of one vowel) and a “not-so-short set.” In addition, it will be concluded that phonological class membership is irrelevant to making generalizations over patterns of movements in vowel change.

Type
Research Article
Copyright
Copyright © Cambridge University Press 2009

This article investigates the role of vowel duration in the front vowel system of New Zealand English (NZE). Duration has largely been ignored in previous studies on the development of the short front vowels (SFVs) in NZE. In the present article, it will be argued that vowel duration plays an integral role in the process of the restructuring of the NZE front vowel space over the last 150 years. It will be shown that it is reasonable to divide up the class of SFVs (/Iε æ/) in NZE into a “short set” (consisting only of one vowel) and a “not-so-short set,” for want of a more appropriate designation. In the context of the NZE chain shift that involves the SFVs, duration will be shown to constitute a crucial phonetic parameter in resolving vowel distributions that overlap in F1/F2 space in intermediate stages of the shift. Therefore, the existence of solid length differences in these intermediate stages of the chain shift might have contributed to there being a chain shift in the first place rather than a merger.

In addition to this, the existence of solid durational differences between different short vowels might help elucidate the problem of the “erratic” behavior of these vowels in the theoretical framework on chain shifts proposed by Labov (Reference Labov1994). In Labov's view, the NZE SFV chain shift constitutes a counterexample to the general observation that short vowels generally fall in chain shifts, that is, show downward movements in vowel space over time. The data presented here suggests that two of the vowels in question (namely /E/ and /æ/) can be regarded as somewhat different from a “true” short vowel as /I/ in terms of their durational properties. Because these vowels are not particularly good examples of short vowels, it follows that their behavior is indeed in accordance with the Labovian regularities in chain shifts. It will be argued that what counts in predicting the pathways of vowels in chain shifts are the phonetic properties of vowels rather than their phonological status.

This article is structured as follows. The Background section provides a brief overview of the history of short vowels in NZE. The Methods section discusses the data sample. The Results section provides a comprehensive analysis of front vowel duration in conjunction with a number of earlier results from formant frequency measurements. It will be shown that, on a phonetic level at least, the front vowel system of NZE shows an interaction between vowel duration and formant frequency. The Summary/Discussion and Conclusions section discuss a number of implications that follow from these patterns for the front vowel system of NZE.

BACKGROUND

The repartitioning of the NZE front vowel space is now one of the most widely documented changes in progress in contemporary English (cf. Bauer, Reference Bauer1979, Reference Bauer1986, Reference Bauer1992; Gordon, Campbell, Hay, Maclagan, Sudbury, & Trudgill, Reference Gordon, Campbell, Hay, Maclagan, Sudbury and Trudgill2004; Maclagan, Reference Maclagan1982; Trudgill, Gordon, & Lewis, Reference Trudgill, Gordon and Lewis1998). It mainly includes a vowel shift between the short vowels /I/, /ε/ and /æ/. In addition, it has been shown that correlations exist between the quality and duration of these vowels and a number of other vowels in the front vowel system of NZE, namely /i/ (Maclagan & Hay, Reference Maclagan and Hay2007) and /a/ (Gordon et al., Reference Gordon, Campbell, Hay, Maclagan, Sudbury and Trudgill2004). I will refer to the vowels in question by their lexical set names, where kit = /I/, dress = /ε/, and trap /æ/ (cf. Langstrof Reference Langstrof2006a:ch. 1 for discussion; Wells, Reference Wells1982).

The SFV shift itself consists of raising and fronting of the lower two vowels trap and dress as well as the centralization and lowering of the highest vowel kit, which results in a restructured short vowel space consisting of two front vowels (dress and trap), two central vowels (kit and strut), and two back vowels (foot and lot). Although it has been shown that raised variants were already present in the speech of the earliest European immigrants to New Zealand (Gordon et al., Reference Gordon, Campbell, Hay, Maclagan, Sudbury and Trudgill2004), most of the chain shift is endemic (Langstrof, Reference Langstrof2006a:ch. 3). In addition, it is now clear that the raising of trap and dress predated the centralization of kit, which implies that the process constitutes a push-chain (Langstrof, Reference Langstrof2006b). Although most studies have focused on the SFV shift in isolation (and therewith implicitly regard this as a self-contained historical process), there is evidence of interaction between the SFVs and other vowels in NZE. Gordon et al. (Reference Gordon, Campbell, Hay, Maclagan, Sudbury and Trudgill2004) showed that there exists, for some speakers, a significant correlation between the short vowel trap and the long vowel /a/ (henceforth start) in their data from first- and second-generation New Zealanders. Specifically, for the speakers for whom the correlation holds, frontness in start correlates positively with frontness and closeness in trap. However, it is unclear whether this constitutes a truly causal relationship between the two elements, because there are essentially three ways for the observed correlation to come about: (1) via a push-chain relationship, which would imply that fronting of start triggered the entire short vowel shift; (2) as a secondary pull-chain effect whereby start moves into the vowel space vacated by raised trap; (3) no causal relation at all, which means that the correlation comes about incidentally as speakers of comparable innovativeness in a given vowel change often show a similar degree of innovativeness in another one.

A more in-depth analysis of an interrelation between a short vowel and a long vowel is provided by Maclagan & Hay (Reference Maclagan and Hay2007), who analyzed high front vowels in contemporary NZE and found both a correlation between vowels of different subsystems as well as providing evidence for trade-off relationships across different phonetic dimensions. It is shown that formerly short dress is now a high monophthong, which is distinguished from fleece mainly on the basis of diphthongization and duration rather than quality in the speech of younger informants. Although it is not unusual for mid-monophthongs to push out high monophthongs into the diphthong system (a similar process happened during the Great Vowel Shift) as a result of a raising process, the interesting facet to note in this case is the fact that dress is originally a “short” vowel. This process therefore reflects one of the allegedly rare cases of a clear functional relationship over time between vowels from different subsystems (cf. the extensive discussion in Labov [1994] on this topic). The obvious alternative to the preceding conclusion is to regard contemporary NZE dress as a long vowel, which would make the preceding process rather less mysterious. The Results section of this article presents data from earlier speakers of NZE, which suggests that this alternative view might be preferable on phonetic grounds, while the Summary/Discussion section discusses a number of problems that arise from this solution.

METHODS

The speaker sample

The speakers analyzed in the present article are New Zealanders born between the late 1890s and the mid-1930s. Thus, they represent an intermediate stage between the speakers analyzed by Gordon et al. (Reference Gordon, Campbell, Hay, Maclagan, Sudbury and Trudgill2004) and contemporary New Zealanders. These speakers will be referred to as “intermediate speakers.” Nine speakers who represent a subset of a larger corpus of approximately 110 speakers held at the Origins of New Zealand English (ONZE) archive at the University of Canterbury were analyzed. The speakers are divided into two groups. The first group consists of four male speakers born at the beginning of the intermediate period (more specifically, between the 1890s and the early 1900s). The second group consists of four female speakers born toward the end of that period. The design of the sample was motivated by the following considerations. A parallel analysis of vowel quality of 30 intermediate speakers discussed in Langstrof (Reference Langstrof2006a, Reference Langstrof2006b) revealed that it was in the intermediate period that the shift in the NZE SFV system gained momentum. In addition, the early male group and the late female group represent the polar ends of the innovativeness spectrum with regard to the quality change in the SFVs. It was therefore assumed that if any significant changes occurred over the intermediate period in terms of vowel duration as well, these should show up best if speakers at the opposite end of the innovativeness scale are compared. It should also be noted that the labels “early” and “late” as used in the following analysis refer to the intermediate period only rather than the overall history of the development of NZE.

The speech material consists of interviews of varying lengths that were recorded by various interviewers in the early- to mid-1990s in the course of a project on oral history in New Zealand. Subjects were asked to supply information about their childhood in various regions of New Zealand. The material on which the present study is based consists of running speech.

Analysis

Acoustic analysis of the vowels kit, dress, and trap was carried out using Praat software as well as the plug-in Akustyk.Footnote 1 The duration measurements reported were extracted by Akustyk, whereas formant frequency measurements were taken directly in Praat. The formant frequency data were subsequently normalized using Lobanov's z-score algorithm (Lobanov, Reference Lobanov1971) and coded for a number of independent variables (speaker age, voicing status of the following consonant, number of syllables, F1, F2) and analyzed using classification and regression tree (CART) analysis.Footnote 2 Duration measurements were taken from the visible onset of modal voicing on the spectrogram. Only stressed tokens were measured. The token numbers obtained from each speaker/vowel are given in Table 1. Table 2 shows token numbers for monosyllables and polysyllables, respectively. Only tokens before alveolar stops were measured for dress and trap. For kit, prealveolar as well as prevelar tokens were measured.Footnote 3

Table 1. Number of tokens obtained from two groups of intermediate speakers

[+/–voice] specifies the voicing state of the following consonant.

Table 2. Token numbers obtained from two groups of intermediate speakers broken down into occurrences in monosyllabic and polysyllabic words

RESULTS

The duration data obtained from the nine intermediate speakers are shown in Table 3.

Table 3. Average durations in milliseconds (ms) of SFVs in the speech of 9 intermediate speakers, 4 early males, and 5 late females

Token numbers below n = 5 are indicated.

Vowels in monosyllables

The data given in Table 3 suggest a number of pertinent points. First of all, as can be seen in the second column, both dress and trap are appreciably longer than kit in the speech of the intermediate speakers, where the overall ratios are approximately 1:2 for kit vs. dress, approximately 1:2.1 for kit vs. trap, and 1:1.05 for dress vs. trap in monosyllables. The duration difference between kit and dress is significant (Wilcox test, W = 442.5, p<.0001), but the difference between dress and trap is not. The same pattern holds if we break down the data into the two age groups. The differences between the age groups are not significant. In addition, the well-known length difference within a vowel depending on whether it occurs before a voiced stop or a voiceless stop holds within the present sample (where the difference is significant at p<.0001 [W = 1607.5] for the kit vowel, at p<.0001 for dress [W = 385.5], and at p<.01 for trap [W = 776.5]).

If the data are broken down further into age + voicing of the following consonant, the duration difference between kit vowels before voiced and voiceless consonants fails to reach significance in the late sample. The differences in dress and trap retain significance within both groups.

Overall, this accords with the internal partitioning within the SFV set with respect to the chain-shift in F1/F2 space, in that it is dress and trap that underwent an upward movement, whereas kit centralized. Here we have the same subgrouping (kit vs. dress and trap).

Vowels in polysyllables

If we compare the durations of the SFVs in monosyllables to those in polysyllables, it seems clear that whereas dress and trap are shorter in polysyllables, this does not hold for kit. The duration difference between monosyllables and polysyllables is significant for dress only (W = 276.5, p<.01). In addition, it is only in polysyllables where the duration difference between dress and trap is significant (W = 153.5, p<.01). The contrast in duration between prevoiced and prevoiceless tokens fails to reach significance in all vowels, which might be partly due to low token numbers.

It seems that the results quoted as to the effect of polysyllabicity on duration are reflected to some extent in the present analysis (i.e., the dress vowel). In addition, there is some support for Klatt's (Reference Klatt1973) assumption that there is no linear “adding-up” of shortening factors but rather some minimal level of vowel duration around which shortening factors do not apply anymore. This is exemplified by the kit vowel, where duration differences are minor and reach significance only between two conditions: (1) early male monosyllables closed by a voiced stop vs. early male monosyllables closed by a voiceless stop (W = 733, p<.0001), and (2) early male monosyllables closed by a voiced stop vs. early male polysyllables closed by a voiced stop (W = 348.5, p<.01).

On the whole, it seems that shortening factors do indeed interact. Figure 1 shows duration averages of all three vowels before voiced and voiceless stops in monosyllables (Figures 1a and 1b) and polysyllables (Figures 1c and 1d). It seems that for a vowel to be subject to shortening factors, it needs to have a certain minimum length. That is, the only significant shortenings occur in dress/trap where monosyllabic tokens before voiced stops undergo similar degrees of shortening in both polysyllables as well as in voiceless environments. In addition, these shortening factors do not stack. The kit vowel is, by and large, exempt from either process with the exception of early male kit before voiced stops, which undergoes shortening. As far as duration averages are concerned, we conclude that dress and trap are not only significantly longer than kit, but also have a number of properties typical of long vowels such as susceptibility to shortening in certain environments as well as a greater internal (i.e., within one vowel) durational variability.

Figure 1. Average durations of kit, dress, and trap before voiced and voiceless stops: monosyllables in the early male (a) and early female (b) samples; polysyllables in the early male (c) and early female (d) samples.

Factor analysis and the relation between duration and formant frequency

A CART analysis was carried out in order to clarify the relation between duration and formant frequency as it holds in the intermediate speaker sample.Footnote 4 This was done to test for the solidity of the subgroupings inferred from the data discussed previously. If dress and trap are truly different than kit in terms of duration, this should be reflected in a regression analysis. In addition, this subgrouping should be relatively more prominent compared with other factors that are known to have an impact on vowel duration, such as voicing of the following consonant or syllabicity.

Figure 2 shows the result of a CART analysis carried out by taking into account a further continuous factor, namely formant frequency. In Figure 2, duration is the dependent variable regressed onto the following independent variables: vowel (kit/dress/trap), F1, F2, age group (early vs. late), syllable (mono[syllabic]/poly[syllabic]), voicing status of the following consonant (voiced/voiceless).

Figure 2. Classification and regression tree of pooled duration measurements along the independent variables F1, F2, age, vowel, voice, and syllable.

As was to be expected on the grounds of the data presented, the main predictor of overall duration is vowel identity, whereby kit is appreciably shorter than dress and trap. In addition, no further factors partition the kit branch, whereas the dress/trap branch is split into vowels before voiceless stops (average duration 85.29 ms) and voiced stops (average duration 110.7 ms). This distinction takes precedence over vowel identity, which is a factor only within the voiceless node, where dress is shorter than trap. For dress/trap vowels before voiced stops, it is syllable structure that predicts duration in that both dress and trap tend to be markedly longer if they occur in monosyllables closed by a voiced stop than in polysyllables closed by a voiced stop. The two major conclusions from the Analysis section therefore seem reflected in the pooled CART data as well. “Shortness” of a vowel goes along with a lack of further internal partitioning along the duration dimension (as kit represents a terminal node, whereas dress/trap do not) and shortening factors interact, although with respect to Figure 2, we may prefer to say “lengthening factors interact” because the terminal branching that relates to a phonetic lengthening factor (i.e., syllabicity) stems from the superordinate node, which has the higher duration average (i.e., the dress/trap tokens before voiced stops). Before voiceless stops, vowel identity is a better predictor of duration (where dress is shorter than trap).

What is striking is that in the pooled data, formant frequency does not show up as a factor in predicting vowel length, although this would be expected on the grounds that F1 has frequently been found to correlate positively with duration (cf. Peterson & Lehiste, Reference Peterson and Lehiste1960). In addition, “peripherality” has previously been related to length (cf. Labov, Reference Labov1994; Labov, Yaeger, & Steiner, Reference Labov, Yaeger and Steiner1972, for a discussion of the relevant terminology) to the effect that the more peripheral a vowel, the longer it would be expected to be. Because we are concerned with front vowels, we would therefore expect a positive correlation between F2 and duration.

This interaction does indeed show up if we break up the overall data pool by vowel. Figures 3a to 3c show separate CART trees for the three vowels. The variables are as previously given. The trees are truncated below the second branching.

Within both the kit and the dress trees, the major predictive factor in determining duration is the voicing status of the following consonant. Vowels before voiced consonants are longer than before voiceless ones (where the ratios are 1:1.2 for kit and 1:1.58 for dress). Two points deserve mention in this context. First of all, it is somewhat surprising to find this division in the kit data as it seems to be mainly a property of the early group, which implies that if anything, we would expect this as a subordinate branching under a superordinate branching along the factor age. On the other hand, the late group does show a length difference (albeit not significant) within kit to the same effect, which suggests that this difference is carried by the late group in the pooled kit data. Second, it looks as if the duration difference of vowels before voiced vs. voiceless consonants which we noted as a property of the overall dress/trap data in Figure 3 was carried mainly by dress, as it fails to show up as a predictive factor in the trap data (Figure 3c). In the kit data, second formant frequency is a subordinate factor below both primary branches. The data are somewhat contradictory because within the group of tokens before voiceless consonants, high F2 values seem to predict lower vowel duration, whereas within the sample of vowels that occurs before voiced stops, the inverse is the case. Although this seems unexpected, the vowel tokens for which the negative correlation holds constitute the shortest subsample overall. If we recall the notion as outlined previously that the longer the intrinsic duration of a vowel, the more well-established the phonetic mechanisms which favor lengthenings/shortenings, this is probably the subsample where overriding of the expected mechanisms is least unexpected.

Figure 3. Classification and regression tree analysis of duration measurements broken down into the three SFVs.

The trap data are best predicted by second formant frequency in that the higher the F2, the longer the duration of a vowel token. In addition, we note in both the dress and the trap data an effect of syllable structure whereby vowels in monosyllables have higher duration averages than vowels in polysyllables within a branching that is already of comparatively high average duration.

It is interesting to note that the expected correlation between F1 and duration shows up only within one terminal branch and for one vowel (dress) only. In addition, it fails to reach significance in a Spearman correlation test (ρ = .27, S = 2955, p>1). Overall correlation test results are summed up in Table 4.

Table 4. Correlation coefficients between the variables duration and F1/F2 in the SFVs as obtained by a Spearman rank correlation (where ρ = correlation coefficient)

Correlations that reach statistical significance below the 5% level are given in bold.

It seems clear that there is nowhere a statistically significant correlation between F1 and vowel duration. However, some F2 effects can be noted. In the kit data, there is a positive correlation between F2 and duration in monosyllables and before voiced stops, which is in keeping with the preceding analysis. Within the dress/trap data, F2 correlates with duration overall, although the effect seems to be carried by monosyllables only. In addition, there is an interesting mismatch between the two vowels in terms of how they behave across the two speaker groups. Whereas duration and F2 correlate in dress in the early group only, the inverse is the case in the trap data.

Duration and F1/F2 overlap

Given the preceding results, we can conclude that dress and trap are different from a “true” short vowel such as kit. The reasons for this development in NZE can probably be sought in the mechanisms of the SFV shift itself. There is an appreciable degree of F1/F2 overlap between kit and dress in the speech of early intermediate speakers (cf. Langstrof, Reference Langstrof2006a, Reference Langstrof2006b). This overlap can be shown to exist in the speech of one and the same individual, which implies that additional cues are necessary to disambiguate vowel tokens in regions of overlap to prevent merger.

Figure 4a shows that for one and the same individual, a number of tokens are ambiguous between dress and kit in F1/F2 space. However, if we plot length against the first formant frequency for the same speaker (Figure 4b), this overlap largely disappears. It therefore appears that there is, to some extent, a trade-off relation between duration and formant frequency in terms of overlap between adjacent vowel distributions. Figure 5 plots duration against both formant frequencies for the early male and the late female sample.

Figure 4. Differential degrees of overlap between the lexical sets of kit and dress in the speech of an early male speaker. Figure 4a plots the first two formants against each other, and 4b plots duration against the first formant.

Figure 5. F1/F2 duration plots of kit, dress, and trap in the early male and late female sample.

In the F1/F2 dimension, the early sample shows nearly complete overlap between dress and its neighboring vowels. This overlap is resolved over the intermediate period by kit centralisation and dress raising (Figure 5b). However, if we plot frequency against duration (Figures 5c to 5f) it appears that the trap distribution moves toward the kit distribution in the F1/duration plot. Table 5 presents Hotelling-Lawley trace scores (a measure of the relative distance between distributions) for each of the distributions plotted in Figure 5.

Table 5. Hotelling-Lawley trace scores of the distributions of kit, dress, and trap in the early male and the late female sample

All scores are significant below .01.

The Hotelling-Lawley traces shown in Table 5 indicate wider separation between kit and dress in the late sample compared with the early sample. As for dress vs. trap, it is only in the F1/duration dimension where the distance between the two distributions decreases over time. Finally, kit and trap approximate each other in all dimensions.

The assumption that speakers/listeners of a language that is undergoing a vowel change whereby the distributions of two vowel phonemes approximate each other over time take advantage of (or “exaggerate”) alternative cues to the identity of a token in the overlap region seems plausible. What seems to be important here is that subphonemic durational cues can be dialect-specific and induced by historical processes such as vowel shift. In addition, the analysis presented earlier suggests that they are part of an extrinsic allophony and are therefore variable in time and space.

What requires an explanation is why there should have occurred a reactionary movement of kit, given that in this intermediate situation the two vowels were still solidly distinguished in F1 vs. duration space. The only hypothesis I have to offer here is purely conjectural. During the stage of the shift as exemplified by speakers such as J.M., we find three vowels in rather close proximity to each other, namely kit, dress, and fleece. What this amounts to is a three-way distinction in vowel length in that region of the vowel space, which is not mirrored anywhere else in the vowel system. It has been pointed out before (cf. Bybee, Reference Bybee2001, on the rise and fall of early Middle English rounded front vowels) that although such constellations can arise as the outcome of nonteleological sound change, they seem to be abandoned rather quickly. That is, phoneme systems tend to have similar types of distinctions across the board. In the case of the SFV shift in the intermediate period, this means that the duration difference might be best regarded as a “transitional cue” in disambiguating F1/F2 overlap between two phonemes.

SUMMARY/DISCUSSION

The important points to note are as follows:

  • dress and trap are considerably longer than kit.

  • For all three lexical sets, vowels before voiced consonants are longer than before voiceless ones.

  • In addition, this effect is stronger in dress than it is in any of the other two vowels.

  • Lengthening factors combine more linearly in longer segments.

  • There is no statistically significant effect of F1 (i.e., height) on vowel length.

  • In monosyllables, there is a strong effect of F2 on vowel length, whereby on the whole, frontness correlates positively with length.

  • trap differs from the other two vowels in that the F2 effect is more important in predicting vowel duration than the voicing state of the following consonant.

  • There are no statistically significant duration differences between the two speaker groups in any of the SFVs.

  • Duration is a solid predictor of vowel identity in overlapping distributions.

From these results, a number of important implications follow with regard to front vowel length in the intermediate period. First of all, overall vowel length indicates that dress and trap are in fact different from kit, because they are appreciably longer. Against the background of the front vowel shift, we can therefore conclude that they are indeed “different things” on the grounds of their differential behavior along two phonetic dimensions (i.e., dress and trap both raise/front and are longer; kit centralizes and is shorter).

Second, the lack of any strong correlation between F1 and length permits two possible interpretations of the relation between height and quantity in NZE. The straightforward one would be to conclude that in NZE, intrinsic constraints on vowel length depending on the vowel's openness are simply overridden. The appeal of this assumption probably depends on how mechanistically one interprets such a constraint in the first place, that is, whether this relation is primarily language- or dialect-specific, or whether it follows from motoric constraints in speech production. Under the latter premise, this interpretation of vowel length in NZE becomes rather ad hoc and mysterious. The alternative would be to assume that the condition holds, but that there has been moderate subphonemic lengthening over time in the SFV set in NZE that offsets this intrinsic effect and thereby preserves isometry in the face of movement across articulatory space. However, the data presented here permit no clear-cut conclusion in this regard.

With respect to Labov's (1994) theory of directional vowel shift, the present analysis casts doubt upon the alleged exceptional status of dress and trap in NZE. Phonetic shortness of dress and trap is hard to ascertain in the face of the presence of a phonetically much shorter vowel, namely kit. Apart from phonetic duration, it was also shown that dress and trap show a number of phonetic interactions that are more typical of long vowels, such as a larger duration difference depending on whether the vowel occurs in mono- or polysyllables or before a voiced or voiceless stop. It may therefore be advisable to remain cautious in positing that dress and trap are short vowels in NZE and should therefore be expected to behave like lot, strut, or foot rather than their tense counterparts. Under this interpretation, the behavior of dress and trap is therefore well in line with the general principles of vowel shifts.

CONCLUSIONS

In this article, I showed that there exist solid duration differences between lax front vowels in NZE and argued that this duration difference might have been the crucial factor that allowed listeners to keep the distributions of otherwise overlapping vowels apart. Labov & Baranowski (Reference Labov and Baranowski2006) came to a similar conclusion. In a perceptual study of vowel contrasts in American English, it was found that a durational difference of 50 ms is sufficient for listeners to reliably keep vowel distributions apart in perception. It therefore seems advisable to take a closer look at the role of vowel duration in studies on vowel change in the future.

In the case of the speakers analyzed in the present article, both dress and trap are considerably longer than kit. This subgrouping reflects the patterning of these vowels in the NZE front vowel shift, where dress and trap undergo raising and fronting, whereas kit undergoes centralization. In addition, only dress and trap show a considerable degree of durational allophonization, a property usually found in long vowels. We can therefore justifiably posit that these elements may be different from “true” short vowels on phonetic and historical grounds. Taking this argument one step further, it seems advisable to separate the phonological status of vowels from their classification as phonetic and diachronic entities. Although this is probably a rather commonplace assumption, the conclusion with respect to what counts in sound change may be less so. If the subgrouping of these elements with regard to their movements in vowel spaces reflects their phonetic patterning rather than phonological class-membership (after all, all three vowels show the well-known phonotactic properties of lax vowels in English), we cannot make statements such as “tense vowels rise” and the like in the absence of a workable a priori definition of tenseness in phonetic terms. Rather, generalizations over the movements of certain elements over time in vowel space need to be based on their phonetic properties, which furthermore need to be reassessed in each variety or language. Assigning a given element (say the dress vowel) to a “short” vowel set on the basis of etymology a priori does not yield the correct results with regard to how one would expect a given element to behave in vowel change.

This view is corroborated by the data in the Results section, where it was shown that duration can serve as a cue toward separating vowel distributions that overlap in F1/F2 space, even in cases where the two distributions in question belong to the same subset of vowels. Therefore, what counts as a predictor in making generalizations over vowel movements are the phonetic properties of the given vowels at specific points in time rather than their phonological properties or their etymological origins.

Footnotes

1 Praat: Doing Phonetics by Computer was developed by Paul Boersma and David Weenink. It is available at: www.praat.org. The Akustyk add-on was developed by Bartlomiej Plichta. It is available at: bartus.org.

2 An in-depth mathematical description of CART can be found in Breiman, Friedman, Olshen, & Stone (Reference Breiman, Friedman, Olshen and Stone1984). Cf. also Mendoza-Denton, Hay, & Jannedy (Reference Mendoza-Denton, Hay, Jannedy, Bod, Hay and Jannedy2003) for a discussion regarding the advantages of CART for analyses that involve continuous dependent variables such as formant frequency values.

3 This was done because prevelar kit tokens were shown to be the least innovative allophones in the F1/F2 dimension during the SFV shift (Langstrof, Reference Langstrof2006a). However, no significant durational difference between prevelar tokens and prealveolar tokens showed up in the subsample discussed here.

4 Note that in the CART figures as shown throughout this article, the first branching corresponds to the statistically most significant division within the data. The CART trees always divide the data pool into two parts that reflect the most significant partitioning of the data within each node. For example, the highest-order node in Figure 3 is to be read as follows. Within the overall data pool on vowel duration, the most significant predictor of duration is vowel identity. The group of kit tokens has an average duration of 48.33 ms, the group of dress and trap tokens has an average duration of 98.32 ms. The length of the branches is not related to the significance of the division.

References

REFERENCES

Bauer, Laurie. (1979). The Second Great Vowel Shift. Journal of the International Phonetic Association 9.2:5766.Google Scholar
Bauer, Laurie (1986). Notes on New Zealand English phonetics and phonology. English World-Wide 7:225258.Google Scholar
Bauer, Laurie (1992). The Second Great Vowel Shift revisited. English World-Wide 13.2:253268.Google Scholar
Breiman, Leo, Friedman, Jerome, Olshen, Richard A., & Stone, Charles. (1984). Classification and regression trees. New York: Chapman and Hall.Google Scholar
Bybee, Joan. (2001). Phonology and language use. Cambridge: Cambridge University Press.CrossRefGoogle Scholar
Gordon, Elizabeth, Campbell, Lyle, Hay, Jennifer, Maclagan, Margaret, Sudbury, Andrea, & Trudgill, Peter. (2004). New Zealand English: Its origin and evolution. Cambridge: Cambridge University Press.Google Scholar
Klatt, Dennis. (1973). Interaction between two factors that influence vowel duration. Journal of the Acoustical Society of America 55:11021104.Google Scholar
Labov, William. (1994). Principles of linguistic change, Vol. 1: Internal factors. Malden, MA: Blackwell.Google Scholar
Labov, William, Yaeger, Malcah, & Steiner, Richard. (1972). A quantitative study of sound change in progress. Report on National Science Foundation Contract NSF-GS-3287, University of Pennsylvania.Google Scholar
Labov, William, & Baranowski, Maciej. (2006). 50 milliseconds. Language Variation and Change 18:118.Google Scholar
Langstrof, Christian. (2006a). Vowel change in New Zealand English—Patterns and implications. Ph.D. dissertation, University of Canterbury.Google Scholar
Langstrof, Christian (2006b). Acoustic evidence for a push-chain shift in the intermediate period of New Zealand English. Language Variation and Change 18:141164.Google Scholar
Lobanov, Boris. (1971). Classification of Russian vowels spoken by different speakers. Journal of the Acoustical Society of America 49:606608.Google Scholar
Maclagan, Margaret. (1982). An acoustic study of New Zealand English vowels. The New Zealand Speech Therapists' Journal 37(1):2026.Google Scholar
Maclagan, Margaret, & Hay, Jennifer. (2007). Getting fed up with our feet: Contrast maintenance and the New Zealand English “short” front vowel shift. Language Variation and Change 19:125.CrossRefGoogle Scholar
Mendoza-Denton, Norma, Hay, Jennifer, & Jannedy, Stefanie. (2003). Probabilistic sociolinguistics: Beyond variable rules. In Bod, R., Hay, J., & Jannedy, S. (eds.), Probabilistic Linguistics. Cambridge, MA: MIT Press.Google Scholar
Peterson, G. E., & Lehiste, I. (1960). Duration of syllable nuclei in English. Journal of the Acoustical Society of America 32:693703.Google Scholar
Trudgill, Peter, Gordon, Elizabeth, & Lewis, Gillian. (1998). New dialect formation and Southern Hemisphere English: The New Zealand short front vowels. Journal of Sociolinguistics 2(1):3551.Google Scholar
Wells, J. C. (1982). Accents of English. Cambridge: Cambridge University Press.Google Scholar
Figure 0

Table 1. Number of tokens obtained from two groups of intermediate speakers

Figure 1

Table 2. Token numbers obtained from two groups of intermediate speakers broken down into occurrences in monosyllabic and polysyllabic words

Figure 2

Table 3. Average durations in milliseconds (ms) of SFVs in the speech of 9 intermediate speakers, 4 early males, and 5 late females

Figure 3

Figure 1. Average durations of kit, dress, and trap before voiced and voiceless stops: monosyllables in the early male (a) and early female (b) samples; polysyllables in the early male (c) and early female (d) samples.

Figure 4

Figure 2. Classification and regression tree of pooled duration measurements along the independent variables F1, F2, age, vowel, voice, and syllable.

Figure 5

Figure 3. Classification and regression tree analysis of duration measurements broken down into the three SFVs.

Figure 6

Table 4. Correlation coefficients between the variables duration and F1/F2 in the SFVs as obtained by a Spearman rank correlation (where ρ = correlation coefficient)

Figure 7

Figure 4. Differential degrees of overlap between the lexical sets of kit and dress in the speech of an early male speaker. Figure 4a plots the first two formants against each other, and 4b plots duration against the first formant.

Figure 8

Figure 5. F1/F2 duration plots of kit, dress, and trap in the early male and late female sample.

Figure 9

Table 5. Hotelling-Lawley trace scores of the distributions of kit, dress, and trap in the early male and the late female sample