1 Introduction
1.1 An aeroacoustic confound revisited
Ejective fricatives are relatively rare sounds in the world's languages. In the UCLA Phonological Segment Inventory Database (UPSID-PC) (Maddieson & Precoda Reference Maddieson and Precoda1991), only ten of 451 languages (2.2%) are reported to have an ejective fricative. The languages with ejective fricatives reported in this database are:
Acoma (Keres) – Miller Reference Miller1966
Berta (Nilo-Saharan) – Triulzi, Dafallah & Bender Reference Triulzi, Dafallah, Bender and Bender1976
Dakota (Siouan) – Boas & Deloria Reference Boas and Deloria1939
Kabardian (Caucasian) – Kuipers Reference Kuipers1960
Koma (Adamawa-Ubangi) – Tucker & Bryan Reference Tucker and Bryan1966
Mazahua (Oto-Manguean) – Spotts Reference Spotts1953
Soqotri (South Arabian) – Leslau Reference Leslau1938, Johnstone Reference Johnstone1975
Tlingit (Na-Dene) – Story & Naish Reference Story and Naish1973, Swanton Reference Swanton and Boas1911
WichitaFootnote 1 (Caddoan) – Garvin Reference Garvin1950, Rood Reference Rood1975
Yuchi – Crawford Reference Crawford1973, Ballard Reference Ballard and Crawford1975
To the UPSID-PC list, one might add the following:
Adyghe, Bagwali (or Bagwalal), and Karata (Caucasian) – Catford Reference Catford and Paris1992
HausaFootnote 2 (Chadic) – Ladefoged & Maddieson Reference Ladefoged and Maddieson1996, Newman Reference Newman2000, although see Lindsey, Hayward & Haruna Reference Lindsey, Hayward and Haruna1992
Mehri, Harsusi, Bathari, Hobyot, Jibbali (or Shehri) (SouthArabian) – Simeone-Senelle Reference Simeone-Senelle and Hetzron1997
Upper Necaxa Totonac (Totonac-Tepehua) – Beck Reference Beck and Kirchner2006
Yapese (Oceanic) – Jensen Reference Jensen1977, Maddieson Reference Maddieson1998
Amharic (Ethio-Semitic) – Leslau Reference Leslau1995
Tigre (Ethio-Semitic) – Palmer Reference Palmer1962, Raz Reference Raz1983
Tigrinya (Ethio-Semitic) – Leslau Reference Leslau1939, Reference Leslau1941
This additional list is based on a general survey of the literature on ejective fricatives, not on an exhaustive survey of language grammars.
The presence of an ejective fricative in a language seems almost always to imply the presence of at least one other ejective obstruent. In all UPSID-PC languages the presence of an ejective fricative implies the presence of at least one ejective stop. One exception to this generalization is Upper Necaxa Totonac (Beck Reference Beck and Kirchner2006; not reported in UPSID-PC), which has ejective fricatives but no ejective stops or ejective affricates. Like languages with ejective fricatives, UPSID-PC languages with ejective affricates fit neatly within the set of languages with ejective stops. The only exception is Iraqw, a Cushitic language of Tanzania which has two ejective affricates but no ejective stop (nor an ejective fricative) (Whiteley Reference Whiteley1958, see also Mous Reference Mous1993).Footnote 3 Of the ten UPSID-PC languages with ejective fricatives, only three lack ejective affricates: Berta, Koma, and Soqotri. Conversely, of the 59 languages that have ejective affricates, only seven have ejective fricatives (see Figure 1). Thus, the chance of a language having ejective fricatives seems to increase slightly if the language also has ejective affricates. Of the seven languages that have both ejective fricatives and affricates, six (Acoma, Dakota, Mazahua, Tlingit, Wichita, and Yuchi) have affricates and ejectives at the same place of articulation.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary-alt:20160922021302-84747-mediumThumb-S0025100310000319_fig1g.jpg?pub-status=live)
Figure 1 Languages with ejective stops in UPSID-PC (Maddieson & Precoda Reference Maddieson and Precoda1991) categorized by the presence/absence of ejective fricatives and/or affricates.
Ejectives are produced by trapping air between the tightly adducted vocal folds and a constriction at an oral place of articulation. To produce an ejective, the larynx should be raised, thereby compressing the air in the supralaryngeal cavity (Ladefoged Reference Ladefoged1993: 130). Upon release, ‘the entrapped high-pressure air will momentarily burst forth in a short sharp explosion’ (Catford Reference Catford2001: 22).
The release of the oral closure results in a sudden outflow of air (a glottalic egressive airstream), along with a characteristic burst of acoustic energy. After this, the laryngeal closure is released, the suprahyoid musculature relaxes, thus lowering the larynx, and pulmonic egressive airflow may course through the glottis.
The production of an ejective fricative involves an aeroacoustic dilemma. Two competing demands, stated here as directives, must be satisfied: (i) Ejectivity: Substantially increase air pressure above the larynx (to a degree greater than expected during pulmonic egressive obstruents); (ii) Frication: Vent air continuously through a narrow constriction. The venting required of a fricative may defeat the increase in pressure required of an ejective.
Maddieson (Reference Maddieson1997, Reference Maddieson1998) has put forward two mechanisms for satisfying these demands. First, he proposes reducing the size of the constriction between oral articulators to increase intraoral pressure. Thus, the fricative can be produced exclusively on the outflow of air trapped above the larynx, albeit at some cost to net flow and therefore duration. The ejective fricatives of Tlingit, with an oral constriction narrower than that of pulmonic fricatives, are of this type. They allow intraoral air pressure to achieve a higher peak than is found during pulmonic fricatives but the duration is reduced, since the supralaryngeal cavity volume is much smaller than the volume of the lungs (Maddieson, Smith & Bessell Reference Maddieson, Smith and Bessell2001).
Maddieson's (Reference Maddieson1997, Reference Maddieson1998) second proposal for resolving the aeroacoustic dilemma is to sequence frication and glottal constriction, as in Yapese. This proposal raises the question of whether such a sound still qualifies as an ejective in the classic sense, e.g. that of Catford (Reference Catford2001). After all, frication is not produced simultaneously with glottal closure, nor is the larynx necessarily raised to increase pressure during the fricative. Indeed, Maddieson (Reference Maddieson1998) concludes that Yapese ejective fricatives are not ejectives at all, but a sequence of e.g. /f/ + /ʔ/ (see Section 1.3.5 for further discussion of Yapese).
Beyond these two mechanisms, another may be posited: Frication may be preceded by a period of oral closure, allowing time for air pressure to build (we will refer to this as the affrication hypothesis). Closure plus frication is characteristic of affricates, not fricatives, and thus raises the question of whether ejective fricatives realized in this way have crossed an ontological boundary. Pre-release closures have been reported in the literature for ejective fricatives in Amharic (Demolin Reference Demolin, Gussenhoven and Warner2002) and Kabardian (Gordon & Applebaum Reference Gordon and Appelbaum2006) with little comment.
The purpose of this paper is to further elaborate the affrication hypothesis through an acoustic study of the alveolar ejective fricative in Tigrinya. The results will suggest that this fricative is often realized as an affricate, thus explaining historical confusion about the transcription of the sound, as well as dialectal variation in its realization. More generally, the results will suggest that affrication is a phonetically natural path in the evolution of ejective fricatives.
1.2 Tigrinya and its ejective fricatives
Tigrinya is a North Ethio-Semitic language spoken in northern Ethiopia and Eritrea. The Ethio-Semitic (also termed Ethiopian Semitic) family is classified as South Semitic, part of the larger West Semitic branch (Faber Reference Faber and Hetzron2007). The consonant inventory for Tigrinya is shown in Table 1. There are five ejectives: /t’ k’ kʷ’ s’ ʧ’/,
Table 1 Phonemic obstruents in Tigrinya.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary-alt:20160922021302-54987-mediumThumb-S0025100310000319_tab1.jpg?pub-status=live)
All consonants in Tigrinya, including the ejectives, may be geminate/long, although the ‘guttural’ sounds /h ħ ʔ ʕ/ appear as geminate in limited circumstances (Berhane Reference Berhane1991).Footnote 4 The sounds [p] and [ʧ] appear in loanwords, primarily from Italian and English, whereas the sound [p’] appears in Greek loanwords. [ʒ] is uncommon and varies with /ʤ/. The voiceless velar stop /k/ is realized as [x] in post-vocalic position, and the ejective /k’/ has a post-vocalic allophone commonly transcribed as [x’] (Leslau Reference Leslau1941, Schein Reference Schein, Borer and Aoun1981, Kenstowicz Reference Kenstowicz1982, Denais Reference Denais1990, Fitzgerald Reference Fitzgerald and Uhlig2006). The actual production of this sound is reported as uvular (Fre Woldu Reference Fre Woldu1985), but whether it is ejective is unclear. An acoustic and aerodynamic investigation of its properties is being undertaken by the current authors.
[s’] is described as an alveolar ejective fricative in the standard linguistic and grammatical literature on Tigrinya and usually transcribed with the letter 〈s〉 and a diacritic to indicate glottalization (Leslau Reference Leslau1941, Pam Reference Pam1973, Denais Reference Denais1990, Berhane Reference Berhane1991, Kogan Reference Kogan and Hetzron1997, Tewolde Reference Tewolde2002. However, Denais (Reference Denais1990: 32) notes that /s’/ is a fricative but has a free variant [ts’]. The sound is also transcribed as [ts’] without comment by Palmer (Reference Palmer1966) and Walther (Reference Walther1999). Finally, Fre Woldu (Reference Fre Woldu1985: 4), a native Tigrinya speaker, uses a fricative transcription, but places the sound in the affricate section of the obstruent chart. Berhane (Reference Berhane1991), another native speaker of Tigrinya, calls it a fricative. These variable transcriptions lend credence to the hypothesis that Tigrinya's ejective fricative may be realized with initial closure.
Alternation of [s’] with [ts’] is reported for other languages. In Hausa, the digraph 〈ts〉 is used in the standard orthography for what Jaggar (Reference Jaggar2001) describes as an ejective alveolar sibilant, but it is listed as an affricate in the consonant chart. Leslau (Reference Leslau1957: 156) reports that in the Amharic dialect of Godjam, [s’] is ‘often definitely pronounced as affricate [ts’]’. Bender (Reference Bender and Bright1992) reports that [s’] alternates with [ts’], but also tends to be realized as [t’] in the Shewa dialect of Amharic. Indeed, Amharic cognates of Tigrinya words containing [s’] generally show [t’] or its palatalized version [ʧ’] (Leslau Reference Leslau1987), and some words show variation, e.g. [s’omə] ~ [t’omə] ‘he fasted’ (Tadesse Reference Tadesse1992, Leslau Reference Leslau1995). In Dime (Fleming Reference Fleming and Hayward1990), [s’] and [ts’] pronunciations cannot be distinguished, at least in initial position (although Seyoum (Reference Seyoum2008) does not mention this variation). Savà (Reference Savà2005: 23) reports that the ejective affricate /ts’/ in Ts'amakko has an ejective fricative [s’] as a free variant. In Lowland Oaxaca Chontal (Maddieson, Avelino & O'Connor Reference Maddieson, Avelino and O'Connor2009), /s’/ is realized as [ts’] or [tsʔ] with a few tokens of [sʔ] or [ʔs] (see Section 1.3.6 for more discussion).
There is also historical and dialectal evidence which may support the affrication hypothesis for Tigrinya. First, Steiner (Reference Steiner1982) proposes that West Semitic *Ṣ (Ṣade) should be reconstructed as a glottalized affricate [tsˀ] rather than a fricative, and Faber (Reference Faber1985) supports and extends this position to Proto-Semitic. Reflexes of this sound appear as [ts] in Modern Hebrew, as [sˤ] in Arabic, and as [s’] in Tigrinya, e.g. [ħisˤaːr] (Arabic) ‘enclosure’ [xatsər] (Modern Hebrew) ‘enclosure, courtyard’, [ħas’ərə] (Tigrinya) ‘make a fence’. Arabic acquired secondary pharyngealization, whereas Hebrew lost glottalization. These variable reflexes could stem from an affricate pronunciation of an ejective fricative. Second, rural dialects of Tigrinya favor a palato-alveolar affricate [ʧ’] instead of [s’] in some lexical items: e.g. [ħas’ərə] ~ [ħaʧ’ərə] ‘become short’, [k’ʷɨns’i] ~ [k’ʷɨnʧ’i] ‘flea’ or [s’əmmərə] ~ [ʧ’əmmərə] ‘add, unite’. Tigre cognates also occasionally have [ʧ’]: [ʧ’amraː] ‘unite’ or [k’ars’aː] ~ [k’arʧ’aː] ‘slice, cut off’ (Tigrinya [k’ərəs’ə]) (Leslau Reference Leslau1987). Ullendorff (Reference Ullendorff1955) and Voigt (Reference Voigt1988) propose that this alternation is a type of palatalization due to influence from rounded vowels by way of labial consonants, parallel to a [s ~ ʃ] alternation. Although the viability of the rounding trigger may be questioned, it is clear that [s’] alternates not with [ʃ’], but with an affricate.
Fre Woldu (Reference Fre Woldu1985) analyzed the properties of voiceless, voiced and ejective stops in Tigrinya, but not the ejective fricative /s’/. He found that the main characteristic of ejective stops, in relation to pulmonics, was a longer burst release followed by a period of silence preceding the vowel. In perceptual experiments, native listeners reliably cued into this long silent period in order to distinguish ejective and plain voiceless stops.
Kingston (Reference Kingston1985) analyzed larynx movement for Tigrinya stops. He observed 4–6 mm of laryngeal displacement during the ejective consonants but commented that larynx elevation is variable across speakers and is too slow and ill-timed with oral closure and release to fully explain the decreased volume of the oral cavity necessary for ejectives. No data on the ejective fricatives of Tigrinya were reported.
1.3 The phonetics of ejective fricatives: A cross-linguistic perspective
In this section we present observations about ejective fricatives in those languages where the sounds have been described instrumentally.
1.3.1 Amharic
Demolin (Reference Demolin, Gussenhoven and Warner2002, Reference Demolin2004) reports the main acoustic, aerodynamic, and articulatory characteristics of the ejective fricative /s’/ in Amharic, a South Ethio-Semitic language closely related to Tigrinya.Footnote 5 The duration of the ejective fricative, both singleton and geminate, is always shorter than that of the plain fricative. The frication noise of a plain fricative has constant amplitude, but for ejectives there is a gradual increase in amplitude during constriction, and 12 ms (geminate) or 30 ms (singleton) of glottal lag before vowel onset that is not present in pulmonic fricatives. There is no discussion of a preceding silent interval characteristic of affrication, though there is one example of a recorded ejective fricative whose waveform is characterized by a (nearly) silent interval (Figure 4 in Demolin Reference Demolin, Gussenhoven and Warner2002: 469). Intraoral air pressure was measured using a small plastic tube inserted through the nasal cavity into the oro-pharynx, and oral airflow was measured using a silicone rubber mouthpiece, both pieces of equipment connected to a Physiologia workstation (Bucella et al. Reference Bucella, Hassid, Beeckmans, Soquet and Demolin2000: 449–450; Teston & Galindo Reference Teston and Galindo1990). Intraoral air pressure for the ejective rises rapidly with a dramatic decrease right after the peak, whereas the plain fricative shows a slower increase and decrease. Peak intraoral pressure is twice as high for ejectives as it is for pulmonics, but subglottal pressure is not significantly different.
Electropalatography (EPG) and static palatography reveal that the Amharic ejective fricative has wider contact in the alveopalatal region, which Demolin (Reference Demolin, Gussenhoven and Warner2002: 472) interprets as a reduction in the overall size of the oral cavity, probably intended to reduce the supralaryngeal cavity size and thereby maximize pressure. There is an alveopalatal closure observed for both singleton and geminate ejective fricatives. Demolin (Reference Demolin, Gussenhoven and Warner2002: 473) concludes that oral and glottal closures are simultaneous, but oral constriction is released before glottal closure.
1.3.2 Tlingit
Tlingit, a Na-Dene language spoken in the Pacific Northwest coast of North America has six ejective fricatives, /s’ ɬ’ x’ xʷ’ χ’ χʷ’/, five ejective stops, /t’ k’ kʷ’ q’ qʷ’/, and three ejective affricates, /ts’ ʧ’ tɬ’/ (Maddieson et al. Reference Maddieson, Smith and Bessell2001). All of these sounds have plain pulmonic counterparts. Maddieson et al. (Reference Maddieson, Smith and Bessell2001) present both acoustic and aerodynamic data for the ejective fricatives. Word-initial ejective fricatives are characterized by an approximately 46 ms silent interval following a word-initial fricative, punctuated by a release (of glottal closure) into the vowel. Word-final fricatives are flanked by silent intervals, and there is a detectable release burst approximately 39 ms after the silent period. The noisy portion of the fricative is significantly shorter in an ejective fricative than a pulmonic fricative (word-initial 148 ms vs. 222 ms and word-final 127 ms vs. 301 ms, respectively), and the noise is ‘scrapy’ (broken into a series of pulses). Maddieson et al. (Reference Maddieson, Smith and Bessell2001: 159) postulate that this scrapy quality (which they did not observe in /s’/) occurs due to intermittent closures of the oral channel, possibly related to the presence of saliva in a very narrow constriction. The scrapy quality may also be caused by friction at the aryepiglottic folds (Esling Reference Esling1996, Moisik Reference Moisik2008). Maddieson et al. (Reference Maddieson, Smith and Bessell2001: 172) explicitly reject the possibility that the scrapy quality is a glottal phenomenon. Given the high intraoral pressure observed during ejective fricatives, they argue that glottal pulsing would require an ‘improbably high’ level of subglottal pressure. Although pulmonic fricatives undergo final lengthening in Tlingit, ejective fricatives do not.
Intraoral air pressure data was recorded by speakers holding a narrow plastic tube positioned between the lips with the open end resting against the center of the hard palate. Pressure behind the constriction was detected through the tube and recorded using Macquirer hardware and software. A series of calibration recordings were then made using a U-tube manometer. The intraoral pressure trace for Tlingit ejective fricatives is parabolic, with a high peak level between 14 hPa and 17 hPa, but without a sustained peak, compared with between 5.2 hPa and 8.6 hPa for pulmonic fricatives. Air pressure is presumably maintained by forcing air through a narrower supraglottal constriction, which could also explain the scrapy, pulsing quality of the fricative noise. Maddieson et al. (Reference Maddieson, Smith and Bessell2001) propose that the silent periods flanking frication are due to glottal closure overlapping the entire fricative. The silent interval before frication appears to occur variably, as illustrated by a word with a word-medial ejective fricative uttered by two different native speakers (Dauenhauer & Dauenhauer Reference Dauenhauer and Dauenhauer1991; see Figure 2). It is not clear why the closure preceding frication is interpreted as glottal rather than oral. No data on the ejective affricates were reported to test how contrast is realized between the fricative/affricate pairs [s’] and [ts’] or [ɬ’] and [tɬ’].
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary-alt:20160922021302-18548-mediumThumb-S0025100310000319_fig2g.jpg?pub-status=live)
Figure 2 The Tlingit word ús'aa ‘soap’ produced by a female speaker (left) and a male (right), taken from audio files in Dauenhauer & Dauenhauer (Reference Dauenhauer and Dauenhauer1991). The second token is marked by a nearly silent closure before the onset of frication [(t)]. Both are marked by a period of irregular vibration or near silence after the frication, transcribed here as [ʔ].
1.3.3 Kabardian
Kabardian (Gordon & Applebaum Reference Gordon and Appelbaum2006) is a Northwest Caucasian language which has three ejective fricatives: /f’ ʃ’ ɬ’/ as well as an ejective affricate /ts’/ and a series of ejectives stops: /p’ t’ kʲ’ kʷ’ q’/. Gordon & Applebaum (Reference Gordon and Appelbaum2006) found that the ejective fricatives had shorter frication duration (average 130 ms) than their pulmonic counterparts (191 ms) in word-initial position. Frication was often characterized by pulsing noise, as in Tlingit. Ejective fricatives were characterized by a long VOT lag, or silent period before vowel onset, and were less intense than the pulmonics. Comparison of palatograms for the alveopalatal pulmonic and ejective fricatives suggests that the ejective fricative is produced with a narrower constriction with wider contact and a narrower opening in the anterior portion of the mouth. One palatogram even showed evidence of alveopalatal closure prior to frication in the ejective. Acoustic evidence of closure prior to frication was found commonly with the lateral fricative. Aerodynamic data, using the same plastic tube methodology as in Maddieson et al. (Reference Maddieson, Smith and Bessell2001), revealed a high intraoral pressure peak at the point of release. No data on the Kabardian ejective affricate /ts’/ were reported.
1.3.4 Upper Necaxa Totonac
Upper Necaxa Totonac is a member of the isolate Totonac-Tepehua family spoken in Puebla, Mexico (Beck Reference Beck and Kirchner2006). It has three ejective fricatives: /s’ ʃ’ ɬ’/, but no ejective stops. Beck (Reference Beck and Kirchner2006) collected both acoustic and aerodynamic data for the ejective fricatives. No details on the aerodynamic methodology are provided other than mention of an airflow mask and Macquirer 6.0 airflow analysis software. The peak value of intraoral air pressure was not different between ejectives and pulmonics, but the intraoral pressure does peak earlier for ejectives than for pulmonic fricatives. Air flow reaches its peak following the intraoral pressure for ejectives, but earlier than the intraoral pressure peak for pulmonics. As for duration of frication, pulmonic fricatives are shorter than ejective fricatives, the reverse of the pattern observed in Amharic, Tlingit and Kabardian. There is a brief hiatus of approximately 9 ms following frication for the ejective before vowel onset, but on the spectrograms provided, it is rather noisy.Footnote 6
1.3.5 Yapese
Maddieson (Reference Maddieson1997, Reference Maddieson1998) reports that in Yapese, an Austronesian language spoken in the Micronesian state of Yap, the ejectives /f’/ and /Ɵ’/ have similar noise durations. Intraoral pressure is the same in plain and glottalized fricatives, and only differs in that the pressure rise occurs earlier in the ejective. Maddieson (Reference Maddieson1998) concludes that they are not ejectives but a sequence of frication and glottal constriction.
1.3.6 Lowland Oaxaca Chontal
Oaxaca Chontal is reported to have ejective obstruents /f’ ɬ’ ts’ ʧ’ k’/ (Waterhouse Reference Waterhouse1962, Reference Waterhouse and McQuown1967), but Suárez (Reference Suárez1983) utilizes an affricate transcription /pɸ’ tɬ’/ instead of /f’ ɬ’/. Maddieson et al. (Reference Maddieson, Avelino and O'Connor2009) argue that the phonemic fricative notation may have some credence in that the ‘glottalized’ obstruents can be derived from fricatives via a plural formation strategy. One method of plural formation is to add a glottal stop to words ending in vowels, or glottalization to words ending in sonorants and fricatives (there are no word-final stops or affricates). Yet, there is considerable variation in how these derived glottalized fricatives are realized, as well as underived glottalized obstruents: stop, fricative and affricate productions are attested with various forms of glottalization. Maddieson et al. (Reference Maddieson, Avelino and O'Connor2009) report descriptive acoustics and did not take any aerodynamic measurements. For velars, a stop [k’] is the most common production, whereas coronal obstruents are typically realized as affricate [ts’ tɬ’]. Affricate ejectives are characterized by silent closure followed by frication, a release burst and a period of (nearly) silent glottal closure. Fricative realizations can involve pre- or post-glottalization, described as glottal constriction sequenced with frication (e.g. [sʔ] and [ʔs]); these tokens lack the glottal closure and release burst characteristic of ejectives. The pronunciation [fʔ] is the most common realization for the labial, although affricate [pf’] is also attested. From 630 tokens from eight speakers, only two tokens exhibited an ejective fricative realization: [ɬ’]. Therefore, it appears that Oaxaca Chontal does not have ejective fricatives per se, only ejective affricates, or glottal–fricative sequences, as in Yapese. The data from Oaxaca Chontal therefore lend credence to the hypothesis that ejective fricatives present aeroacoustic challenges, and one means of solving this problem is by realizing the ejective fricative via affrication.
1.3.7 Summary
In conclusion, ejective fricatives are characterized by shorter frication duration than pulmonic fricatives (except for Upper Necaxa Totonac), with a following (near-)silent lag before vowel onset, presumably associated with the time that it takes for the larynx to drop and regular vocal fold vibration to initiate under an egressive pulmonic airflow regime. The frication itself is often described as ‘scrapy’ or ‘pulsing’, perhaps indicative of an oral constriction that opens and closes quickly. Ejective fricatives generally show higher intraoral pressure peaks, typically occurring earlier than they do during pulmonics. High supralaryngeal cavity pressure is of course a hallmark of ejectives. The timing of the peak suggests that pressure builds more slowly in pulmonic fricatives, a difference that is likely to have acoustic consequences as well. Upper Necaxa Totonac is an exception to this intraoral pressure pattern. Palatograms reveal wider contact and narrower constriction, and in some cases evidence of total alveolar or alveopalatal occlusion, particularly in Amharic. Silence preceding frication is sporadically reported and was interpreted as glottal closure (without further comment) for Tlingit (Maddieson et al. Reference Maddieson, Smith and Bessell2001). This can be seen on the right-hand side of Figure 2. A summary of these characteristics for four languages, excluding Yapese and Lowland Oaxaca Chontal (because their so-called ejective fricatives have been interpreted as a sequence of fricative plus glottal stop), is shown in Table 2. None of the languages surveyed had geminate ejective fricatives.
Table 2 Characteristics of ejective fricatives compared to pulmonic fricatives (n/r = not recorded or reported).
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary-alt:20160922021302-63466-mediumThumb-S0025100310000319_tab2.jpg?pub-status=live)
1.4 The problem
The motivation for our study is based on several observations: (i) most languages that have ejective fricatives also have ejective affricates (see Figure 1); (ii) Tigrinya ejective fricatives do not contrast with homorganic affricates, but the fricatives are occasionally transcribed or described as affricates and dialectal variation along these lines is reported; (iii) linguopalatal occlusion has been reported for the ejective fricative of Amharic, a related language; and (iv) the buildup of intraoral pressure during ejective consonants can be achieved most effectively by creating a complete supralaryngeal occlusion while raising the larynx. Thus, our study will investigate the affrication hypothesis, i.e. there is a complete supralaryngeal occlusion before the onset of frication in Tigrinya ejective fricatives. If this hypothesis is correct, we expect that ejective fricative /s’/ will manifest a silent interval preceding fricative release and that pulmonic fricative /s/ will not. We will report measures of this pre-release silent interval as ‘closure duration’ throughout the paper.
We also hypothesize that closure duration will be longer in the geminate ejective fricative than the singleton. Longer closure duration is the expected measure of phonological length contrast for affricates (Pycha Reference Pycha2009).
As mentioned above, there is abundant dialectal and transcriptional evidence suggesting that the ejective fricative of Tigrinya varies with an affricate. Furthermore, Demolin's (Reference Demolin, Gussenhoven and Warner2002, Reference Demolin2004) aerodynamic and EPG measures of /s’/ in Amharic, a closely-related language, are indicative of complete linguopalatal closure. In addition, if closure is manipulated to convey length differences for the /s’/ vs. /sː’/ contrast as it does for affricates, then this further reinforces our interpretation of the closure as linguopalatal. Nevertheless, there is the possibility that the closure could be interpreted as glottal. Maddieson et al. (Reference Maddieson, Smith and Bessell2001) assume this intepretation for Tlingit /s’/ where increased intraoral pressure is interpreted as evidence of increased supraglottal constriction but not full linguopalatal closure. Despite the absence of aerodynamic or physiological data for Tigrinya /s’/, we believe that the facts are more in line with a supralaryngeal closure interpretation.
We will investigate other acoustic properties (besides closure duration) that differentiate the two types of fricatives from each other and which also differentiate Tigrinya's ejective fricative from its ejective affricate. These are release duration, preceding and following vowel duration, presence and duration of post-release laryngealization, amplitude rise slope, and rise time.Footnote 7 These measurements will allow us to present a full acoustic profile of intervocalic /s’/ in comparison to the pulmonic fricative and assess its properties as characteristic of either fricative or affricate. We predict that release duration will be shorter for the ejective fricative than the pulmonic fricative, and we anticipate that the presence of laryngealization and perhaps longer laryngealized intervals will signal ejectivity. We make no strong predictions concerning vowel duration. On the one hand, the presence of predicted closure in ejective fricatives may correlate with a shorter preceding vowel. On the other hand, if release duration is shorter for ejective fricatives, then the preceding vowel may be longer. Following vowel duration may adjust for a longer laryngealization period, with shorter vowels following ejectives. The affrication hypothesis predicts that, apart from properties associated with alveolar versus palato-alveolar place of articulation differences, /s’/ should share characteristics with /ʧ’/: similar release duration, high amplitude rise slope, and long rise time. Finally, we will discuss the likelihood that affrication is a viable strategy for satisfying the competing aeroacoustic demands involved in the production of simultaneous frication and ejectivity, particularly in languages that lack a corresponding ejective affricate.
2 Method
2.1 Speakers
Data from five male subjects between the ages of 25 and 45 years were recorded. All subjects were from Asmara, the capital city of Eritrea, and were native speakers of Tigrinya, having left Eritrea after reaching adulthood. All subjects also spoke English and Amharic. In addition, one subject spoke some Blin, a Cushitic language of Eritrea, and another was fluent in Italian. Uncompressed 16-bit wav-files of the subjects’ utterances were recorded on a Marantz PMD-660 solid state recorder. Recordings took place whenever possible in a sound-attenuated booth but alternatively in a quiet room in a private house. Subjects were recorded using a N/D267a Dynamic Cardioid table-top microphone, positioned approximately 10–12 inches from their mouths. They were instructed to remain as still as possible while speaking. Speech was sampled at 48 kHz for the first four speakers and (inadvertently) at 44.1 kHz for the fifth. Because spectral measures were not included in the analysis the discrepancy in sampling rate, though unfortunate, was not considered problematic.
2.2 Tokens
Production consisted of a word list of 183 Tigrinya verb forms drawn from Kane (Reference Kane2000) (137 target items and 46 fillers, including items with test consonants /t’/ and /ʃ/, which do not form part of our analysis here) in a carrier sentence: /təmali ___ ʔilu/ ‘Yesterday ___ he said’. These were presented to the subjects in Ethiopic writing using Abyssinica font (www.sil.org): e.g. /təmali fət’iru ʔilu/ ‘Yesterday, he created, he said’. Gemination is not normally indicated in Ethiopic writing, but can be marked with two dots over a symbol (Leslau Reference Leslau1995). Gemination diacritics were added as appropriate to ensure minimal pairs were distinguished (e.g.
/nəs’ilu/ ‘it came off the handle’ vs.
/nəs’ːilu/ ‘he separated’). Subjects were given the chance to familiarize themselves with the extra diacritic before beginning the experiment, and subsequently had no trouble with its use. Sentences were randomized and presented to subjects using Microsoft PowerPoint, with four sentences per slide. The investigator controlled slide transitions.
The word list was designed to compare several factors. Five target consonants differing in manner of articulation and airstream mechanism were selected: /s t s’ t’ ʧ’/. (We present statistical results bearing on /s s’ ʧ’/ only.) Note that Tigrinya lacks the pulmonic singleton affricate /ʧ/ except in loanwords, chiefly from Italian. Target consonants appeared in word initial position and in intervocalic medial position. Target consonants in medial position were either singleton or geminate. The vowel adjacent to the target consonant (in the antepenultimate syllable) was either low /a/, mid /ə/ (slightly fronted) or high /ɨ/. This vowel followed the target consonant in initial position or preceded it in medial position. In Tigrinya, stress is generally word-final.
Triliteral consonant roots were conjugated in different verbal paradigms to balance vocalic context in the recordings. Tigrinya triliteral verbs are classified into three lexical classes labeled Type A, B, and C (Leslau Reference Leslau1941). Each verb type has a distinctive vowel pattern and consonant gemination pattern. For example, Type B verbs are characterized by gemination of a medial root consonant in most verb forms, whereas Type C verbs are characterized by a vowel /a/ between the first two root consonants. The forms used in this study were the 3rd person masculine singular (3msg) gerundive, the 3msg infinitive and the imperative plural. The laryngeal and pharyngeal consonants /ʔ h ʕ ħ/, or ‘gutturals’, condition vowel lowering of /ə/ to [a]. This pattern permitted generation of additional word types with [a]. Representative template shapes and examples are provided in Table 3.
Table 3 Word types illustrating position and vocalic context of fricatives under investigation.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary-alt:20160922021302-36346-mediumThumb-S0025100310000319_tab3.jpg?pub-status=live)
In total, of the 137 target verbs, there were 87 different verb roots: 33 Type A (conjugated in both gerundive and imperative plural), 17 Type B (gerundive and infinitive), and 25 Type C/Type A with guttural (gerundive only). The target consonants were distributed as illustrated in Table 4.
Table 4 Distribution of target consonants, speakers, repetitions, subtotals and totals.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary-alt:20160922021302-07523-mediumThumb-S0025100310000319_tab4.jpg?pub-status=live)
Four repetitions of each token were planned for each speaker. However, not all speakers were able to complete four repetitions due to constraints of time. This amounted to two repetitions for Speaker 5 (S5), three for Speakers 3 and 4 (S3 and S4), and four for Speakers 2 and 1 (S2 and S1). Initial tokens were collected but not analyzed for this particular paper, because a word-boundary pause and supralaryngeal closure could not be distinguished based on acoustic evidence alone, thereby making conclusions about affrication tentative. However, we note that the silent interval has been interpreted as supralaryngeal closure in a recent study of word-initial affricates despite the potential ambiguity (Pycha Reference Pycha2009). Other features of initial /s’/ such as release duration, post-release laryngealization, amplitude rise slope, and rise time were important data to collect for a general descriptive overview of prevocalic /s’/, but the analysis of these data is left for future research.
Disfluencies in a corpus this large were to be expected, however, the repeated-measures design compensated for this to a large extent. In a few cases some speakers did not produce a fluent token on any repetition, so the tokens were excluded even for speakers that did produce them. These included two tokens of /s/ and one each of /sː’/, /sː/, and /ʧ’/. In these cases, speakers systematically produced an alveopalatal instead of an alveolar or vice versa.
2.3 Measurements
The tokens were analyzed using Praat 5.0.02 (Boersma Reference Boersma2001). Up to five acoustic landmarks were identified by a student annotator using a written protocol drawn up by the authors. The annotator took detailed notes of difficult and ambiguous tokens while segmenting data for a single speaker. The annotation for this speaker was reviewed, any ambiguities in the protocol were taken into account, and the protocol was revised to better suit potentially troublesome cases. Annotation of the first speaker's data, along with the rest, then commenced using the revised protocol. When the annotation was finished, the authors reviewed and corrected the annotation where necessary. The five intervals, as presented to the annotator, are described below:
Figure 3 exemplifies several tokens with the (up to) five intervals labeled. Intervals were only labeled if the corresponding acoustic event actually occurred. Tokens were marked as ‘problematic’ if vowels were mispronounced or the utterance was otherwise disfluent.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary-alt:20160922021302-39597-mediumThumb-S0025100310000319_fig3g.jpg?pub-status=live)
Figure 3 From top to bottom, the tokens [las'iju] ‘he shaved’, [ʕatʃ'ix'u] ‘he stuffed himself with food’, [ʕas'ːimu] ‘it was hard and stiff’ (uttered by S4), and [masisu] ‘he caressed’ (uttered by S3). Up to five intervals were labeled according to definitions set forth in Section 2.3: V1 = Vowel 1; C = Closure; R = Release; Q = Post-release laryngealization; V2 = Vowel 2.
Using a comprehensive script written by the first author, the following measurements were taken in Praat 5.0.02. Closure, release, and laryngealization duration were measured based on the boundaries chosen by the annotator. The RMS amplitude of the release was calculated in three 10 ms frames arrayed across each release. The left and right edges of the first and third frames were aligned with the beginning and end of the release. The second frame was positioned so that its center occurred at the center of the fricative. The content of each frame was transformed with a Hamming window. The values were normalized as a function of the average RMS intensity of the entire VCV sequence (measured using Praat's Get-root-mean-square function applied to the relevant annotated material), and a regression line was then fitted to the points. Finally, the slope of this line was calculated to indicate how the amplitude of release changed as a function of time (Mitani, Kitama & Sato Reference Mitani, Kitama and Sato2006; see Figure 4 below).
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary-alt:20160922021302-06594-mediumThumb-S0025100310000319_fig4g.jpg?pub-status=live)
Figure 4 Method for calculating amplitude rise slope exemplified. The fricative is /s'/ from the word /las'iju/ ‘he shaved’. The fricative was divided into three 10-ms frames and the RMS value of each frame was computed (the black dots). The dashed least-squares regression line represents the amplitude slope (in this case negative) of the fricative.
Rise-time of release was calculated by locating the absolute maximum amplitude of the release and determining the point in time when this event occurred (Johnson Reference Johnson2003). This measure was normalized by dividing the time stamp of maximum absolute amplitude by the duration of the entire release.
2.4 Statistical analysis
Using the reshape function (Recast Package 0.8.3) in R 2.8.1, data frames were pre-processed for a repeated measures design: Measures for all repetitions of a single token uttered by a single speaker were averaged. Next, using R's Linear and Nonlinear Mixed Effects (NLME) Package 3.1–92, linear mixed effects (LME) models were fitted to the data by restricting maximum log likelihood (REML). For the LME models, Speaker was stipulated as a random effect. We chose not to use traditional linear models, because in this case Speaker is a factor without repeatable levels. It is generally recommended that so-called random effects terms (based on non-repeatable factors) be modeled as random effects, i.e. with a mean of zero and unknown variance (Baayen Reference Baayen2008: 241–242). LMEs have the advantage of incorporating factors that are repeatable (like Airstream) with factors that are non-repeatable (like Speaker). Chi-square tests were used to analyze raw count data, which were not averaged across repetitions.
Linear mixed effects models were designed for all fricatives with Airstream (pulmonic, ejective) as fixed effect, and Speaker/Airstream as a nested random effect (Laird & Ware Reference Laird and Ware1982). In relevant cases, Phonological Length was considered a fixed effect. Inspection of variables revealed that distributions for durations of V1, Closure (C), Release (R), Laryngealization (Q), and V2 were right-skewed. Duration variables were transformed logarithmically to better approximate a normal distribution.
3 Results
3.1 Descriptive statistics: Repeated measures averages
In this section, we report values from seven measurements with repeated measures averages (Table 5).
Table 5 Repeated measures averages with standard deviations in parentheses. Starred (*) units indicate some kind of normalization (see Section 2.3).
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary-alt:20160922021302-17014-mediumThumb-S0025100310000319_tab5.jpg?pub-status=live)
3.2 Linear mixed effects models
3.2.1 Ejective and pulmonic fricatives
Pulmonic fricative releases were longer than ejective fricative releases by a 5:3 ratio. Laryngealization was 13 ms after the pulmonic fricative but more than twice as long (31 ms) for the ejective. Pulmonic releases were longer than ejective releases by 101 ms for geminates and 33 ms for singletons. Amplitude slope was greater for ejective than pulmonic fricatives. For ejective and pulmonic fricatives, closure duration could not be analyzed using a repeated measures design because no speakers produced pulmonic fricatives with closure. Only four speakers produced pulmonic fricatives with laryngealization. Details of the linear mixed effects models for fricatives are presented in Table 6 and Figure 5.
Table 6 Results of linear mixed effects models for fricatives with Airstream as fixed effect and Speaker as random effect.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20160921004216607-0397:S0025100310000319:S0025100310000319_tab6.gif?pub-status=live)
* = p < .05; ** = p < .01
nDF = numerator degrees of freedom; dDF = denominator degrees of freedom; F = F-statistic; p(F) = probability
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary-alt:20160922021302-30213-mediumThumb-S0025100310000319_fig5g.jpg?pub-status=live)
Figure 5 Repeated measures (averages) pertaining to mixed linear effects model presented in Table 6. X-axis labels are speaker identifiers.
3.2.2 Ejective fricatives and ejective affricates
Using Manner as a fixed effect for ejective fricatives and affricates, it was shown that Manner is associated with closure duration, F(1,4) = 10.09 (p < .05) (see Figure 6). Closure duration was greater for affricates than fricatives by approximately 25 ms in the case of both singletons and geminates.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary-alt:20160922021302-50052-mediumThumb-S0025100310000319_fig6g.jpg?pub-status=live)
Figure 6 Boxplots of logarithmically-transformed closure duration for ejective fricatives and (ejective) affricates.
3.2.3 Ejective fricatives
Here we investigate the characteristics that differentiate phonological length among ejective fricatives. Closure duration was significantly longer for geminates than singletons (by about 13 ms), F(1,4) = 34.31 (p < .01). Relevant boxplots are given in Figure 7.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary-alt:20160922021302-87325-mediumThumb-S0025100310000319_fig7g.jpg?pub-status=live)
Figure 7 Boxplots of log-transformed closure duration of geminate and singleton ejective fricatives.
For ejective fricatives alone, a linear mixed effects model was implemented with Presence of Closure as a fixed effect to better understand whether closure affects the acoustics of ejective fricatives. No significant effects were found for any measure.
3.2.4 Ejective fricatives without closure and pulmonic fricatives
20% (66/327) of ejective fricatives in the corpus did not manifest closure. In this section, we address how ejective fricatives without closure differ from pulmonic fricatives. All ejective fricatives with closure were removed from the corpus and linear mixed models were used in a repeated measures design. This was complicated somewhat by the fact that S4 produced all ejective fricatives with closure, so his data were necessarily removed from the analysis. The purpose of the analysis was to judge the extent to which other acoustic cues may differentiate ejective and pulmonic fricatives in the absence of closure. Results are presented in Table 7 and Figure 8. Glottal duration was significantly longer for the no-closure ejective fricative than the pulmonic fricative and amplitude slope was significantly greater as well. This suggests that even in the absence of pre-release silence, there are other important cues signaling the difference between ejective and pulmonic fricatives.
Table 7 Results of linear mixed effects models with Speaker as random effect. The corpus contains ejective fricatives with no closure (NC) and pulmonic fricatives.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20160921004216607-0397:S0025100310000319:S0025100310000319_tab7.gif?pub-status=live)
* = p < .05; ** = p < .01
nDF = numerator degrees of freedom; dDF = denominator degrees of freedom; F = F-statistic; p(F) = probability
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary-alt:20160922021302-73262-mediumThumb-S0025100310000319_fig8g.jpg?pub-status=live)
Figure 8 Boxplots relating to the linear mixed effects models presented in Table 7.
3.2.5 Ejective fricatives without closure and ejective affricates
Ejective fricatives without closure and affricates were grouped together to determine the characteristics that differentiate them (besides the absence of closure). As in Section 3.2.4, S4's data were removed because all of his ejective fricatives manifested closure. Only release duration was associated with Manner, F(1,3) = 32.90 (p < .05). Release duration of no-closure ejective fricatives was on average 38 ms greater than the release duration of affricates (for geminates) and 45 ms greater for singletons. Relevant boxplots are presented in Figure 9.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary-alt:20160922021302-49516-mediumThumb-S0025100310000319_fig9g.jpg?pub-status=live)
Figure 9 Boxplots pertaining to linear mixed effects models for fricatives without closure and affricates.
3.3 Count data: Presence of closure and laryngealization
A Chi-square test was used to determine whether or not the proportions of pulmonic and ejective fricatives that manifested closure differed significantly. The results indicate that they did: χ2(1, N = 605) = 569.15, p < .001. Counts for these variables are given in Table 8.
Table 8 Counts of ejective and pulmonic fricatives with respect to closure.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20160921004216607-0397:S0025100310000319:S0025100310000319_tab8.gif?pub-status=live)
Geminate ejective fricatives were significantly more likely than singleton ejective fricatives to manifest closure; χ2(1, N = 327) = 14.05, p < .001 (Table 9).
Table 9 Counts of ejective fricatives manifesting closure, grouped by phonological length.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20160921004216607-0397:S0025100310000319:S0025100310000319_tab9.gif?pub-status=live)
Affricates were more likely to manifest closure than ejective fricatives; χ2(1, N = 548) = 35.98, p < .001 (Table 10).
Table 10 Counts of ejective fricatives and affricates with respect to closure.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20160921004216607-0397:S0025100310000319:S0025100310000319_tab10.gif?pub-status=live)
Ejective fricatives were more likely to manifest laryngealization than pulmonic fricatives; χ2(1, N = 605) = 256.72, p < .001 (Table 11).
Table 11 Counts of ejective and pulmonic fricatives with respect to laryngealization.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20160921004216607-0397:S0025100310000319:S0025100310000319_tab11.gif?pub-status=live)
Just over 78% (256/327) of ejective fricatives manifested both closure and laryngealization, whereas only about 2% (5/327) of ejective fricatives manifested closure without laryngealization. Because ejective fricatives always manifested either closure or laryngealization, and because 20% (64/327) manifested laryngealization without manifesting closure, the relationship between closure and laryngealization for ejective fricatives alone was not significant; p > .05 (Fisher's Exact Test for Count Data). These counts are listed in Table 12.
Table 12 Counts of ejective fricatives with respect to closure and laryngealization.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20160921004216607-0397:S0025100310000319:S0025100310000319_tab12.gif?pub-status=live)
4 Discussion
In support of the aeroacoustic affrication hypothesis, we find that phonological ejective fricatives are more likely to manifest pre-release silence than phonological pulmonic fricatives. Ejective fricatives manifest pre-release silence in 80% (261/327) of tokens. We interpret pre-release silence as linguopalatal closure on the basis of comparative palatographic and aerodynamic evidence in the related language Amharic (Demolin Reference Demolin, Gussenhoven and Warner2002, Reference Demolin2004). We do not, however, present conclusive physiological evidence for the realization of Tigrinya /s’/ as [ts’].
Affricate closure duration is on average 25 ms (50%) longer than ejective fricative closure, a statistically significant difference that may (in addition to place of articulation differences) help distinguish /s’/ from /ʧ’/ or at least prevent /s’/ from undergoing reanalysis as /ts’/.
As predicted, closure duration differentiates phonological length for the ejective fricatives /sː’/ and /s’/ (by about 13 ms). Closure duration also distinguishes geminate and singleton affricates by a margin of approximately 14 ms (see Table 5 above). The geminate ejective fricative /sː’/ was significantly more likely than the singleton ejective fricative /s’/ to manifest closure. We find that when closure is present geminate /sː’/ is realized as [tːs’], i.e. with a longer silent interval than the realization of singleton /s’/ → [ts’]. This is reminiscent of findings by Pycha (Reference Pycha2009: 21), who argues that among Hungarian affricates ‘phonological lengthening exclusively increases closure duration’ rather than release duration. To the extent that pre-release closure is associated with the phonological length of affricates (and our findings show that it is), we suggest that Tigrinya's ejective fricative, when accompanied by closure, is phonologically similar to an affricate.
For fricatives, pulmonic release is significantly longer than ejective release. The period of frication is 101 ms (110%) longer for pulmonic geminates and 34 ms (37%) longer for pulmonic singletons. This observation was expected and is supported by similar findings for Amharic (Demolin Reference Demolin, Gussenhoven and Warner2002, Reference Demolin2004) and Kabardian (Gordon & Applebaum Reference Gordon and Appelbaum2006). The effect has been attributed to the limited volume of air available between the glottis and supralaryngeal constriction for the ejective fricative. By contrast, pulmonic fricatives have access to a much larger volume of air contained in the lungs. Relatively high airflows are more sustainable during pulmonic fricatives (Maddieson Reference Maddieson1997, Maddieson et al. Reference Maddieson, Smith and Bessell2001). Further aerodynamic research could shed light on this issue.
Release duration does not appear to signal phonological length among ejective fricatives. This result is intriguing, since release duration does distinguish geminate pulmonic fricatives from singleton pulmonic fricatives in Tigrinya as well as other languages, e.g. Swedish (Löfqvist Reference Löfqvist2005). Instead of release duration, it is closure duration that signals phonological length among Tigrinya ejective fricatives, much as it does for affricates in Hungarian (Pycha Reference Pycha2009). Release duration is not a significant cue for phonological length even among the subset of ejective fricatives that lack closure.
While our findings show that affrication occurs in a majority of ejective fricatives, this leaves a minority of ejective fricatives that are crucially not distinguished by closure before release. These tokens provide a window to acoustic cues (besides closure) that may signal ejectivity among fricatives, and also allow us to compare differences between [s’] and [ts’], both surface realizations of Tigrinya's ejective fricative /s’/. Release duration is one such acoustic cue, and pulmonic fricatives do still have longer release duration than ejective fricative tokens with no closure. However, release duration still contributes relatively little, if anything, to the phonological length distinction among ejective fricatives, even for the cases with no closure. This leads us to speculate that phonological length may be neutralized when ejective fricative closure is lost.
Laryngealization duration, or the amount of time it takes for regular vocal fold vibration to initiate after release, is significantly greater for ejective fricatives in relation to pulmonics, a characteristic also observed for other languages with ejective fricatives. This most likely stems from the increased time it takes for the larynx to lower from its elevated position and/or for the appropriate pressure differential across the glottis to be reached, thus giving rise to modal voicing. Fre Woldu (Reference Fre Woldu1985) demonstrates that this period of relative silence is the weightiest psychoacoustic cue in distinguishing pulmonic and ejective stops.
A considerable amount of variation has been reported in the VOT (voice onset time) of ejective stops. VOTs with an average of 12 ms have been reported in Abkhaz (Catford Reference Catford, Bless and Abbs1983) while a mean VOT of 108 ms (SD = 31) has been reported for coronal ejective stops in Navajo (McDonough & Ladefoged Reference Ladefoged1993: 154). Our results show that the average post-release laryngealization (or VOT) of Tigrinya's alveolar ejective fricative is approximately 31 ms. Kingston (Reference Kingston1985) uses VOT as one of several parameters to typologize so-called tense and lax ejectives.Footnote 8 The lax variety has shorter VOTs, so our study suggests that Tigrinya's ejective fricatives, at least in terms of this parameter, are ‘lax’. VOTs measured by both Kingston (Reference Kingston1985) and Fre Woldu (Reference Fre Woldu1985) seem to indicate that Tigrinya's stops are ‘tense’, i.e. they have longer VOT. Laryngealization, however, is present even among pulmonic fricatives, albeit at a significantly reduced frequency and duration. This non-ejective laryngealization most likely has its origin in the wide adduction of the vocal folds observed during the production of high-airflow voiceless fricatives like [s] (Ohala Reference Ohala1993: 158).
Laryngealization is present in almost all cases of ejective fricatives, whether they manifest closure or not. Indeed, there is no statistically significant relationship between laryngealization and closure among ejective fricatives. Thus, it is reasonable to suggest that, in the absence of closure, laryngealization still helps to signal the difference between ejective and pulmonic fricatives, just as it does when closure is present. While laryngealization usually occurs on [ts’] realizations of /s’/, ejective fricatives without closure still have significantly longer laryngealization than pulmonic fricatives.
Vowel duration also correlates with ejective vs. pulmonic fricative status. Vowel duration is significantly longer before ejective fricatives than pulmonic fricatives. This duration difference may be connected to compensation for the shorter release duration of ejective fricatives. We found no significant results for the vowel following the test consonant.
Amplitude slope (Mitani et al. Reference Mitani, Kitama and Sato2006) is a measure of how quickly a release increases in its amplitude. We find that this measure is associated with airstream mechanism for Tigrinya fricatives. Ejective fricatives have a higher amplitude slope than pulmonic fricatives, meaning that, in effect, ejectives become louder more quickly than pulmonics. This is the case whether closure is present or not. High amplitude slope is associated with abrupt bursts and thus has been shown to differentiate affricates and fricatives (Weigelt, Sadoff & Miller Reference Weigelt, Sadoff and Miller1990; Mitani et al. Reference Mitani, Kitama and Sato2006). Indeed, ejective fricative tokens with closure manifest higher amplitude slope than those without, in support of the affrication hypothesis. However, amplitude slope could not distinguish ejective fricatives and affricates in Tigrinya, whether or not the fricatives manifested closure.
The results for rise time, or the amount of time it takes to reach the sound pressure peak after release, were not significant. Long rise times are reportedly typical of fricatives, not affricates. Indeed, it has been claimed that rise time is a salient cue for the fricative–affricate distinction (Cutting & Rosner Reference Cutting and Rosner1974, Howell & Rosen Reference Howell and Rosen1983, Johnson Reference Johnson2003). Because we found rise time to be insignificant in distinguishing ejective fricatives from pulmonic fricatives, we wonder if the predictions of Howell & Rosen's (Reference Howell and Rosen1983) model hold only for consonants produced by the same airstream mechanism (Cutting & Rosner Reference Cutting and Rosner1974 and Howell & Rosen Reference Howell and Rosen1983 investigated pulmonics only). Indeed, to the extent that the so-called ‘pulsing’ or ‘scrapy’ quality is a characteristic of Tigrinya ejective fricatives, Howell & Rosen's model (designed for smoothly rising amplitudes) may be inappropriate. We make no quantitative observations regarding the presence or absence of this ‘scrapy’ quality among Tigrinya ejective fricatives because we are unsure how to quantify this attribute. It is not clear, for example, whether the ‘pulses’ in an ejective fricative would be of roughly equal amplitude. Without knowing this, it seems an intractable problem to conclude whether the mathematical maximum is indeed the ‘true’ maximum amplitude of the fricative. Without a basic definition of what differentiates a ‘pulse’ of an ejective fricative from an otherwise noisy envelope, we doubt whether ‘scrapiness’ can be used in any consistent or falsifiable way (as noted earlier, the scrapy quality may arise from pharyngeal constriction; Esling Reference Esling1996, Moisik Reference Moisik2008). We note that more recent studies have challenged Cutting & Rosner (Reference Cutting and Rosner1974) and Howell & Rosen (Reference Howell and Rosen1983) on the issue of rise time. For example, both Diehl & Castleman (Reference Diehl, Castleman, Simpson and Pätzold1996) and Kluender & Walsh (Reference Kluender and Walsh1992) consider rise time an insufficient cue for distinguishing fricatives and affricates. Unfortunately, our results shed no further light on the issue.
In a majority (80%) of tokens, Tigrinya's ejective fricative /s’/ shows evidence of closure, a characteristic shared with affricates. The pre-release closure is significantly shorter than the pre-release closure of the phonological ejective affricate; moreover, closure is not present in all of the tokens recorded. However, closure duration, not release duration, signals the difference between singleton and geminate ejective fricatives, a pattern shared with affricates. Conversely, pulmonic fricatives manifest length contrast through release duration. Besides closure duration, ejective fricatives are distinguished from pulmonic fricatives by a number of characteristics including preceding vowel duration, release duration, and amplitude slope. Ejective fricatives are also typified by a greater likelihood of post-release laryngealization. Based on this evidence it would be premature to categorically reclassify Tigrinya /s’/ as an affricate, primarily because pre-release silence does not occur in all tokens and there are several other acoustic cues signaling the difference between ejective and pulmonic fricative (not least of all is release duration). However, it seems reasonable to cite Tigrinya /s’/ → [ts’] as an example of a phonetically-natural, aerodynamically-conditioned sound change in progress. Because Tigrinya has no homorganic affricate to pair with /s’/, the production of [ts’] has no consequence on the phonological inventory of the language, e.g. creating pressure on a hypothetical /ts’/ phoneme. As shown, however, there are phonological implications for the realization of quantity when it comes to Tigrinya /s’/: instead of increasing release duration, as is typical of long fricatives, Tigrinya's phonologically long /s’/ is realized with a lengthened closure, typical of affricates (Pycha Reference Pycha2009).
5 Conclusion
The realization of ejective fricatives tests the limits of human vocal production by combining two production mechanisms that, from a certain point of view, seem irreconcilable: ejectivity and frication. Based on the typological rarity of the sound, one might conclude that its production characteristics make it unstable across generations of speakers. However, this would miss the interesting typological and phonetic relationship between ejective fricatives and affricates known to exist in the languages of the world. If we sample only those languages that have ejective obstruents, the subset of those languages with ejective affricates is more likely to have ejective fricatives than a subset without ejective affricates. A brief inspection of the design of the human vocal tract and the requirements of ejectivity lead us to believe that this is not by chance. After all, the best way to realize an ejective is to seal and compress the column of air between the glottis and an oral closure; the best way to realize a fricative is to continuously vent air from the vocal tract. Thus it is not surprising that, for ejective fricatives, we see variation precisely in terms of the seal's presence or absence.
Transcriptions of Tigrinya are remarkably variable in capturing the realization of /s’/. This can be attributed either to misperception on the part of the linguists who have documented Tigrinya or actual variation in the realization of the consonant across dialects, speakers, and words. We have shown instrumentally that variation indeed exists. Since Tigrinya also happens to have the ejective affricate /ʧ’/, our analysis focused on the acoustic differences between the fricative and the affricate and between forms of the ejective fricative realized with and without closure. In reaching our conclusion that Tigrinya's ejective fricative is generally realized as an affricate, we are aware that the greatest challenge in extending our results to other languages is that Tigrinya itself does not have a phonemic ejective fricative and a phonemic ejective affricate at the same place of articulation. To understand whether affrication of ejective fricatives is widespread, a useful study would be to test the phonetic differences (if any) between ejective fricatives and affricates at the same place of articulation, e.g. in Mazahua, Tlingit, and Yuchi (all of which have /s’ ts’/), Kabardian (/s̪’ ts̪’/), or Acoma (/ʈʂ’ ʂ’/) (Maddieson Reference Maddieson1984).
We conclude that ejective fricatives in Tigrinya are frequently realized with complete oral occlusion before release, a point that awaits further verification using EPG technology. While affrication satisfies the unique aeroacoustic demands of ejectivity, it simultaneously changes the category of /s’/ from fricative to affricate. The absence of an ejective affricate at the same place of articulation arguably gives speakers of Tigrinya the freedom to variably affricate /s’/. The path from realization of [s’] to [ts’] seems natural, though it is unclear how long or whether a stable phonological distinction can truly be maintained between the two. This necessarily raises interesting questions about the 1.3% (6/451) of the world's languages (Maddieson & Precoda Reference Maddieson and Precoda1991) that reportedly maintain such an opposition and sets the bar higher for fully describing their acoustic, aerodynamic, and articulatory production.
Acknowledgements
We are very grateful to the Tigrinya speakers who participated in this study: Aki Ghebre, Marco Pedulli, Saad Ali, Teamrat Adhanom, Zerai Araia and Beraki Woldeabzghi. We thank John Esling and two anonymous reviewers for their helpful comments and suggestions on earlier versions of this manuscript. We acknowledge the comments and questions of audience members at the annual meetings of the LSA (2008) and NACAL (2008), in particular Gene Buckley, Scott Henderson, Grover Hudson, and Ian Maddieson. Melinda Fricke and Megan Osfar provided valuable service as research assistants. All errors and omissions are entirely our responsibility.