Hostname: page-component-6bf8c574d5-mggfc Total loading time: 0 Render date: 2025-02-21T04:27:50.594Z Has data issue: false hasContentIssue false

Clicks and percussives in English conversation

Published online by Cambridge University Press:  04 November 2013

Richard Ogden*
Affiliation:
Department of Language & Linguistic Science, University of Yorkrichard.ogden@york.ac.uk
Rights & Permissions [Opens in a new window]

Abstract

Clicks are known to occur in English conversation, and have traditionally been assumed to convey affective meaning, generally negative. This is indeed the lay interpretation of clicks. In this paper, we build on the work of Wright (2011a, b), who shows that clicks are also used in the management of sequences of talk. Firstly, we consider similarities and differences in the production of clicks and percussives. We consider some distributional properties of clicks in one variety of English. Drawing on a wide range of conversational data from Britain and the USA, we show some of the functions of clicks and percussives in conversation, which include stance-taking and sequence management (projection of a turn constructional unit and word-search), and handling aspects of timing between turns. We also consider some of the visual behaviours that may accompany clicks. The meaning or function of clicks and percussives, it is argued, must be considered in a fuller interactional context.

Type
Research Article
Copyright
Copyright © International Phonetic Association 2013 

1 Introduction

Although clicks are well known to occur in English conversation, there remain few studies which treat them anything more than anecdotally. In this paper, we survey some of the main forms and functions of clicks and their relations, percussives, from several collections of conversational data, and suggest possible future research.

Clicks in English are a worthwhile object of study for several reasons. Although they do not occur in lexical items in English, they are widely assumed, both by linguists and lay people, to have a communicative function. A study of clicks is therefore an exercise in excavating non-lexical function and meaning from everyday talk. This paper is a contribution to a growing tradition that approaches the problem by combining phonetic analysis with conversation analytic techniques as a way to demonstrate how participants themselves understand and orient to one another's displayed meanings (see e.g. Ogden Reference Ogden and Ogden2012a). Clicks have been shown to be involved in managing conversation (Wright Reference Wright2011a, Reference Wrightb), and are frequently implicated in displays of stance or affect (Reber Reference Reber2012). While there are theories which aim to connect the form and function of prosodic features such as loudness or intonation contours (see e.g. Gussenhoven Reference Gussenhoven2004 for an account using biological codes), such theories typically consider ways of modifying lexical material, and have little or nothing to say about punctual – i.e. instantaneous, transient – phonetic events like clicks.

Clicks are embedded within stretches of speech, and are ‘in’ speech but not ‘of’ it. They are sounds made in the vocal tract alongside speech, not as part of the lexical content of the language, but clearly as a resource for making meaning. The same claim could be made for many co-speech sounds, such as laughter (see e.g. Kohler Reference Kohler2007 for a description of varieties of co-speech laughter, and e.g. Jefferson Reference Jefferson, Atkinson and Heritage1984, Potter & Hepburn Reference Potter and Hepburn2010 on the positioning of laughter in conversation), sighing, sobbing, or audible inhalation and exhalation. As Laver (Reference Laver1994) points out, communication consists of several semiotic systems. In developing an understanding of the functions of clicks, we are exploring a sound that is describable in phonetic terms but which is considered normally to be at the margins of language itself, along with other semiotic systems such as gesture, facial expression, and other vocal and bodily movements and sounds. A study of clicks in a language where they have non-lexical functions, then, necessarily involves thinking about where the boundaries of language lie.

Gil (Reference Gil, Dryer and Haspelmath2011) claims that cross-linguistically clicks are typically paralinguistic in function because they are ‘not integrated into morphological and syntactic structures, [and] convey a restricted range of meanings, some of which are associated with the expression of emotions’ (p. 1). He claims that in some languages – Hebrew, Avar, Somali, Fula and Malayalam, among others – clicks are able to serve as a response to yes/no questions. In both cases, clicks resemble linguistic signs in that they are arbitrary and conventionalised, even though they are not lexical.

For English, the primary function of clicks is generally held to be stance-taking. Clicks are mostly – it is claimed – negatively valenced. Wright (Reference Wright2011a: 208) provides a summary: ‘disapproval (Ladefoged Reference Ladefoged1982: 124; Crystal Reference Crystal1987: 126), annoyance (Abercrombie Reference Abercrombie1967: 31; Ball Reference Ball1989: 10), irritation (Gimson Reference Gimson1970: 34), exasperation (Laver Reference Laver1994: 175), impatience (Laver Reference Laver1994: 175), regret (Clark & Yallop Reference Clark and Yallop1990: 18), sympathy (Gimson Reference Gimson1970: 34) and encouragement (Gimson Reference Gimson1970: 34; Abercrombie Reference Abercrombie1967: 31; Ladefoged Reference Ladefoged1982: 124; Laver Reference Laver1994: 177)’. Most of these labels come from anecdotal observation, and few are exemplified using empirical data. Reber (Reference Reber2012) claims, from an examination of natural conversations, that stand-alone clicks as responses to complaints about third parties or external events display a weak affiliative stance as opposed to strong moral indignation, which is done through expanded turns at talk. Ward (Reference Ward2006) subsumes the meanings of clicks under ‘personal dissatisfaction’, perhaps with something another person has said, the current topic, or (in the case of word search) with one's own performance. As we shall see, the meaning of clicks is more nuanced than this characterisation indicates; and we shall also argue against the essentially mono-systemic account of clicks which Ward assumes.

The lay interpretation of clicks in English-speaking cultures agrees that clicks display a negative stance or affect. This is conventionalised through the word ‘tutting’, which refers not just to the sound of a click, but also to the negative stance that clicking conveys. Example (1), taken from a radio phone-in in Britain, illustrates this well. (Transcription conventions are explained in Section 3 below.)

  1. (1) vegtalk 19.12.03.03:33

In lines 1–4, the caller, Ellie, describes an embarrassing situation resulting from her own inappropriate behaviour (line 3). In line 5, the presenter makes two click sounds, [! !], displaying an understanding of the situation as one which elicits disapproval. In line 7, he glosses this expression of disapproval as tutting. The verb form in line 8, along with his display of sympathy towards Ellie in line 9, retrospectively cast this disapproval as originating from the really embarrassed family watching Ellie as she made a spectacle of herself. Tutting, then, is a recognisable activity, occasioned by socially sanctionable behaviour.

Stance-taking is not the only use of clicks. Wright (Reference Wright2005, Reference Wright2007, Reference Wright2011a, Reference Wrightb) shows that in normal conversation, clicks can also handle aspects of sequence management. They commonly occur at boundaries between sequences, such as those found at the end of a phone call, where the speakers have to close down a current topic before initiating the closing of the call. This can be seen in Example (2): lines 1–5 close a prior sequence, and line 7, [!] okay, well then, initiates the closing of the call with arrangements for next seeing each other (see e.g. Schegloff & Sacks Reference Schegloff and Sacks1973, Button Reference Button, Button and Lee1987 on call-closing practices in English-speaking cultures).

  1. (2) ESF/O41 SBL:2:2:3:R:60–61

Wright (Reference Wright2005) also observes that clicks are commonly produced during word searches. We will extend Wright's work on word searches in this study, but will not provide more instances of clicks which index a new sequence.

The structure of the rest of this paper is as follows: First, we consider the phonetics of clicks and their close relatives, percussives. Next, we present some simple quantitative data on the distribution of clicks and percussives in a sample of English conversational data; and then we consider more closely the forms and functions of clicks in some of the sequences of talk where they are found. We will also look at aspects of timing of clicks and accompanying visible actions. We will argue, on the basis of the data presented, that clicks in English are implicated in three distinct systems with distinct functions but partially overlapping phonetic (and kinetic) exponents: (i) marking incipient speakership; (ii) handling aspects of sequence management; and (iii) as part of a display of stance.

2 Phonetic overview

Clicks are ingressive, velarically initiated suction stops (Catford Reference Catford2001). They can be co-produced with various articulatory accompaniments, such as nasal airflow, voicing, and phonation types (Ladefoged & Traill Reference Ladefoged and Traill1994). In English, most such accompaniments are rare. There is occasional aspiration, but clicks are often followed by one of a large number of response tokens such as oh, ah, aw, oo, which – in contrast to the click itself – may be articulated in a wide range of ways, including phonatory setting, loudness, duration, voice quality, vowel quality, and glottal stop initiation. These are not the main focus of this paper, but, as will be clear, these tokens and their modifications are crucial in the display of affect. The International Phonetic Alphabet classifies clicks by place of articulation. Nonetheless, Ladefoged & Maddieson (Reference Ladefoged and Maddieson1996) argue that the articulation of clicks is complex, and place of articulation may not be the most satisfactory or explanatory way to describe them: the speed of release, the amount of tongue tension, suction, and tongue shape are all important aspects of the production of clicks with complex acoustic correlates.

Given the difficulties mentioned, we represent lingual English clicks in transcription with the symbols [!] and [ǁ]. [!] is used to refer to a click articulated with the tongue tip, with central release which is generally slow and affricated. The closure may be dental or alveolar (Wright Reference Wright2011a: 215), but the release nearly always has affrication, which implies a relatively slow release and a relatively large contact between the tip/blade of the tongue and the roof of the mouth. This is the click generally represented in English texts as ‘tsk-tsk’ or ‘tut-tut’. This click is ubiquitous in English. The second kind of click that is common in English, but considerably less common than [!], is a lateral one, [ǁ]. This click also has closure with the tongue tip which impressionistically ranges from dental to palatal (and is retroflex), but the release is lateral. Compared to [!], the lateral click has a deeper, hollower ring, and a sharper release. This click is comparatively rare in the data in this paper.

Closely related to clicks in English are sounds made as the articulators separate and the speaker prepares to speak (Scobbie, Schaeffler & Mennen Reference Scobbie, Schaeffler and Mennen2011). We call these sounds percussives, following Pike (Reference Pike1943: 103–105):

Percussors differ from initiators in several ways: in opening and closing they move perpendicularly to the entrance of the air chamber . . .; they produce no directional air current, but merely a disturbance that starts sound waves which are modified by certain cavity resonators; they manifest their releasing or approaching percussive timbre only at the moments of the opening and closing of some passage . . . (Pike Reference Pike1943: 103)

Articulations which Pike specifically names as percussives are the opening and closing of the lips, the tongue making closure at the alveolar ridge, the velum closing, the vocal folds making a glottal closure, and the sublaminal percussive of the ‘cluck click’ as in ExtIPA's [!¡]. For Pike, then, percussives arise from either separation of articulators or the striking together of articulators. Percussives are not audible unless the movement is ‘sharp, forceful and rapid’ (Pike Reference Pike1943: 103), and when the resonating cavity is closed. Pike describes the initial burst of plosives as percussive, but says that it is not convenient to separate them from the rest of the plosive. In this paper, then, we will use ‘percussive’ to describe sounds arising from the audible separation of articulators which are most likely to be produced as pulmonic ingressive plosives; but the term ‘plosive’ suggests a deliberate articulation of a sound found in English words, and it is not clear that percussives are always deliberate. They seem, rather, to be often ‘organic’ or ‘vegetative’ sounds which arise as participants position themselves as an incipient speaker audibly gearing up to speak. These articulations serve as an audible and often visible signal that an individual is preparing to speak.

The Extensions to the IPA offer the symbols [ʬ] and [ʭ] for bilabial and bidental percussives respectively. These symbols refer to ‘the striking together of two rigid or semi-rigid articulators’ (IPA 1999: 187), a usage of the term ‘percussive’ which is more restrictive than Pike's. The sounds discussed here are generated by the separation of articulators, which makes [ʬ] and [ʭ] unsuitable. We shall therefore use instead a combination of the symbol for pulmonic plosives, to represent the location of the release, and the symbol for ingressive airflow, [↓]. Audible in-breaths are a regular feature before and after both percussives and clicks; so it is pragmatic and linguistically informative to transcribe all percussives in a consistent manner, as [p↓, ↓, ph↓], etc. The in-breath is most audible after the release of the percussive or click, but quite often inhalation is audible and visible on spectrograms.

Acoustically, clicks are expected to show two transients corresponding to the anterior and velar releases. An example of this is shown in Figure 1, which contains two transients, marked ‘T’. The first one corresponds to the anterior release, the second to the dorso-velar release. In general, clicks are louder than percussives. However, they are variable and the transients that must occur are often not easily seen through the high-amplitude friction noise which often accompanies clicks.

Figure 1 [! h↓ ʔɜːːm]. (Speaker from Salford; first part of a response to a question.) The two transients (marked ‘T’) correspond to the release of the anterior and posterior closures for the click [!].

Example (3) illustrates percussives in initial position in a Turn Constructional Unit (TCU), the basic unit out of which turns at talk are composed (Sack, Schegloff & Jefferson Reference Sacks, Schegloff and Jefferson1974; see also Section 5 below).

  1. (3) Salford Si&Sh 709–717 drinking

Before Sheila produces it's rubbish, she audibly breathes in and there are a number of transients in the speech signal. Auditory analysis suggests the tongue coming away from the alveolar ridge and velum almost simultaneously, hence the transcription [tkh↓].

Figure 2 shows a spectrogram of the percussives in Example (3). The first one is alveolar, while the other two transients are velar and show a typical velar burst with multiple releases. It can be seen that percussives can involve the release of multiple closures, since in the rest or closed position, the tongue may make contact along its whole surface and length. It is not difficult to see how at least some clicks may arise from such ingressive percussives: it only requires an anterior contact to be released before a velar contact, and some amount of suction on the tongue to produce the supralingual rarefaction required for a velaric ingressive suction stop. The amount of tongue pressure will control the amount of rarefaction, and thereby also the loudness and salience of the click. This means that some clicks are quiet, especially in turn-initial position, and some clicks and percussives are very hard to distinguish auditorily. As we will see, in at least some structural positions, clicks and percussives are in free variation, and it may be that the more communicative – and auditorily much more salient – clicks which handle sequence management and displays of stance have evolved from the quieter turn-initiating clicks which alternate with percussives. We will not in this paper explore this much further, because acoustic data do not easily allow it.

Figure 2 Example (3), line 5: it's rubbish. The transients around 0.2 s are percussives (marked P in the waveform and visible in the spectrogram), accompanied by an inhalation till about 0.4 s.

3 Data

The data for this paper come from a wide range of sources, all of them collected for other purposes than the current one. The sources of the examples in this paper are identified by a mnemonic in the header line; these are described in brief in the Appendix. All the data are conversational and include material from radio phone-ins, phone calls and sociolinguistic interviews.

The extracts presented here are transcribed using a version of the GAT conventions (Gesprächanalytisches Transkriptionssystem) for the transcription of conversation (Selting et al. Reference Selting, Auer, Barth-Weingarten, Bergmann, Bergmann, Birkner, Couper-Kuhlen, Deppermann, Gilles, Günthner, Hartung, Kern, Mertzlufft, Meyer, Morek, Oberzaucher, Peters, Quasthoff, Schütte, Stukenbrock and Uhmann2009), which mark features of talk using adaptations of orthography without losing readability. GAT uses the symbol ! to mark strong emphasis; this has been replaced with underlining to avoid confusion with the click [!]. Clicks and percussives are transcribed using IPA conventions, with some modifications as described in Section 2 above. Since square brackets are used for marking overlapping talk, any segmental IPA symbols in the transcriptions are presented in italics, to avoid confusion.

4 Patterns of distribution

To give some idea of how common clicks and percussives are, we report a survey of one subset of the data, the York section of a set of sociolinguistic interviews collected as part of a project on Northern Englishes.

The data consist of four male pairs and three female pairs. All speakers were in the age range 18–22 at the time of recording. All but two were in tertiary education, and all were born and brought up in York. There are about 40 minutes of material for each pair, i.e. about 280 minutes in total. Clicks and percussives were identified by three different transcribers, who labelled separately and then compared labels and agreed on the classification of each sound using acoustic and auditory evidence. Where sounds were quiet, transcribers could not confidently distinguish between clicks and percussives, so we report an ‘uncertain’ category as well.

Clicks and percussives were approximately equally numerous (Figure 3), and there was a substantial number of cases where transcribers could not reach agreement on the classification. In all, there were 222 clicks which the transcribers were certain of, which means that in five minutes of talk, on average, about four clicks occur. A similar frequency was found for Finnish (Pohjola Reference Pohjola2011).

Figure 3 Clicks and percussives (including uncertain cases) in a corpus of spoken data from York.

If clicks are considered by position in the turn, the majority (128, 57%) are TCU-initial; 90 (41%) are TCU-medial, and a very small number (4, 2%) are TCU-final; but it is likely that these clicks project a new TCU which is abandoned. Clicks can be followed by a range of vocalisations, such as uhm, mm, particles such as oh, ah, aw, or linguistic items such as words. Pauses are also common before and after clicks.

When the distribution is considered by speaker (Figure 4), it is obvious that some speakers produced a very large number of clicks: three speakers (Anna, Emma and Cathy) account for almost half the clicks in the whole collection. This is an interesting finding, but it is also hard to interpret. Are some speakers habitual clickers? If so, why? Is it the case that these three speakers (but not the others) produced a large number of turns at talk where clicking was relevant? Is it possible that some speakers produce a lot of clicks but not many percussives, and vice versa? If so, this would suggest that clicks and percussives are indeed in free variation, and the choice is idiosyncratic. Considering the numbers by speaker does not solve the problem: Anna produces many clicks and percussives, Emma produces many clicks but not percussives, while Dan and Rachel produce many percussives but not clicks. So it is not clear that clicks and percussives are in free variation and one kind of sound can freely be substituted for the other.

Figure 4 Clicks and percussives by speaker, arranged in descending order of the number of clicks produced. The patterns of usage look idiosyncratic.

Of 222 clicks identified, only six were clearly identified as nasalised; another six were identified as possibly nasal, and the transcribers could not reach agreement. Nasalised clicks such as [] are therefore very rare in this dataset, and comprise no more than about 5% of all clicks. No instances of voiceless nasal clicks were identified.

If clicks are grouped by speaker gender, then females overall click much more than males: females produced 176 of all the clicks ( = 79% of all clicks). On average, each female speaker produced 29.3 clicks in 40 minutes, i.e. approximately one click every 1 minute and 20 seconds, while each male speaker produced 5.75 clicks in 40 minutes, i.e. approximately one click every seven minutes. The three biggest clickers in the collection are females, and they produce 147 (66%) of all the clicks, which indicates that clicks are not just a matter of gender. It is hard to know how to explain this finding. It may be that these speakers produce more of the kinds of talk where clicks are relevant. It may be that the clicks have a social-indexical function. To provide an explanation, a better understanding is needed of how clicks are distributed in English conversation, and what functions they have. The next section addresses this issue.

5 Turn-initial clicks and percussives

In this paper, we assume that speakers’ turns at talk are organised as described by Sacks et al. (Reference Sacks, Schegloff and Jefferson1974). To simplify the details of that paper: The basic unit of organisation is a turn, which is the time during which a participant is speaking. Turns consist of one or more TCUs (Turn Constructional Units), which are of variable size, but contain lexical material. TCUs usually contain places, Transition Relevance Places (TRP), where a current listener may legitimately come in with a response: this is best thought of as a space rather than a point. Ford & Thompson (Reference Ford, Thompson, Ochs, Schegloff and Thompson1996), among others, show how TRPs are aligned with syntactic, pragmatic and prosodic structures. In linguistic terms, TCUs therefore are best thought of as constructions describable on several levels simultaneously.

The commonest location for clicks is the pre-turn position: that is, before any lexical content in the turn, and before any response tokens such as oh or aw. Many such initial clicks are ‘plain’, in two senses: they are voiceless and non-nasal, and have only an in-breath as an accompaniment (though not necessarily), and they seem to do little more than mark the initiation of a turn at talk.

Examples (4) and (5) illustrate ‘plain’, turn-initial clicks. Example (4) is taken from a radio show, and P is the presenter.

  1. (4) ell lists sum04 cheese

The question at line 1 gets a reply at line 2. This reply is initiated by well, which marks it as not a straightforward answer (Schegloff & Lerner Reference Schegloff and Lerner2009), and this non-straightforwardness comes in the form of a list. It would be socially odd for the click to mark a negative stance in this context, and there is no other material in the turn that displays a negative stance. A reasonable interpretation of clicks like these is that they are an audible signal of incipient speakership, an interpretation that is consistent with the subsequent in-breath. Speculatively, in multi-party talk, it may be that such clicks can be used to identify oneself as a next speaker, and to secure the attention of others. As relatively loud transient sounds, clicks may be well adapted to this purpose.

Example (5) resembles (4) in that is a responsive turn. However it is sequentially more complex because the turn starts with a new action (a greeting), and is the start of a longer activity, of responding to Imogen's point. In this respect it is similar to Wright's (Reference Wright2011b) New Sequence Indexing clicks.

  1. (5) klm lists sum04 housing

Here, a caller to a radio show, Imogen, has been complaining about new houses being built in her town. The moderator selects Councillor P as the next speaker in line 5. P's initial action on starting her turn is to greet Imogen, followed by a reply to Imogen's earlier comment. P's turn starts with a click, followed by an in-breath and the turn-initial particle oh. As in (4), it is hard to make a case for this click marking a stance; more likely is that the click is an audible sound of ‘gearing up to speak’ (see Scobbie et al. Reference Scobbie, Schaeffler and Mennen2011), as the speaker separates her articulators and takes a breath in.

Clicks in these structural positions seem to be in free variation with percussives, as seen in the arrowed turns in Examples (6) and (7), which are initiated with percussives.

  1. (6) Salford M & D 1009 intimidating school

  2. (7) Salford Si & Sh incidental PERC 926 big glasses

In Example (6), Marjorie displays some trouble in finding a word (glottal stop at the end of were, which is syntactically incomplete, and then an in-breath). Diane's [percussive] + [in-breath] in line 4, in overlap with Marjorie's in-breath, mark her incipient speakership, and her turn provides a continuation of Marjorie's, displaying an understanding that Marjorie is having trouble finding a word. In Example (7), Simon's TCU is prosodically complete at line 1, but but projects a possible continuation of his turn, which he develops in line 3 after a gap. Here, the percussive and in-breath mark the resumption of his own turn, which may be an orientation to Sharon's throat-clearing, which Simon seems to treat as possibly projecting talk by Sharon. Thus the percussive is TCU-initial but turn-medial, and it contributes to the vocal projection of continued talk by Simon.

Plain clicks commonly occur in word searches (see Wright Reference Wright2005 for a fuller account). The click in (8) is a typical example.

  1. (8) Salford M & D 1328 secondary school

In line 3, there are clear signs of trouble. First, there is a pause located after the possessive determiner my – by pausing after a word that starts a syntactic phrase, Marjorie syntactically projects further talk to come. The particle uhm and the second gap in the word search are followed by a click, and then the resolution of the word search is achieved with secondary school. Diane in the meantime makes no attempt to resolve the problem: this is common in stretches of talk where a larger syntactic phrase has been projected, as here. Clicks, pauses and particles like uhm (with level pitch near the middle of the speaker's range) are very common markers of word search. In this context, clicks are part of a set of practices used to mark trouble finding words. As we shall see in Section 7, however, silences and clicks sometimes have a more interesting explanation.

6 Stance-taking

As we have seen, clicks may be used as a device for projecting up-coming talk: they occur turn-initially and are followed by what has been projected in a prior turn. We look now at clicks that are closer to the lay interpretation, i.e. clicks which are implicated in displaying a stance. We shall see that the form of such clicks is in general no different from those of turn-initiating clicks, but their sequential placement provides them with an affect-laden interpretation. Percussives seem not to occur in these contexts. Example (9) provides a good example of a click as part of a conveyance of negative stance: a canonical ‘tutting’ click.

  1. (9) CallHome 4861.60

Mom has been discussing at length with her daughter Debbie why Debbie has not yet found a suitable prospective husband, and has not settled on a career. At lines 1–2, Mom blames herself for what she perceives as her daughter's failings. Debbie's response consists of a click followed by an in-breath, and then an imperative which rejects Mom's self-blame and seeks to close it down. The form of the click is the same as in earlier examples: a plain voiceless click accompanied by an in-breath. We saw no evidence in other examples that such clicks were displaying a stance: they mark only that the speaker is getting ready to talk. If we consider the placement of Debbie's click relative to Mom's talk, it comes much earlier than the other [plain click] + [in-breath] constructions. It is placed at a point in time where Mom's self-blame becomes unambiguously recognisable as such (obviously I didn't do a good enough j-). Thus by virtue of its timing relative to the prior turn, this click projects an immediate response by Debbie to Mom's self-blame; and indeed her resistance in oh stop that is located exactly at a TRP, just after the main accent of Mom's turn, in raising (see Wells & MacFarlane Reference Wells, MacFarlane, Swerts and Hirschberg1998). Thus the click projects a turn dealing with Mom's self-blame by its placement; but the form of the click does not, per se, express resistance to Mom. Any sense of resistance arises from the click's timing relative to Mom's on-going turn, and the accompanying actions expressed through oh stop that which together deliver a stance towards Mom's turn.

Example (10), discussed in more detail in Drew (Reference Drew1998), illustrates another click expressing disapproval.

  1. (10) Holt C38:4

Lesley has been telling Joyce a story about a mutual acquaintance, Mr R, at a sale in her local vicarage. Lesley launched her story by saying I'm broiling about something. The punchline of her story is at lines 1–2. Mr R's reported words in line 2 are the thing which she is broiling about; and Joyce's response at line 3 is her appreciation of Lesley's story. Joyce displays her shared outrage at Mr R's behaviour with a click followed by a long voiceless ingressive sound made with an open vocal tract, [!↓ːː]: it is close to a ‘gasp’. Joyce's turn thus does two things: it simultaneously displays an understanding of the point of Lesley's story, and affiliates with Lesley's own stance towards Mr R's comment to her. It is followed by further talk in which Lesley and Joyce share their outrage at Mr R.

The TCU which is initiated with the click conveys several actions at once: it displays ‘outrage’ at the complainable behaviour reported in Lesley's story, which is projected right at its beginning and in doing so, the TCU affiliates with Lesley in her complaint. In this case again, the click is understandable through its context (after the punchline of a story, and the actions that it makes relevant from the recipient of the story); but there is nothing in the form of the click itself which presents it as a click designed to do these things.

In Example (11), the click is again turn-initial and accompanied by a response particle, this time ah, which is produced with breathy voice quality, and followed by an assessment, that's lovely. As in (10), the click is in a turn which displays an understanding of the kind of story that has just been told; in this case, a romantic one, where a young woman has finally got together with a man who has been very persistent. Kate's appreciation of the story is positive.

  1. (11) nrb reluctant lover

Examples (10) and (11) display two different kinds of story which are differently appreciated, but the appreciation of the story is initiated with a click. The click does not occur by itself though. Although the click marks the onset of the appreciation of the story, the kind of appreciation that the recipient of the story is expected to display is projected at the launch of the story, and through the telling of the story itself. The click is accompanied by a response token which is fitted to the story. The [click] + [response token] are followed by further lexical material which displays verbally an appreciation of the story. Thus the ‘meaning’ of the click is part of a more complex set of interactional practices for story-telling.

In Example (12), there is a rare case of a nasalised click. Kate has dyed her hair red, and Jade is complimenting her on the result. In line 4, she makes a positive assessment of Kate's hair, and Kate responds in overlap, but at a TRP. Like the clicks that we saw in stories, it is accompanied by a response token ([`ɑː.], with intonation falling to low), and then verbal material which explicates the action of the turn, in this case thanks.

  1. (12) nrb red hair

In summary, clicks that display a stance are often accompanied by other material which conveys a stance. This is expectable. Response tokens such as oh, ah, aw, etc., are vocalic and thus amenable to a range of phonetic modifications which serve to mark the kind of stance they display (Norrick Reference Norrick2012). Clicks, on the other hand, are much more difficult to modify: they are punctual, they are usually voiceless, and modifications to them are more difficult to produce and to perceive. A reasonable conclusion is that clicks do not by themselves display a stance, but are part of a display of a stance which is handled and negotiated interactionally over a longer stretch of talk, e.g. through the projection of the kind of appreciation that is due, the use of response tokens, and the inclusion of overt lexical material which provides an assessment or displays gratitude. This is compatible with recent studies which show that many stance-related actions in conversation are not instantaneous, but negotiated over longer sequences of talk (e.g. Wilkinson & Kitzinger Reference Wilkinson and Kitzinger2006; Local & Walker Reference Local and Walker2008; Ogden Reference Ogden, Barth-Weingarten, Reber and Selting2010, Reference Ogden2012b). The task for the producer of the click is to produce a click at the right place in a sequence, and at the right time. However, we have been unable to show that the particular form of the click is consequential: modifications to place of articulation or accompaniments to the click articulation do not seem to be used to modify what is displayed through the click. Rather, if clicks have any meaning, this comes about through other features of the collocations in which they occur. A possible exception are nasal clicks and lateral clicks, which feature only rarely in the data; further study is needed to show empirically whether these kinds of clicks have particular functions.

7 Kinetics

We now consider some aspects of the kinetic production of clicks and percussives. So far we have considered only the auditory/acoustic aspects of clicks, but there are frequently other accompanying behaviours involved in the production of both clicks and percussives.

One physical action which may accompany clicks is swallowing. In Example (13), there is a word search which contains the particle uh, a pause, and a click: it shares many similarities with Example (8) above.

  1. (13) RCE25 Bench 00.57 library card

A transcript based on auditory data alone at line 1 shows a pause of 0.6 seconds, but inspection of the video shows that L is swallowing during that part. The click is the culmination of the swallow, where L opens her vocal tract. Swallowing involves a movement of the tongue and pharyngeal muscles which produce a downward movement of the bolus (which in this case is likely to be saliva). This downward movement, accompanied by closure of the surface of the tongue against the roof of the mouth, and closure of the lips, is conducive to velaric ingressive airflow as the vocal tract is opened again.

Figures 5a–c are all taken in the 0.6 second silence between to-uhm and the click. Figure 5a is the bilabial closure at the end of [m]. Note that it is not merely bilabial closure involving the surface of the lips, but involves contact of the whole of the outer surface of the lips; this is a very extreme closure as compared to that normally used for speech. Figure 5b is taken at the point where the speaker swallows. In the video, the tendons in the neck are seen to become more prominent as the swallow occurs, and the cheeks are also drawn in. As can be seen, the lips are starting to be released, although there is still a tight closure. Figure 5c is taken at the point of release, where the click is made. The ‘lipsmack’ at the resolution of the word search is therefore not just audible; it is the culmination of a bigger physical process which is also visible from the speaker's face and neck.

Figure 5 Example (13), lines 3–4: (a) bilabial closure; (b) mid closure, at the peak of the swallowing; (c) just before release into to-uhm (0.6) [!].

The next example of a kinetic event co-occurring with a click is taken from a family dinner in the United States in the 1960s. In previous examples, we saw that clicks that present a stance are typically accompanied by turn-initial particles and followed immediately by lexical material which expands on the action of the turn with the click, or of the [click] + [response token]. We turn now to a click which does occur by itself. In this example, Virginia, a teenager who is not yet old enough to work, is pestering her mother about getting a new dress (lines 1–2, 4). That this is not the first time she has done this is clear from Mom's turn in lines 5–6. Mom resists Virginia's request for a dress, to which Virginia responds at various points. Virginia's uhhh! at line 8 is accompanied by a head turn away from Mom.

  1. (14) Virginia p.3 01:27

  2. Context: Virginia (V) and Mom (M) at a family dinner table. (Some overlapping talk from Virginia's sister Prudence at line 13 has been removed for clarity.)

At line 11, Virginia produces a click. It is accompanied by a head-turn away from Mom, with Virginia shifting her gaze away, and iconically disengaging from her mother (Figure 6) (see Goodwin Reference Goodwin1981, Reference Goodwin1986).

Figure 6 Upper panel: Mom (left) and Virginia (right) as Mom says fall stuff (Example (14), line 9). They share mutual gaze. Lower panel: Mom (left) and Virginia (right) as Virginia produces her click (Example (14), line 11). Virginia and Mom are visibly disengaged from one another.

Virginia's click can be understood as part of a display of resistance to Mom on a number of grounds. Firstly, there are other, earlier signs of her disaffiliation with her mother's stance, and indeed signs of her mother's resistance to her: her request for the dress in line 1 is immediately followed by a repeated form lemme get that-. Her mother's response at lines 5–6 rejects the request – not for the first time – and in lines 7 and 9, Mom presents reasons why she does not go along with Virginia's request. Virginia in the meantime makes a whimpering noise (line 8), and turns away from her mother. The click then is located in an extended sequence where the two parties have a stand-off: on completion of Mom's accounts for why she will not grant the request, Virginia displays her response. The click is therefore but one part of how Virginia displays her stance towards Mom, both in the turn itself and in the sequence more generally. Clearly, Virginia's click is implicated in displaying a stance towards her mother, but it does not in and of itself display these things. While the click is the only audible material in the turn, it is not the only physical action that Virginia produces at this moment: Virginia's turn delivers an action through a blend of vocal and kinetic activities.

8 Clicks as metronomes

The final set of clicks we consider are those which are timed isochronously with other events in the speech signal. Clicks work uniquely well for this: as they are transient and relatively loud, they serve well as temporal markers in speech. For this reason, we claim that these clicks function as ‘metronomes’; alongside any other functions they might have, they handle the intersubjective timing of participants’ turns relative to one another.

The first case is a click which is part of a display of sympathy, in Example (15).

  1. (15) SW/CK Sym #63

In this extract, A is complaining about the dull and miserable weekend she has just spent. In line 2, she repeats the word miserable, then there is a gap in her talk, before she produces weekend. In this gap, B comes in and produces a click, which is located just between the second miserable and weekend (see Figure 7). In overlap with weekend, B starts a turn which displays sympathy with A: it has the format [click] + [response token aw] + [verbal expression of sympathy]. This makes it structurally similar to other cases considered earlier. These two turns are produced with isochronous beats. In her turn, A produces words aligned with the beats (on God, miserable, miserable and weekend) with a ‘beat of silence’ between miserable and weekend. Figure 7 shows a labelled waveform with three tiers: A's words, B's words, and beats (marked b), as produced automatically using a program which places beats half way up amplitude contours in the speech signal, as in Cummins & Port (Reference Cummins and Port1998).Footnote 1 As can be seen, an isochronous rhythm holds across the two turns, with B's click produced on the fourth beat, and then three further beats on aw, that's and shame. The temporal placement of B's click thus demonstrates an orientation to the rhythm established by A, and projects an up-coming, on-beat turn. B's turn is indeed aligned in time and in action with A's. The isochrony across the two turns at talk (Couper-Kuhlen Reference Couper-Kuhlen1993, Auer, Couper-Kuhlen & Müller Reference Auer, Couper-Kuhlen and Müller1999) is thus prosodically co-constructed, in a manner similar to syntactic co-constructions (e.g. Lerner Reference Lerner and Lerner2004).

Figure 7 Spectrogram, waveform and beats for Example (15). Beats placed automatically by algorithm (http://cspeech.ucd.ie/~fred/beatExtraction.php).

While Example (15) showed a click projecting an on-time and on-beat response, the next example of clicks as metronomes involves clicks produced in one turn being used as a device for incoming laughter.

  1. (16) Salford A & R lateral click 1654 topless beaches

Anthony and Ray have been talking about places they would like to go to. Anthony has mentioned that he would like to go to Australia. Ray, in line 2, dismisses Anthony's plans, and Anthony in line 3 rejects Ray's rejection. Anthony's TCU is then extended by a series of four lateral clicks, which are isochronously spaced. These clicks have incrementally falling secondary resonances, probably achieved by progressive lip rounding. This change in resonance may be part of how a series of four is projected. Ray displays his understanding of Anthony's response as something laughable. Ray's laughter is pulsed, and the first pulse comes in on beat with Anthony's last click. Since the intervals between Anthony's four clicks are equidistant, the clicks set up a fast beat, which Ray orients to with his turn that consists of laughter: this laughter constitutes an appreciation of Anthony's rebuttal of Ray's rejection in line 2. Figure 8 shows Anthony's clicks, again with beats marked automatically.

Figure 8 Example (16): Multiple clicks followed by laughter from another speaker. Beats are placed on each of the clicks, and the final click coincides with the onset of the laughter pulses. The red grid is aligned with the clicks, and the lines are spaced without reference to the audio. The laughter pulses are initially timed with shorter intervals than the clicks, but start on beat with the final click.

Anthony's laughter pulses are spaced more closely together than Anthony's clicks, but nonetheless, he displays an orientation to the beats set up by Anthony's clicks by the way he times his incoming laughter, which displays an appreciation of Anthony's prior turn. These two examples (and others like them) show that clicks can simultaneously handle matters of interpersonal timing in talk and issues of stance-taking. Because they are punctual, clicks provide participants with clear signals about rhythm and timing in talk. It is known (e.g. Auer et al. Reference Auer, Couper-Kuhlen and Müller1999) that the temporal location of incoming talk affects the interpretation of turns at talk. Clicks can serve as a resource – a metronome – for participants (both speakers and hearers) to either solicit or to provide a well-timed and affiliating response.

9 Conclusions

Overall, our study has shown that clicks occur in (at least) three distinct systems with distinct functions but partially overlapping phonetic exponents. The details of these systems are not complete, but in outline they seem to be:

  1. 1. Marking incipient speakership. Clicks in turn-initial position can serve to project speech, and therefore can mark the transition from listener to incipient speaker. This system includes percussives which arise from the separation of the articulators, often simultaneously with inhalation. The sounds in this system are united by the features of audible separation of articulators, and ingressive airflow, which may be either pulmonic (percussive) or velaric (click).

  2. 2. Aspects of sequence management. Clicks and percussives both commonly occur as elements of word-search, in conjunction with other, more linguistic features: particles such as uh, uhm, mmm, uttered on a mid-level pitch, perhaps tagged on to (full forms of) words; in-breaths and swallowing; gaps in talk at points where longer syntactic units are projected. Clicks also index new sequences of talk, at different syntactic locations from word searches. These uses of clicks, which appear to be more linguistically structured and particular, may have evolved from those sounds which mark incipient speakership, which appear more organic.

  3. 3. Displaying a stance. Clicks, but apparently not percussives, are part of a family of verbal and non-verbal practices for displaying a stance turn-initially. Such clicks tend to be loud and deliberate, giving the impression that they are designed for a recipient to hear (unlike at least some sounds marking incipient speakership). They are commonly, but not necessarily, followed by response tokens whose prosodic design carries a great deal of communicative weight which we have not examined in this paper. Stance-taking clicks may also be accompanied by other bodily activities, such as head-turns, and probably facial expressions too. They are embedded in sequences of talk which can project the stance displayed in the turn with the click, and/or elaborated on after the click is produced, e.g. through overt linguistic material.

While we have not considered other languages, system 1, ‘incipient speakership’, may be subject to individual variation due to differences in e.g. amount and texture of saliva, and perhaps personal habits such as ‘neutral’ mouth postures (Scobbie et al. Reference Scobbie, Schaeffler and Mennen2011). We might therefore expect to see similar patterns in other languages. On the other hand, word searches, new sequence indexing, and displays of stance involve more overtly linguistic elements, so we might expect to find significant differences between languages in terms of form and function. This remains a question for further research (though Pohjola Reference Pohjola2011 has very similar findings for Finnish conversation).

Clicks, as the lay interpretation assumes, are involved in displays of stance. But the form of clicks in different kinds of stance-taking does not seem to be very variable. Rather, the variable elements are the location of clicks in a sequence of talk, and the accompanying response tokens. In other words, it is hard to demonstrate that clicks by themselves display a stance. It is much easier to show that turns displaying a stance occur in sequential locations where displays of stance are relevant, and that the kind of stance displayed is projected through the talk itself. Indeed, we have seen examples of both broadly positive and negative stance-taking performed with the click [!]. This indicates that in order to understand the linguistic components of stance-taking better, it is important to look at a longer sequence; and to (literally) look at non-verbal behaviour as well.

In summary, clicks and percussives in English have a complex but regular distribution. It is true that they are not phonemic, but there are good arguments for treating them as part of a speaker's communicative competence. They have a regular distribution in pre-turn position (or mid-turn position in the case of word searches); and they are elements of bigger meaning-bearing prosodic constructions. It seems very likely that there are individual variations in the precise forms of clicks, and the frequency with which they are used. For more conclusive statements about their linguistic value, or their specific functions, cross-linguistic studies are needed.

The analysis in this paper is deficient in that it depends heavily on auditory and acoustic analysis. This is necessarily so, as the competing demands between natural data – where participants have a real social, interactional and emotional investment in what they are doing through their talk – and laboratory data – where the social situation is highly controlled – are difficult to reconcile. Articulatory studies would provide much more detailed information about the precise articulation of clicks and percussives, and may help us to explore in better detail the acoustics of such sounds. Access to video data indicates that a fuller study could also usefully consider the body more generally: breathing and swallowing are two basic activities that are intimately bound up with clicks and percussives in the data considered here. Securing such data, while retaining natural contexts, is however extremely challenging. A useful next step would be to conduct laboratory studies so that the relation between articulation and acoustics is clearer, in order to make acoustic analysis of natural data, alongside a careful sequential analysis, more robust.

Acknowledgements

Thanks to Daniel Clubb, Marianna Kaimaki, and Emma Piggott for help with data collection; to Bill Haddican and Hazel Richards for access to the ESRC Northern Englishes data; to Emanuela Buizza for help with the illustrations; to anonymous reviewers of JIPA whose comments helped to improve the paper; to Erica Gold, Auli Hakulinen, Sara Howard, Tamar Keren-Portnoy, John Local, Anna Pohjola, Jim Scobbie, Marja-Leena Sorjonen, Jane Stuart-Smith, Jürgen Trouvain, Marilyn Vihman, Melissa Wright and many other colleagues for useful discussions along the way. The University of York Research Priming Fund supported some of the data collection and analysis.

Appendix: Data sources

Footnotes

1 Code available from http://cspeech.ucd.ie/~fred/beatExtraction.php. The code places beats slightly before the expected location in the case of transients such as for clicks; the beats are therefore shifted leftwards by a few milliseconds. The isochronous beats are nonetheless visible by this method, which has the advantage of being consistent and placed by a computational algorithm.

References

Abercrombie, David. 1967. Elements of general phonetics. Edinburgh: Edinburgh University Press.Google Scholar
Auer, Peter, Couper-Kuhlen, Elizabeth & Müller, Frank. 1999. Language in time: The rhythm and tempo of spoken interaction. Oxford: Oxford University Press.CrossRefGoogle Scholar
Ball, Martin. 1989. Phonetics for speech pathology. London: Taylor and Francis. [Reissued 1990, London: Whurr.]Google Scholar
Button, Graham. 1987. Moving out of closings. In Button, Graham & Lee, J. R. (eds.), Talk and social organisation, 101151. Clevedon: Multilingual Matters.CrossRefGoogle Scholar
Catford, J. C. 2001. A practical introduction to phonetics. Oxford: Oxford University Press.CrossRefGoogle Scholar
Clark, John & Yallop, Colin. 1990. An introduction to phonetics and phonology. Oxford: Basil Blackwell.Google Scholar
Couper-Kuhlen, Elizabeth. 1993 English speech rhythm: Form and function in everyday interaction. Amsterdam & Philadelphia, PA: John Benjamins.CrossRefGoogle Scholar
Crystal, David. 1987. The Cambridge encyclopaedia of language. Cambridge: Cambridge University Press.Google Scholar
Cummins, Fred & Port, Robert. 1998. Rhythmic constraints on stress timing in English. Journal of Phonetics 26 (2), 145171.CrossRefGoogle Scholar
Drew, Paul. 1998. Complaints about transgressions and misconduct. Research on Language & Social Interaction 31 (3), 295325.CrossRefGoogle Scholar
Ford, Cecilia E. & Thompson, Sandra A.. 1996. Interactional units in conversation analysis: Syntactic, intonational and pragmatic resources for the management of turns. In Ochs, Elinor, Schegloff, Emanuel A. & Thompson, Sandra A. (eds.), Interaction and grammar (Studies in Interactional Sociolinguistics 13), 134184. Cambridge: Cambridge University Press.CrossRefGoogle Scholar
Gil, David. 2011. Para-linguistic usages of clicks. In Dryer, Matthew S. & Haspelmath, Martin (eds.), The world atlas of language structures online. Munich: Max Planck Digital Library.Google Scholar
Gimson, A. C. 1970. An introduction to the pronunciation of English, 2nd edn. Bristol: Edward Arnold.Google Scholar
Goodwin, Charles. 1981. Conversational organization: Interaction between speakers and hearers. New York: Academic Press.Google Scholar
Goodwin, Charles. 1986. Gestures as a resource for the organization of mutual orientation. Semiotica 62 (1–2), 2949.CrossRefGoogle Scholar
Gussenhoven, Carlos. 2004. The phonology of tone and intonation. Cambridge: Cambridge University Press.CrossRefGoogle Scholar
IPA [International Phonetic Association]. 1999. Handbook of the International Phonetic Association. Cambridge: Cambridge University Press.Google Scholar
Jefferson, Gail. 1984. On the organization of laughter in talk about troubles. In Atkinson, J. Maxwell & Heritage, John (eds.), Structures of social action: Studies in conversation analysis, 346–369. Cambridge: Cambridge University Press.Google Scholar
Kohler, Klaus J. 2007. “Speech–smile”, “speech–laugh”, “laughter” and their sequencing in dialogic interaction. Interdisciplinary Workshop on the Phonetics of Laughter, Saarbrücken, 21–26.Google Scholar
Ladefoged, Peter. 1982. A course in phonetics, 2nd edn.New York: Harcourt Brace Jovanovich.Google Scholar
Ladefoged, Peter & Maddieson, Ian. 1996. The sounds of the world's languages. Oxford: Blackwell.Google Scholar
Ladefoged, Peter & Traill, Anthony. 1994. Clicks and their accompaniments. Journal of Phonetics 22, 3364.CrossRefGoogle Scholar
Laver, John. 1994. Principles of phonetics. Cambridge: Cambridge University Press.CrossRefGoogle Scholar
Lerner, Gene H. 2004. Collaborative turn sequences. In Lerner, Gene H. (ed.), Conversation analysis: Studies from the first generation, 225–256. Amsterdam: John Benjamins.CrossRefGoogle Scholar
Local, John & Walker, Gareth. 2008. Stance and affect in conversation: On the interplay of sequential and phonetic resources. Text & Talk – An Interdisciplinary Journal of Language, Discourse Communication Studies 28 (6), 723747.CrossRefGoogle Scholar
Norrick, Neal R. 2012. Listening practices in English conversation: The responses responses elicit. Journal of Pragmatics 44 (5), 566576.CrossRefGoogle Scholar
Ogden, Richard. 2010. Prosodic constructions in making complaints. In Barth-Weingarten, Dagmar, Reber, Elisabeth & Selting, Margret (eds.), Prosody in interaction, 81104. Amsterdam: John Benjamins.CrossRefGoogle Scholar
Ogden, Richard. 2012a. The phonetics of talk in interaction – introduction to the Special Issue. In Ogden, Richard (ed.), The phonetics of talk-in-interaction, special issue of Language and Speech 55 (1), 311.Google ScholarPubMed
Ogden, Richard. 2012b. Making sense of outliers. Phonetica 69 (1–2), 4867.CrossRefGoogle ScholarPubMed
Pike, Kenneth. 1943. Phonetics: A critical analysis of phonetic theory and a technic for the practical description of sounds. Ann Arbor, MI: University of Michigan Press.CrossRefGoogle Scholar
Pohjola, Anna. 2011. Maiskaukset keskustelussa [Clicks in conversation]. BA dissertation, Finnish Language, University of Helsinki.Google Scholar
Potter, Jonathan & Hepburn, Alexa. 2010. Putting aspiration into words: “Laugh particles”, managing descriptive trouble and modulating action. Journal of Pragmatics 42 (6), 15431555.CrossRefGoogle Scholar
Reber, Elisabeth. 2012. Affectivity in interaction: Sound objects in English. Amsterdam: John Benjamins.CrossRefGoogle Scholar
Sacks, Harvey, Schegloff, Emanuel A. & Jefferson, Gail. 1974. A simplest systematics for the organization of turn-taking for conversation. Language 50, 696735.CrossRefGoogle Scholar
Schegloff, Emanuel A. & Lerner, Gene H.. 2009. Beginning to respond: Well-prefaced responses to wh-questions. Research on Language and Social Interaction 42 (2), 91115.CrossRefGoogle Scholar
Schegloff, Emanuel A. & Sacks, Harvey. 1973. Opening up closings. Semiotica VIII (4), 289327.Google Scholar
Scobbie, Jim M., Schaeffler, Sonja & Mennen, Ineke. 2011. Audible aspects of speech preparation. 17th International Congress of the Phonetic Sciences (ICPhS XVII), Hong Kong, 1782–1785.Google Scholar
Selting, Margret, Auer, Peter, Barth-Weingarten, Dagmar, Bergmann, Jörg, Bergmann, Pia, Birkner, Karin, Couper-Kuhlen, Elizabeth, Deppermann, Arnulf, Gilles, Peter, Günthner, Susanne, Hartung, Martin, Kern, Friederike, Mertzlufft, Christine, Meyer, Christian, Morek, Miriam, Oberzaucher, Frank, Peters, Jörg, Quasthoff, Uta, Schütte, Wilfried, Stukenbrock, Anja & Uhmann, Susanne. 2009. Gesprächanalytisches Transkriptionssystem 2 (GAT 2). Gesprächsforschung – Online-Zeitschrift zur verbalen Interaktion 10, 353402.Google Scholar
Ward, Nigel. 2006. Non-lexical conversational sounds in American English. Pragmatics and Cognition 14 (1), 113184.CrossRefGoogle Scholar
Wells, Bill & MacFarlane, Sarah. 1998. Prosody as an interactional resource: Turn-projection and overlap. In Swerts, Marc & Hirschberg, Julia (ed.), Prosody and conversion, special issue of Language and Speech 41 (3–4), 265294Google Scholar
Wilkinson, Sue & Kitzinger, Celia. 2006. Surprise as an interactional achievement: Reaction tokens in conversation. Social Psychology Quarterly 69 (2), 150182.CrossRefGoogle Scholar
Wright, Melissa. 2005. Studies of the phonetics–interaction interface: Clicks and interactional structures in English conversation. Ph.D. thesis, University of York.Google Scholar
Wright, Melissa. 2007. Clicks as markers of new sequences in English conversation. 16th International Congress of the Phonetic Sciences (ICPhS XVI), Saarbrücken, 1069–1072.Google Scholar
Wright, Melissa. 2011a. On clicks in English talk-in-interaction. Journal of the International Phonetic Association 41 (2), 207229.CrossRefGoogle Scholar
Wright, Melissa. 2011b. The phonetics–interaction interface in the initiation of closings in everyday English telephone calls. Journal of Pragmatics 43 (4), 10801099.CrossRefGoogle Scholar
Figure 0

Figure 1 [! h↓ ʔɜːːm]. (Speaker from Salford; first part of a response to a question.) The two transients (marked ‘T’) correspond to the release of the anterior and posterior closures for the click [!].

Figure 1

Figure 2 Example (3), line 5: it's rubbish. The transients around 0.2 s are percussives (marked P in the waveform and visible in the spectrogram), accompanied by an inhalation till about 0.4 s.

Figure 2

Figure 3 Clicks and percussives (including uncertain cases) in a corpus of spoken data from York.

Figure 3

Figure 4 Clicks and percussives by speaker, arranged in descending order of the number of clicks produced. The patterns of usage look idiosyncratic.

Figure 4

Figure 5 Example (13), lines 3–4: (a) bilabial closure; (b) mid closure, at the peak of the swallowing; (c) just before release into to-uhm (0.6) [!].

Figure 5

Figure 6 Upper panel: Mom (left) and Virginia (right) as Mom says fall stuff (Example (14), line 9). They share mutual gaze. Lower panel: Mom (left) and Virginia (right) as Virginia produces her click (Example (14), line 11). Virginia and Mom are visibly disengaged from one another.

Figure 6

Figure 7 Spectrogram, waveform and beats for Example (15). Beats placed automatically by algorithm (http://cspeech.ucd.ie/~fred/beatExtraction.php).

Figure 7

Figure 8 Example (16): Multiple clicks followed by laughter from another speaker. Beats are placed on each of the clicks, and the final click coincides with the onset of the laughter pulses. The red grid is aligned with the clicks, and the lines are spaced without reference to the audio. The laughter pulses are initially timed with shorter intervals than the clicks, but start on beat with the final click.