During the summer and autumn of 2012, an escalation of the Sino-Japanese conflict over the sovereign status of the Diaoyu/Senkaku Islands (diaoyudao 钓鱼岛) unfolded in East Asia. The crisis, which grew out of competing Chinese and Japanese claims to the territory, began in mid-August when a small group of protesters set sail from Hong Kong with the intent of landing on the disputed terrain. Upon reaching their destination, they were immediately detained (and later released) by the Japanese. Their detention infuriated many within China. Subsequently, on 11 September, Tokyo nationalized some of the islands by purchasing them from private Japanese owners. This move may have been designed to de-escalate the tense stand-off over this maritime region; however, if so, it wildly backfired as it was widely viewed within China as an especially provocative measure. As such, it catalysed a series of mass Chinese protests against Japan. Many of these demonstrations were accompanied by harsh anti-Japanese rhetoric and attacks against property and individuals perceived to have ties to Japan.
This article grows out of the conviction that studying how this dispute was portrayed on China's leading social media site, Sina Weibo (weibo 微博), offers us a vantage point from which to contribute to the expanding academic debate about both Chinese social media and Beijing's approach to the disputed islands. The paper draws on a new big dataset of over 40,000 influential bloggers on Weibo and measures micro-bloggers' sentiments during the 2012 crisis.Footnote 1 We isolated the posts within this collection that were written in August and September 2012, then culled these down to those topically relevant to the Diaoyu/Senkaku Islands and used a combination of hand-coding, keyword analysis and computer-assisted text analysis (CATA) to categorize the sentiments expressed in these posts and identify how such expressions changed over time. This process gives us the most comprehensive study to date of how Weibo users discussed the islands as China and Japan were squaring up in the sovereignty conflict, and allows us to ask the following questions. First, what was the degree of cohesiveness, and level of volatility, of nationalist sentiments such as those that drove forward the mass protests? Second, were such sentiments allowed to be expressed, or were they censored? If the latter, to what degree were they censored, given that these sentiments unfolded simultaneously with real-world street demonstrations and in the context of the Chinese state's extensive online censorship programme? Conversely, if such virulent views were not censored at certain times, why not?
To preview our findings, we first note that there is significant heterogeneity among Weibo users in terms of the sentiments they express. Surprisingly, the referent of many micro-bloggers' harshest nationalist invective was directed not towards Japan but at a Chinese state they characterized as ineffectual and corrupt. This level of anti-government sentiment was both deeper and broader than we had anticipated. However, we also discovered that virulent voices did not always dominate Weibo traffic. In fact, moderate posts, especially those subsequent to crucial real-world events in September, served as a counterpoint to the more rabid views that until then had held sway. During the periods of greatest Weibo use, such level-headed commentaries actually eclipsed the more virulent sentiments and revealed that, at least in this particular case, the forum was not as monopolized by angry youth as conventional wisdom would lead one to expect.
This finding then leads us to question why such comments were not uniformly censored for the entire duration of the dispute, given the Chinese government's strategy of suppressing comments that “represent, reinforce, or spur social mobilization.”Footnote 2 Specifically, we develop a method for estimating the proportion of all Weibo posts in our sample that were censored on each day of the dispute (roughly 49 days) between August and September 2012. We find that while censorship was consistently high in September, it dropped precipitously in mid-August immediately after the protesters landed on one of the islands and during an initial wave of street demonstrations.
Our data strongly suggest three theoretical explanations that address the above questions: two are based on “instrumental-strategic” logics, and one on an ideational or “belief-driven” background condition. First, our finding that censorship plummeted in mid-August is strongly consistent with the logic of “strategic censorship,”Footnote 3 a phenomenon posited by scholars of anti-foreign protestsFootnote 4 and Chinese nationalism more specifically,Footnote 5 but which until now has lacked strong empirical testing. This logic has both domestic and international components. Domestically, once China's leaders sensed that the Chinese public was enraged, they realized the need for a “safety valve”Footnote 6 through which such anger could be “vented” (faxie 发泄) in order to prevent it from leading to greater social instability or turning even more sharply against the state. Internationally, Beijing had few strategic options for showing its determination to defend China's territorial claim, and it appears that allowing virulent online opinion to be expressed, along with its harsh official rhetoric to generate “audience costs,”Footnote 7 only served to tie its hands further in regard to its dispute with Japan.
While these two explanations provide the most leverage to explain exactly when and under what circumstances the drop in censorship occurred, alone they are arguably insufficient to account for why China's leaders would be willing to tolerate even a moderate amount of risk in the first place in order to carry out such an instrumental strategy, given that they routinely suppress all manner of collective actions and related speech, even in cases where such mobilization does not obviously threaten social stability or state legitimacy. To explain why the anomaly of large-scale nationalist protest in China and the accompanying internet commentary, which is not permitted to the same degree on any other issue, is even thinkable among Chinese elites requires understanding the extent to which they have become predisposed to identify with the nationalist rhetoric they have espoused as a legitimizing narrative since the 1989 Tiananmen crisis. In other words, China's leaders were primed to look favourably on protesters' anger at Japan over perceived provocations during the dispute. This then made thinkable the possibility of temporarily allowing such animosity to be expressed online, even when it was directed back at Beijing itself, in an exception to the usual repression of information to which such mass mobilizations in China are subject.
Therefore, although our data speak most strongly to the instrumental nature of the leaders' decision to ease censorship on 18 August and cannot show that such belief-driven considerations were a proximate cause of the pattern we observe, we also caution against overlooking the elites' own national identity construct as a necessary background condition that motivated them to find a way to express their own anger against Japan – while working within the “rationalist” calculus of balancing risk to social or regime stability with the tangible benefits of sending strategic signals to domestic and foreign audiences. Chinese leaders may well have bought into the nationalist narratives they themselves had propagated, and yet, within this overall belief system, acted instrumentally in their handling of the Weibo commentaries.
Finally, although our data do not allow us to identify conclusively a single “correct” explanation, we are still able to rule out two alternatives. First, we provide evidence that such expressions of anger on Weibo were authentically grassroots. They emerged from the bottom-up in direct response to real-world events and were not part of a public relations campaign orchestrated from Beijing. Second, our empirical findings and theoretical explanations are both largely consistent with the most significant recent findings in the literature regarding online censorship and collective action that contend that entire topics are censored at high rates whenever they closely accompany real-world collective action events.Footnote 8 However, our data's overall pattern, and our argument regarding what has shaped it, especially the observed drop in censorship in mid-August, does raise questions about the emerging conventional wisdom in the field about such trends.
Between a Rock and a Hard Place: The Dual Rise of Nationalism and Social Media
One of the core questions for any study of the 2012 Diaoyu/Senkaku Islands crisis concerns the role that Chinese nationalism played in this volatile stand-off. To what extent did nationalist sentiments during this period run amok, and did they reach such a fevered pitch that they came close to pushing Beijing into armed conflict with Japan? Such a query, while provocative, constitutes nothing more than an extension of a long standing debate among students of contemporary Chinese nationalism about the degree to which such a collective identity is on the rise in China, and who has agency – the state or the people?
These inquiries were first forwarded by leading Sinologists such as Allen Whiting and Michel Oksenberg.Footnote 9 They were then taken up by a handful of pioneering scholars, including Suisheng Zhao and Peter Gries, amongst others.Footnote 10 All of this work tended to agree that new Chinese nationalist narratives drew upon collective memories of China's historical greatness during the dynastic period, the country's suffering under foreign aggressors during the “century of humiliation,” and its subsequent rise under the leadership of the Chinese Communist Party (CCP). Beyond such points of agreement, however, earlier observers were divided over the balance within such stories between the belief-driven (in terms of their resonance at a popular level and the degree to which they were internalized within China) and the instrumental (as intended to inculcate loyalty and secure state legitimacy).
Nearly a decade of subsequent scholarship has repeatedly portrayed Chinese nationalism as a double-edged sword, which China's leaders both manipulate to their advantage and are periodically beholden to (as it is somewhat beyond their control).Footnote 11 However, we find this work wanting on two fronts. First, work to date has focused on events that occurred prior to the rise of social media or, when analysing more recent developments, it has not placed social media at the centre of analysis. At most, one sees an inclusion of various facets of Chinese cyberspace in analysis of Chinese nationalism, but this has largely been limited to looking at websites and blogs and not social media. Yet, it is clear that social media have become one of the most popular forms of political internet usage in China. More pointedly, the 2012 Diaoyu/Senkaku confrontation was China's first major post-Weibo international crisis, and the crisis was one of Weibo's top trends of 2012.Footnote 12 This, then, begs analysis that gives this new medium pride of place.
Even more importantly, recent work has done little to shed new light on the overarching question of the extent to which Chinese nationalism has become a Pandora's box for the Chinese state, one that it opened long ago, and can no longer close. There are two primary reasons for such an oversight. First, when students of Chinese nationalism have talked about nationalism as a threat, or a risk, to Beijing, they have done so via the use of conceptually vague language. For example, Jessica Weiss's International Organization article, which treats managing nationalist protests as a precarious balancing act for China's leaders, notes that, “In managing nationalist protests, autocrats weigh the risk to the status quo against the cost of using force or coercion to prevent citizens from gathering in the street.”Footnote 13 In the same vein, James Reilly references danger to the state, noting, “For Chinese leaders, swelling nationalist protests pose a dangerous dilemma.”Footnote 14 In addition, Suisheng Zhao's recent re-appraisal of Chinese nationalism contends that, “Enjoying an inflated sense of empowerment supported by its new quotient of wealth and military capacities, and terrified of the uncertain future due to increasing social, economic and political tensions at home, the Chinese state has become more willing to play to the popular nationalist gallery in pursuing the core interests.”Footnote 15
We suspect that such claims may have validity; however, we are also struck not only by their vagueness but also the extent to which they lack any clear approach for measuring the extent of such apparently rising challenges. To be blunt, if the theoretical danger of nationalism to the Chinese state is left this vacuous, then it is impossible to know how much of this sentiment is percolating within China. Thus, the second limitation of the discussion of Chinese nationalism has been its lack of empirical referents.Footnote 16
In contrast to the literature on Chinese nationalism, the study of censorship in China and elsewhere is only a few years old. Prior to the recent wave of methodologically sophisticated studies,Footnote 17 work by authors such as Rebecca MacKinnonFootnote 18 and Evgeny MorozovFootnote 19 explicated the mechanics and theorized about the intent of China's and other authoritarian governments' censorship programmes. These authors tended to conclude that autocrats were interested in censoring the internet either simply because citizens used online space to criticize officials or policies, or because they used it to organize collective actions. What these authors did not consider is the possibility that, in select instances, the state might choose not to censor the internet in order to derive benefit from what citizens were saying. In other words, while recent studies have made great strides in establishing a broad motive for Beijing to suppress online promotion of collective action, we are not satisfied that they have explained all, or even most, variation in precisely when and where censorship occurs.
This article, while not free of its own shortcomings, addresses the defects in both literatures. We set up a “risk” ladder for classifying different strains of nationalist sentiments according to the degree we think Beijing would view them as threatening, and from these are able to make descriptive inferences about the first research question above. To address the second question, we estimate the day-by-day censorship rate during the dispute (the percentage of all posts deleted by censors on a given day).
Weibo as Big Data: Measuring Censorship and Nationalist Sentiment in Real Time
We addressed the questions raised above by utilizing a trailblazing dataset collected by researchers at the University of Hong Kong.Footnote 20 To our knowledge, it is the most comprehensive dataset of Weibo posts currently available. The methodology used to collect it is described in detail by Fu, Chan and Chau.Footnote 21 Each row in the dataset consisted of one social media post (including reposted or “retweeted” content), plus associated meta-data. The data files did not include multimedia such as images or videos. We relied only on the post text and counted embedded reposts as part of the text.Footnote 22
To prepare the data for analysis, we first extracted posts placed between 13 August and 30 September, as we found these contained the vast majority of Diaoyu-related commentary. Next, we filtered the posts according to whether they contained a keyword, “Diaoyu Islands” (diaoyudao 钓鱼岛), that was highly predictive of discussion of the dispute. This left about 145,000 posts over 49 days. One immediate concern with filtering using this (or any) keyword is whether estimates of sentiment categories derived from such a sample are generalizable to the broader population of all topic-relevant posts. Accordingly, we present select results below for both posts containing “Diaoyu Islands” and a sample that did not but which we found, through careful reading, also to be topically relevant. To preview our findings, adding this additional sample strengthens our overall results.
How dangerous a notion? Making sense of how “risky” Weibo posts appear to the Chinese state
Beyond simply tracking how much Weibo commentators said during this period, we were especially interested in determining the target of any such commentary. To structure our inquiry in this regard, we developed a set of sentiment categories to apply to the data. The process was iterative. First, we carefully read through a random sample of posts drawn from across the time period in order to sort the posts inductively into categories based on their distinctive sentiment strains, alongside a few “residual” categories such as reposts of news content. We then organized the non-residual categories into an ordinal “collective action risk ladder,” ranging from one to five. With this scale established, the two authors, along with a third coding assistant, worked to assign a separately drawn random sample of 479 posts into one of the eight categories. After working independently, the three coders met to reconcile scores, and we report inter-coder reliability statistics in Appendix A.
The key to this enterprise was the “risk ladder” that hand-coding allowed us to operationalize. At the lowest rung (level one), we designated as “moderate” those posts which cautioned against the potential negative consequences of direct action against Japan, or which objected to the violent turn that protests took over the course of our 49-day sample. The latter of these views is exemplified by the statement: “Protecting the Diaoyu Islands does not involve harming our fellow citizens!” (baowei diaoyudao bushi shanghai ziji de tongbao 保卫钓鱼岛不是伤害自己的同胞!). As such posts mostly opposed the virulent and even violent excesses of more extreme fellow Chinese, we felt that Beijing would view these voices as posing little risk.
The second category (level two) included posters who parroted Beijing's own official statements about the Diaoyu Islands, and in particular their status as part of China, and the country's determination not to cede such territory. While such a category could encompass a wide variety of phrases, we anticipated its most common refrain to be the patriotic assertion: “The Diaoyu Islands are China's” (diaoyudao shi Zhongguode 钓鱼岛是中国的). Such posters were at least partially mobilized. Therefore, they represented an a priori risk to Beijing compared with the moderates. However, this risk was minimal because the sentiments these individuals endorsed were so orthodox vis-à-vis the state.
The third rung (level three) encompassed posts containing strongly anti-Japanese sentiments and those that actively blamed the Chinese people (not state) for their weakness in not standing up to Japan. The “anti-Japanese” subset of this category generally spouted derogatory anti-Japanese rhetoric, such as “little Japs” (xiao Riben 小日本), “Japanese pirates” (wokou 倭寇), or “Japanese devils” (Riben guizi 日本鬼子). In contrast, the second subset contained posts lamenting China's inferiority or fecklessness vis-à-vis Japan without in any way implicating Beijing. While these two subsets are substantively very different, we estimated that they posed approximately the same degree of risk to the state. The reason that this group ranks as a higher risk than the level two group is owing to the fact that posters' sentiments about the dispute went well beyond the scope of the state's official stance on the islands.
The next ordinal category (level four) consisted of posts containing specific calls to action. These included, for example, endorsing “boycotting Japanese goods” (dizhi Rihuo 抵制日货), supporting “protests” (kangyi 抗议), or even voicing demands to “deploy military forces” (pai bing 派兵). Not surprisingly, we found this category to be quite heterogeneous. Such sentiments called for more action than Beijing had taken, and were more pointed than the invective that predominated at level three as they contained specific suggestions and/or threats regarding how best to settle China's score with Japan. As such, these posts increased the risk of collective action for the government, as we believe individuals willing to vocalize the desire to “do something” would be more likely, all other things being equal, either to join street demonstrations or endorse them online.
Finally, we anticipated that some posts (level five) would pose a high risk to China's leaders in that they would directly criticize the state (either for its handling of the dispute, or along broader lines). Posts that darkly referred to the abuses of power of China's notorious City Urban Administrative and Law Enforcement Bureau (chengguan 城管) – cemented in the popular consciousness as low-level police “thugs” – or that highlighted the impotence of China's “dear military” (guibing 贵兵) and/or the Communist Party, were all conceptualized as part of this category.
After developing the categories and completing hand-coding, we paused to view a simple summary statistic: post counts of diaoyudao for each of the 49 days. We calculated these directly from the population of 145,000 posts to determine if there were sub-sets of days that merited particular attention. Figure 1 shows these results.
Looking at the graph, we noticed four dates (16 and 18 August, and 11 and 15 September) that stood out as having high post volume and that corresponded to important real-world events. The first date, 16 August, was the day after activists landed on one of the Diaoyu Islands, and captures much of netizens' initial reaction to events. On 18 August, there was a peak of public criticism of the Chinese government's perceived inactivity in securing the activists' release from Japanese authorities. This was also the day before a group of Japanese activists landed on the islands. On 11 September, the Japanese government officially nationalized the islands, and 15 September was the beginning of the weekend that saw the largest street demonstrations of any weekend during the time period. While we could have also picked other key dates of interest, we believe that these four – two from each wave – adequately capture the main peaks in netizens' reaction to events. We therefore subjected these dates to closer scrutiny, and drew four additional random samples of 150 posts, one for each date.
So, what was being said? Keywords as indicators of sentiment
Hand-coding post samples for the entire time period and for select days allowed us to estimate the proportions of our sentiment categories for the whole period as well as for those days. However, we wished to go beyond this and to infer the proportions for other, non-sampled days, thus allowing us to generate time series consisting of daily category proportions for the entire period. We used two measurement techniques towards this end. First, we used keywords to proxy for what we found, through extensive reading, to be particularly characteristic refrains in each category, and second, we used a computer-assisted text analysis (CATA) method called ReadMe.Footnote 23
We leave discussion of ReadMe to the next section, and here introduce our keyword measures. Beginning with the moderate category, we discovered that two terms, “rational patriotism” (lixing aiguo 理性爱国) and “smash” (za 砸 – with reference to admonishments not to carry out such activities against Japanese goods owned by Chinese citizens), were the most prevalent terms. Not surprisingly, the simple declaration that “the Diaoyu Islands are China's” was the dominant refrain within our patriotic/parroting category. Owing to a lack of meaningful variation, we ended up dropping level three (anti-Japanese and self-blame sentiments) from the analysis, as it did not seem to relate closely to either current events or the other series. Next, the call to action category was dominated by various statements in support of “boycotting Japanese goods,” then underway within China during the late summer and early autumn of 2012. Finally, the highest risk category, while containing a wide array of criticisms, was most consistently voiced via protesters' satirical use of the phrase “heavenly dynasty” (tianchao 天朝), a reference to the ineptitude and decline of the Qing dynasty, to refer to the contemporary Chinese state. While on the surface the phrase is one of deep respect and admiration, within the context of our posts it was used to underscore the shortcomings of China's leaders and the domestic political status quo. We derive this reading of the term from the manner in which it was consistently used in unison with sharp criticism of government policies, and also from our review of the secondary literature on Chinese social media, which has recently highlighted the term as a particularly biting form of anti-government commentary.Footnote 24
The Censorship Rate, Sentiment Categories and Weibo's Fluctuating Risk and Benefit during the Crisis
When was the censorship rate “low” and what did this mean? Estimating the rate of Weibo post deletions
Our findings depend on two critical observations: first, that the censorship rate was low during the protest wave in August but high in September, and second, that which sentiment category in each wave was largest be counterintuitive to observed low or high censorship rates for that wave – in other words, that the prevalence of certain sentiments in each wave be unanticipated by existing theory, given that censorship is high or low. First, we estimate the censorship rate, i.e. the percentage of Weibo posts deleted out of all topic-relevant posts on a given day. To narrow the comparison, we choose to focus on 18 August and 15 September as the peaks of the two waves. Table 1 shows estimated censorship rates for these two dates, both among posts containing diaoyudao and posts without this keyword.Footnote 25
While confidence intervals for sample estimates are wider owing to our smaller sample of non-diaoyudao keyword posts, it is still evident that censorship is much lower on 18 August than 15 September. Taking this a step further, we were able to estimate the censorship rate (for posts containing diaoyudao) across the entire time period, as shown in Figure 2.Footnote 26 This graph shows that the censorship rate remained fairly constant throughout the dispute except on 18 August, when it plummeted to about 22–27 per cent. This contrast suggests that authorities' ordinary response for much of the dispute was to censor at a high rate – the fact that the rate dropped so sharply in mid-August, therefore, suggests that they made a deliberate decision to censor less at that time.
Turning to the estimates for our two key dates, we note that the results for posts lacking diaoyudao accentuate those derived from sampling on that keyword. For 15 September, the mean for non-keyword posts is higher than for posts with the keyword. Similarly, while the interval for non-keyword posts on 18 August is large, its lower bound (0 per cent), which is also its mean, lies well below that for keyword posts, raising the likely possibility that censorship on that date was even lower than the mean of about 25 per cent we observed in the keyword sample.
Moving on to sentiment: not all Weibo commenters were angry, and when they were, their target was not always Japan
Confident that we had estimated the censorship rate with sufficient accuracy and that its dip in August cut against the conventional wisdom, we next turned to estimating sample proportions for all categories, both averaged across the entire time series and for specific days, as shown in Table 2. The 49-day results were estimated directly from our hand-coding of 479 observations, and the day-specific results from the 150 observations drawn for each of the four targeted days. In Table 3, we also again present results with and without the diaoyudao keyword.
While proportions for posts containing, and lacking, the diaoyudao keyword differ somewhat, the major trend of each wave – the rise and fall of the “anti-government” and “moderate” categories – is similar for both: the proportion of anti-government posts surges in August, peaking on 18 August at above 50 per cent (and as high as 71 per cent for non-keyword posts). Similarly, the moderates category shows a sharp increase in September, with the trend more prominent in non-keyword posts.
Turning to the other categories, we first note that the patriotic category differs between the two samples on 15 September. This is owing to the fact that since the non-keyword sample excluded posts containing diaoyudao, it missed what was by far the most common instance of this category: the phrase “the Diaoyu Islands are China's.” This suggests that the patriotic category consisted mainly of people content merely to repeat the above meme and to show their support for China's claim to the islands. What these individuals were not doing was either calling for real-world actions or criticizing their own government. Similarly, the news category is lower on both dates in the non-keyword sample. This reflects the fact that many uses of the diaoyudao term were embedded in news content that Weibo users then reposted.
Our initial sampling on this keyword thus biased the proportions towards patriotic parroting (per the above phrase) and towards news content, and likely correspondingly reduced the proportions in other categories. Still, our main finding holds up well: anti-government commentary dominated in August and a moderate backlash occurred in September. These two categories, for their respective time periods, are the highest of any of the five ordinal categories in both samples.
Using more sophisticated measurement techniques to examine Weibo sentiment more closely: still less anger than one might expect, yet more anti-government sentiment than Beijing would have liked
While the above category proportion estimates give a general idea of the timing of key sentiment waves, we wish to explore in greater depth the ebb and flow of key refrains we found to be characteristic of these categories. Figure 3 shows these for four key categories.Footnote 27 The graph's most distinctive aspect is the sequencing of the four time series. The patriotic refrain, “the Diaoyu Islands are China's,” as Sina Weibo's tenth most common “hot topic” (remen huati 热门话题) of 2012, proxies for micro-bloggers' general attention to the dispute, and particularly their response to real-world events. This phrase surges immediately following the activist landing on 15 August, and actually anticipates by a few days Japan's nationalization of the islands on 11 September.Footnote 28 The other keyword series then unfold in the context of this macro trend. “Heavenly dynasty” peaks on 18 August, while two phrases representing a moderate backlash – decrying the “smash[ing]” of Japanese cars and calling for “rational” patriotic expression – surge in mid-September.
In interpreting these keyword series, we emphasize that they capture only a portion of the variation in each category – a wide range of phrases were used throughout the dispute to express anti-government and moderate sentiments. To address this issue, we used the ReadMe program mentioned above, which used the information contained in our coding team's manual classification of posts to estimate category proportions from among all posts containing the diaoyudao keyword.Footnote 29Figure 4 displays the time series that resulted from this algorithm.
While ReadMe produced results that diverged from our directly estimated proportions for key dates, the fact that, despite such a discrepancy, the results nonetheless evince the same general trends as our keyword series in Figure 3 gives us confidence that by using multiple measurement techniques we have adequately characterized the timing and rough magnitude of real-world online sentiments during the dispute. For example, we note that anti-government sentiment is highest around 18 August, while moderate views trended upward throughout the dispute and prevailed in mid-September.Footnote 30
Conclusion: Given the Volatility of Expressed Sentiments, Why Was Weibo Not Heavily Censored during the Crisis?
Our most surprising empirical results from this project were threefold. First, the censorship rate of online discussion was lower at a critical juncture in August than extant literature would predict. Second, much of the netizens' anger was directed at Beijing, not Tokyo. Finally and paradoxically, we found that more moderate voices appeared to counter these virulent sentiments directly after a broader swath of the public tuned in to online discussion of the dispute in September.
With these results in hand, we turn to jointly interpreting the facts of low/high censorship at key moments in August/September, the prevalence of certain sentiment strains at these points in time, and the real-world events – both on the Chinese streets and between the Chinese and Japanese governments – that were then unfolding. Beginning in August, the Chinese activists who landed on the islands were detained on 15 August and released two days later. Initial reports of street protests in China broke on 15 August.Footnote 31 On 17 August, Japan's Kyodo news service reported that messages had appeared on Weibo calling for anti-Japan demonstrations, and noted that the posts were not immediately deleted.Footnote 32 On 19 August, a group of Japanese activists landed on the islands. Perhaps anticipating, or at any rate quickly reacting to this development, protests unfolded that same day in several major Chinese cities and were covered by Xinhua.Footnote 33 By 21 August, however, mainstream media were attempting to calm protesters and avert further violent escalation.Footnote 34
The picture that emerges from this chain of events is that Beijing had to have been well aware, from the moment the Chinese activists landed on the islands, if not before, that nationalist-minded Chinese would seize quickly on the activists' detention and immediately begin criticizing their own leaders for showing insufficient resolve. In spite of this, leaders allowed Weibo traffic in precisely this vein to manifest alongside growing street demonstrations. Although the anti-government comments we analysed from 18 August mostly did not reference specific collective actions, and thus our findings do not directly contradict those of King, Pan and Roberts,Footnote 35 our observation is still puzzling as intuitively we would expect the Chinese state to suppress all related discussion on such a sensitive topic, especially when real-world collective actions were already underway.
This finding stands in even sharper relief when we consider that, in mid-September, the government censored comments that condemned excessive actions. We believe that the theory of collective action potential, supported by King, Pan and Roberts,Footnote 36 adequately explains this, as these moderate posts were referencing actual destructive events such as car-smashing, albeit to criticize such acts. If this theory held in September, however, then why not on 18 August? Still more puzzling is the fact that while it suppressed criticism of car-smashing on Weibo, the government continued to tolerate massive street demonstrations across mainland China through 18 September. Moreover, Chinese Foreign Ministry spokesman Hong Lei 洪磊 explicitly mentioned the anti-Japan demonstrations on as late as 19 September as justified in the defence of Chinese sovereignty, while Xi Jinping 习近平 called the islands' purchase a “farce.”Footnote 37 The People's Daily and other major Chinese media outlets also published commentaries which ranged from voicing “sympathy” for the demonstrations to outright support.Footnote 38
In light of the Chinese government's tolerance of or even support for the demonstrations in both August and September, why would it relax control over social media in the first wave but not in the second? What purpose might be served in furthering Beijing's strategic priorities by selectively opening the floodgates of online criticism in China against the government while managing street demonstrations and print media differently? As stated in the introduction, we hypothesize that a combination of three forces, two instrumental and one ideational, underlay this turn of events.
First, the speed with which street demonstrations unfolded after the activists' detention and the simultaneous presence of anti-government sentiment on Weibo are together consistent with the international circumstances. Chinese leaders needed to signal their resolve both to their own people and to foreign audiences to prevent Japan from making further advances to consolidate control over the disputed territory without resorting to military action. These circumstances prompted Beijing to attempt to turn micro-bloggers' anger at their own government to its advantage. Our finding thus supports Weiss's argument that the Chinese state selectively allows nationalist protests as a form of risky behaviour that pressures Beijing to stand firm in international disputes, lest the protesters escalate their actions and turn against their leaders.Footnote 39 While Chinese leaders in 2012 followed this logic regarding real-world demonstrations, social media platforms such as Weibo are also recognized for their ability to amplify collective action, as shown by the use of Facebook and Twitter in the Arab Spring.Footnote 40 Allowing Weibo discussion to take place alongside street demonstrations during the dispute served as Beijing's intended signal to elevate Japanese (and US) perceptions of the domestic instability risk facing China.
Second, our observations raise the possibility that Beijing felt it had to let nationalist-minded netizens “blow off steam” (faxie 发泄). The 2012 Diaoyu dispute was a high-profile event which many netizens quickly learned about. Research suggests that visible censorship – censorship that online citizens know is occurring – tends to backfire by inducing greater information-seeking behaviour.Footnote 41 In other words, had Beijing resorted to crude information suppression immediately after the activists' detention in an attempt to quell the narrative that it was to blame for being spineless, this action might paradoxically have propelled angry netizens to seek out and push this narrative.
Third, just as the surge in anger at Japan and netizens' simultaneous criticism of Beijing pushed leaders to allow these sentiments to be vented domestically, it is also likely that leaders personally shared the anger expressed by citizens online and in the streets. It would be surprising to think that China's ruling elites were immune to the anti-Japanese nationalist rhetoric which they had regularly produced over the years prior to the 2012 crisis. As such, they were predisposed towards looking favourably upon an online platform that provided them with a mechanism through which anger directed towards Japan could be expressed without the greater destructiveness and risks to social stability that accompany street actions. This is not to imply that the Chinese leaders blindly congregated around such a refrain; rather they lived within a social milieu (of their own making) in which negative perceptions of Japan had become naturalized. Thus, they were inclined to turn a blind eye towards Weibo posters at such a critical juncture in Sino-Japanese relations at a time when Tokyo was widely considered within China to have trampled upon the country's sovereign rights and disrespected its rising international status. This is not to say that leaders in Beijing were governed by their emotions, but rather that their instrumental calculations emerged from a cauldron of xenophobia.
Because the dispute was still in its early stages in August, Beijing may have been more willing to tolerate some genuine risk in order to signal its resolve. By September, however, demonstrations had escalated and become more destructive. While Beijing may not have been able (or willing) to rein in street demonstrations immediately, it was much more capable of tightening online censorship in order to stem the spread of information about such destructive acts. Furthermore, as time passed and more moderate voices entered the online sphere to oppose the virulent excesses of the protesters, the government no longer faced the same pressure to permit an outlet through which extreme voices could vent their anger – they had already done so and would eventually need to be reined in on the streets as well as online. While street protests raged on for another few days and Chinese leaders continued to make tough statements, censoring Weibo represented a first step towards bringing the protest wave to an end.
Appendix A: Procedure, Inter-coder Reliability and Keyword Validity
Three coders each independently read and scored 479 Weibo posts.Footnote 42 Each post contained any original text, plus any reposted or “retweeted” content. To simplify analysis, we counted both the original text and the reposted text (if any) as part of the same message unit, i.e. we read the posts with an eye to gauging the sentiment of this overall combination, rather than considering original and reposted sentiments separately. In Appendix Table 1, we report common reliability statistics.
Additionally, we calculated a unique statistic to take account of how often we had to resort to flipping a coin to break an impasse over two codings.Footnote 43 Given the difficulty of the coding exercise, about 40 per cent of the time we resorted to brief discussion in order to reconcile different codings. These discussions usually lasted only a minute or two, and frequently one or more coders was eager to change his or her mind, having felt that he/she had incorrectly assigned a post owing to error or fatigue. Instances where coders disagreed with each other to the point where arriving at a consensus score was impossible were infrequent, and occurred only 6 per cent of the time.
Keyword validity
After completing coding, we wished to evaluate the correspondence between our human-derived sentiment categories and keyword proxies. One measure of this is what percentage of posts containing a given keyword ended up belonging to the “appropriate” category for that keyword.Footnote 44 This measure is in Appendix Table 2. The results show that the incidence of tianchao perfectly predicts a post belonging to the anti-government category, and the keywords za and lixing proxy moderately well for the moderate category.
Appendix B: Modelling the Censorship Data-generating Process
One of the major difficulties in using Chinese social media data is how to deal with the bias induced by state censorship, since researchers attempting to “harvest” such data are only able to observe the blog posts they are able to download faster than censors can delete these posts. However, as long as researchers are able to capture a fraction of all censored posts, it may be possible to estimate the true censorship rate. First, assume that out of our sample of around 43,000 individuals, a fraction decide to write a post in response to some event.Footnote 45 Also assume that individuals who choose to write a post do so immediately following the event.Footnote 46 What we want to know is how many of these posts will survive (not be censored) long enough to appear in the WeiboScope data. This information is necessary to calculate our primary quantity of interest – the true censorship rate:
Where R true is the true rate, expressed as the proportion of censored over total posts, P is posts that are never censored (all of which appear in the dataset), C obs is the number of posts marked as “censored” in the dataset, and C hid is those posts that get censored but do not appear in the dataset because they are deleted sooner than the Hong Kong team can download the Weibo user timelines that contain them. The WeiboScope data scraping process, as described by Fu, Chan and Chau, involved periodically returning to the pages of the 43,000 users, downloading a copy of the timeline each time.Footnote 47 If a post was deleted between crawls (i.e. after the team's program had crawled a page during a particular iteration, but before the next one), then the researchers could compare the new record to the old one, identify the post that had disappeared in the interim, and mark it as censored. However, owing to the limits set by Sina.com, the team could only crawl most of these pages (38,000 out of the 43,000, which constituted the “verified” user group) once every 24 hours. Given a uniform distribution of sensitive post-inducing events (i.e. that they were equally likely to occur over a given 24-hour period), the average time between when a post would go up and when that verified user's page would be crawled would be 12 hours. Since Zhu et al. find that most censorship occurs within an hour or so of the post time,Footnote 48 most censored posts from the verified users were unlikely to make it into the dataset. Thus, the dataset is truncated, and R true will be biased. What we have is the observed rate, R obs , in Equation 2:
Since C hid is missing, R obs <R true, i.e. the observed censorship rate is biased downward. But how much so? The observed rate within this project for posts with the diaoyudao keyword is around 12 per cent, an oddly low figure for an episode that saw the largest street demonstrations in mainland China since Tiananmen. To calculate the true rate, we need to know the true number of posts censored, N true, which is related to N obs, the number of censored posts that we actually observe, via some probability distribution that models the speed with which censors remove posts during episodes of collective action.Footnote 49
Since we do not know the true distribution, we need to look for an empirical example that provides a good approximation. The best available so far is the finding by Zhu et al. that “nearly 90% of deletion events happen within the first 24 hours.”Footnote 50 Conveniently, this time window is the same as that of the unbiased portion of our data: 100 per cent of posts will be observed, and correctly identified as censored or not, if they survive 24 hours or more. Looking at the WeiboScope data for the keyword diaoyudao during our study period, we find that nearly all (16,967 out of 17,640) posts marked censored, or 96.2 per cent, are marked as having been deleted more than 24 hours after their posting time. For simplicity's sake, we exclude from our calculations the other 3.8 per cent of posts in the data.Footnote 51 Instead, we pragmatically treat the 16,967 posts as if they constituted WeiboScope's entire pool of censored posts. Since Zhu et al. found that 90 per cent of censorship occurs before 24 hours, 10 per cent must occur after, sometimes days or weeks later. Since we observe this 10 per cent, and critically, assuming that the form of the censorship distribution over time is the same in our data as in that of Zhu et al., the ratio of what we observe to what gets missed must be 1:9, e.g. C hid = 9C obs. This suggests that multiplying C obs by a factor of 10 will get us close to the true rate. Plugging this into Equation 1 gives:
Applying this equation to the present study gives Appendix Table 3, which shows the observed censorship rate, the number of observed posts (including non-censored posts), and the estimated true rate. We calculate these numbers for the diaoyudao keyword sample, and the non-keyword sample (for key dates), and then estimate the joint rate among all topic-relevant posts.
Given that we are applying another Weibo study's findings to a different dataset, the question might arise, given that our data consist of journalists, dissidents, and verified users with more than 10,000 followers – all sensitive groups in censors' eyes – whether 90 per cent within 24 hours is too slow a rate for our sample. Zhu et al. and King, Pan and Roberts both find that a small number of ultimately censored posts typically linger for days after an incident – the question here is how many? Our main empirical concern in this paper is underestimating, not over-estimating, the censorship rate. If we assume that the true number is 95 per cent within 24 hours,Footnote 52 i.e. C hid = 19C obs, then plugging these numbers into Equation 1 yields the information in Appendix Table 4. The rates given in Appendix Table 4 are higher than the previous estimates. However, the estimated mean censorship rate (with the diaoyudao keyword) for 18 August still falls short of 40 per cent, a rate far less than that for September and lower than that for other collective events.