Difference between revisions of "Bamman et. al., FIRST MONDAY 2012"

From Cohen Courses
Jump to navigationJump to search
Line 22: Line 22:
 
</math>
 
</math>
 
, where <math>d_w</math> is the number of times a message with the term <math>w</math> was deleted and <math>n_w</math> is the number of messages with <math>w</math>.
 
, where <math>d_w</math> is the number of times a message with the term <math>w</math> was deleted and <math>n_w</math> is the number of messages with <math>w</math>.
 +
 +
Furthermore, a statistical test is performed (using the one–tailed binomial p-value) to find the terms whose deletion rates are abnormally high. These terms are then analysed manually.

Revision as of 22:44, 29 September 2012

Citation

David Bamman, Brendan O'Connor and Noah A. Smith. 2012. Censorship and deletion practices in Chinese social media. In First Monday.

Online version

Censorship and Content Deletion in Chinese Social Media

Summary

This Paper attempts to characterize the practices of censorship and message deletion in Sina Weibo (Chinese counterpart of Twitter).

Message Deletion

To build a corpora of messages and their annotations (whether the message was deleted), the Weibo messages were queried over a period of three months. Later, it was checked if the message still existed in the present time. If not, it means that the message was deleted.

To analyse topics that are likely to be deleted, the authors calculate the term deletion rate for each term , defined as follows.

, where is the number of times a message with the term was deleted and is the number of messages with .

Furthermore, a statistical test is performed (using the one–tailed binomial p-value) to find the terms whose deletion rates are abnormally high. These terms are then analysed manually.