Difference between revisions of "Bamman et. al., FIRST MONDAY 2012"

From Cohen Courses
Jump to navigationJump to search
Line 8: Line 8:
  
 
== Summary ==
 
== Summary ==
This [[Category::Paper]] attempts to characterize the practices of censorship and message deletion in Sina Weibo (Chinese counterpart of Twitter).
+
This [[Category::Paper]] attempts to characterize the practices of censorship and message deletion in Sina Weibo (Chinese counterpart of Twitter). The paper identifies three different approaches to analyse this issue.
  
 
== Term Deletion Rate ==
 
== Term Deletion Rate ==
Line 26: Line 26:
  
 
From these terms, the authors conclude the following. Messages containing politically sensitive items are likely to be deleted. Another type of terms are terms such as "asked to resign", which have are sentitive due to real-world events. Finally, terms that occured in false rumors also have a high deletion rate.
 
From these terms, the authors conclude the following. Messages containing politically sensitive items are likely to be deleted. Another type of terms are terms such as "asked to resign", which have are sentitive due to real-world events. Finally, terms that occured in false rumors also have a high deletion rate.
 +
 +
== Comparing Twitter with Weibo ==

Revision as of 22:53, 29 September 2012

Citation

David Bamman, Brendan O'Connor and Noah A. Smith. 2012. Censorship and deletion practices in Chinese social media. In First Monday.

Online version

Censorship and Content Deletion in Chinese Social Media

Summary

This Paper attempts to characterize the practices of censorship and message deletion in Sina Weibo (Chinese counterpart of Twitter). The paper identifies three different approaches to analyse this issue.

Term Deletion Rate

To build a corpora of messages and their annotations (whether the message was deleted), the Weibo messages were queried over a period of three months. Later, it was checked if the message still existed in the present time. If not, it means that the message was deleted.

To analyse topics that are likely to be deleted, the authors calculate the term deletion rate for each term , defined as follows.

, where is the number of times a message with the term was deleted and is the number of messages with .

Furthermore, a statistical test is performed (using the one–tailed binomial p-value) to find the terms whose deletion rates are abnormally high. These terms are then analysed manually.

From these terms, the authors conclude the following. Messages containing politically sensitive items are likely to be deleted. Another type of terms are terms such as "asked to resign", which have are sentitive due to real-world events. Finally, terms that occured in false rumors also have a high deletion rate.

Comparing Twitter with Weibo