Difference between revisions of "Twitter Dataset for Sarcasm"

From Cohen Courses
Jump to navigationJump to search
(Created page with 'This is one of the [[Category::Dataset]]. Twitter is a microblogging service, it allows users to publish and read short messages called tweets. The length of tweet is less than …')
 
 
Line 12: Line 12:
 
== Relevant Papers ==
 
== Relevant Papers ==
  
{{#ask: [[UsesDataset::Twiter Dataset of Sarcasm]]
+
{{#ask: [[UsesDataset::Twitter Dataset for Sarcasm]]
 
| ?AddressesProblem
 
| ?AddressesProblem
 
| ?UsesMethod
 
| ?UsesMethod
 
}}
 
}}

Latest revision as of 10:44, 30 September 2012

This is one of the Dataset.

Twitter is a microblogging service, it allows users to publish and read short messages called tweets. The length of tweet is less than 140 characters. This dataset contains a large amount of unique tweets for sarcasm detection.

  • # Tweets = 5.9 million
  • Average number of words in tweets = 14.2

18.7% tweets contains a url 35.3% tweets contains reference to another tweeter 6.9% tweets contains at least one hashtag (#sacasm hashtag is one of the hashtags)


Relevant Papers