Twitter Dataset for Sarcasm

From Cohen Courses
Revision as of 10:43, 30 September 2012 by Zeyuz

This is one of the datasets.

Twitter is a microblogging service that allows users to publish and read short messages called tweets. A tweet is at most 140 characters long. This dataset contains a large number of unique tweets for sarcasm detection.

  • Number of tweets = 5.9 million
  • Average number of words per tweet = 14.2

  • 18.7% of tweets contain a URL
  • 35.3% of tweets contain a reference to another Twitter user
  • 6.9% of tweets contain at least one hashtag (#sarcasm is one of the hashtags)
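
Summary statistics of this kind could be computed with a sketch like the one below. It is only an illustration, not the procedure used to build this dataset: it assumes the tweets are available as plain strings, that words are whitespace-separated tokens, that a reference to another user is an @-mention, and that URLs start with http:// or https://. The function name tweet_stats and the regular expressions are hypothetical.

```python
import re

# Assumed patterns (not taken from the dataset description).
URL_RE = re.compile(r"https?://\S+")
MENTION_RE = re.compile(r"(?<!\w)@\w+")   # @user not preceded by a word character
HASHTAG_RE = re.compile(r"(?<!\w)#\w+")   # #tag not preceded by a word character


def tweet_stats(tweets):
    """Return the average word count and the percentage of tweets
    containing a URL, an @-mention, or at least one hashtag."""
    n = len(tweets)
    if n == 0:
        raise ValueError("tweets must be non-empty")

    total_words = sum(len(t.split()) for t in tweets)
    with_url = sum(1 for t in tweets if URL_RE.search(t))
    with_mention = sum(1 for t in tweets if MENTION_RE.search(t))
    with_hashtag = sum(1 for t in tweets if HASHTAG_RE.search(t))

    return {
        "num_tweets": n,
        "avg_words": total_words / n,
        "pct_url": 100.0 * with_url / n,
        "pct_mention": 100.0 * with_mention / n,
        "pct_hashtag": 100.0 * with_hashtag / n,
    }
```

Calling tweet_stats on a list of tweet strings returns a dictionary whose values correspond to the figures listed above. Different tokenization or matching rules would give slightly different numbers.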


Relevant Papers