Twitter Dataset for Sarcasm

From Cohen Courses
Jump to navigationJump to search

This is one of the Dataset.

Twitter is a microblogging service, it allows users to publish and read short messages called tweets. The length of tweet is less than 140 characters. This dataset contains a large amount of unique tweets for sarcasm detection.

  • # Tweets = 5.9 million
  • Average number of words in tweets = 14.2

18.7% tweets contains a url 35.3% tweets contains reference to another tweeter 6.9% tweets contains at least one hashtag (#sacasm hashtag is one of the hashtags)


Relevant Papers