Twitter Dataset for Sarcasm
From Cohen Courses
This is one of the Dataset.
Twitter is a microblogging service, it allows users to publish and read short messages called tweets. The length of tweet is less than 140 characters. This dataset contains a large amount of unique tweets for sarcasm detection.
- # Tweets = 5.9 million
- Average number of words in tweets = 14.2
18.7% tweets contains a url 35.3% tweets contains reference to another tweeter 6.9% tweets contains at least one hashtag (#sacasm hashtag is one of the hashtags)