Difference between revisions of "Twitter Dataset for Sarcasm"
From Cohen Courses
Jump to navigationJump to search (Created page with 'This is one of the [[Category::Dataset]]. Twitter is a microblogging service, it allows users to publish and read short messages called tweets. The length of tweet is less than …') |
|||
Line 12: | Line 12: | ||
== Relevant Papers == | == Relevant Papers == | ||
− | {{#ask: [[UsesDataset:: | + | {{#ask: [[UsesDataset::Twitter Dataset for Sarcasm]] |
| ?AddressesProblem | | ?AddressesProblem | ||
| ?UsesMethod | | ?UsesMethod | ||
}} | }} |
Latest revision as of 10:44, 30 September 2012
This is one of the Dataset.
Twitter is a microblogging service, it allows users to publish and read short messages called tweets. The length of tweet is less than 140 characters. This dataset contains a large amount of unique tweets for sarcasm detection.
- # Tweets = 5.9 million
- Average number of words in tweets = 14.2
18.7% tweets contains a url 35.3% tweets contains reference to another tweeter 6.9% tweets contains at least one hashtag (#sacasm hashtag is one of the hashtags)