Edinburgh Corpus

From Cohen Courses
Revision as of 21:17, 26 September 2012 by Nloghman (talk | contribs) (Created page with 'Twitter data set found [http://homepages.inf.ed.ac.uk/miles/papers/socmed10.pdf here] and it was created by a group at the University of Edinburgh to provide valuable data to res…')

The Edinburgh Corpus is a Twitter data set, found at http://homepages.inf.ed.ac.uk/miles/papers/socmed10.pdf. It was created by a group at the University of Edinburgh to provide data to researchers working in social media, natural language processing, large-scale data processing, and similar areas.

Quick Stats:

  • 97 million tweets
  • 9 million users
  • 2 billion words
  • time period: November 11, 2009 to February 1, 2010
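
To give a rough sense of scale, the figures above can be turned into per-user and per-day averages. The following is a minimal sketch, assuming the rounded Quick Stats numbers are used as-is and that the collection period is measured as the number of days between the two dates. The constant names and the `corpus_averages` function are hypothetical, introduced only for illustration, and the results are approximations rather than official statistics for the corpus.

```python
from datetime import date

# Rounded figures copied from the Quick Stats list (not exact counts).
TWEETS = 97_000_000
USERS = 9_000_000
WORDS = 2_000_000_000

# Collection period as stated in the Quick Stats list.
START = date(2009, 11, 11)
END = date(2010, 2, 1)


def corpus_averages():
    """Derive approximate averages from the rounded Quick Stats figures."""
    # Assumption: the period length is the difference between the two dates.
    days = (END - START).days
    return {
        "days": days,
        "tweets_per_user": TWEETS / USERS,
        "words_per_tweet": WORDS / TWEETS,
        "tweets_per_day": TWEETS / days,
    }


if __name__ == "__main__":
    for name, value in corpus_averages().items():
        print(f"{name}: {value:,.1f}")
```

Because the inputs are rounded to the nearest million or billion, these averages are only order-of-magnitude estimates: roughly ten tweets per user and about twenty words per tweet.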