Edinburgh Corpus

From Cohen Courses
Jump to navigationJump to search

Twitter data set found here and it was created by a group at the University of Edinburgh to provide valuable data to researchers working in social media, natural language processing, large-scale data processing, and similar areas.

Quick Stats:

  • 97 million tweets
  • 9 million users
  • 2 billion words
  • time period between November 11, 2009-February 1, 2010