Technorati Dataset

From Cohen Courses
Revision as of 23:37, 6 February 2011 by Nitina (talk | contribs) (Created page with 'The Technorati [[Category::Dataset|dataset]] contains blog data which was scraped of the web by the company. * A slice of data from a sixteen day period in 2006 * Contains 8.1 m…')
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to navigationJump to search

The Technorati dataset contains blog data which was scraped of the web by the company.

  • A slice of data from a sixteen day period in 2006
  • Contains 8.1 million blog posts
  • 1.9 million posts are tagged with 1.75 tags per post on an average


Relevant Papers