MemeTracker

From Cohen Courses
Revision as of 02:36, 4 October 2012 by Tinghuiz (talk | contribs) (Created page with 'This is a dataset consisting of 343 million short textual phrases collected from online blogs with timestamps. A cascade is considered as a phrase cluster over the aggregated dif…')
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to navigationJump to search

This is a dataset consisting of 343 million short textual phrases collected from online blogs with timestamps. A cascade is considered as a phrase cluster over the aggregated different textual variants of the same phrase, and it is simply a set of time-stamps when a phrase is mentioned in the blogs.