UCLA Blogocenter

From Cohen Courses

The UCLA Blogocenter dataset was built by the Blogocenter group at UCLA. It contains RSS feeds collected from the Bloglines, Blogspot, Microsoft Live Spaces, and syndic8 aggregators covering the past several years, totaling over 192 million blog posts. More information about the dataset can be found in Sia et al., KDD 2008.