Web KB dataset

The Web KB dataset contains webpages from computer science departments. There are two main versions of this dataset:

  • 8,282 webpages from CS departments, classified as student, faculty, staff, department, course, project, or other, with no link information between webpages.
  • 1,031 webpages classified as course or non-course, with link information between the webpages.
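
To make the difference between the two versions concrete, here is a minimal Python sketch of how a labeled page from either version might be held in memory. The `Page` record, its field names, and the `validate` helper are hypothetical and chosen for illustration only; they do not reflect the dataset's actual file format or any official loader.

```python
from dataclasses import dataclass, field
from typing import List

# Class labels exactly as listed in the two version descriptions.
SEVEN_WAY_LABELS = ("student", "faculty", "staff", "department",
                    "course", "project", "other")
BINARY_LABELS = ("course", "non-course")


@dataclass
class Page:
    """One labeled webpage (hypothetical layout, not the dataset's file format)."""
    page_id: str
    label: str
    # Outgoing links to other page_ids; stays empty for the version
    # that ships without link information.
    links: List[str] = field(default_factory=list)


def validate(pages, allowed_labels):
    """Check that every label is allowed and every link points to a known page."""
    known_ids = {p.page_id for p in pages}
    for p in pages:
        if p.label not in allowed_labels:
            raise ValueError(f"unknown label {p.label!r} on page {p.page_id!r}")
        for target in p.links:
            if target not in known_ids:
                raise ValueError(f"page {p.page_id!r} links to unknown page {target!r}")
    return True


# Tiny made-up example in the style of the linked, two-class version.
linked_pages = [
    Page("p1", "course", links=["p2"]),
    Page("p2", "non-course"),
]
validate(linked_pages, BINARY_LABELS)
```

Under these assumptions, the same record type covers both versions: the seven-class version simply leaves `links` empty, while the two-class version fills it in.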
