Web KB dataset
From Cohen Courses
Jump to navigationJump to searchThe web KB dataset contains webpages from computer science departments. There are two main versions of this data set :
- 8,282 webpages from CS departments, classified as student, faculty, staff, department, course, project, and other, but no link information between webpages.
- 1,031 webpages classified as course or non-course, with link information between the webpages.