Author-Paper dataset
From Cohen Courses
Jump to navigationJump to searchThis is one of the datasets discussed in Social Media Analysis 10-802 in Spring 2010.
- #authors = 315688
- #papers = 471514
- Every row represents an author, every column represents a paper.
- Elements of bipartite graph are either 0 or 1.
- M(i,j) = 1 : represents ith author wrote jth paper.
- On average, every author has 3 papers and every paper has 2 authors.
- The disctribution is very skewed as most of the authors have only 1 paper.