Author-Paper dataset

From Cohen Courses
Revision as of 02:43, 4 February 2011 by Mkas (talk | contribs)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to navigationJump to search

This is one of the datasets discussed in Social Media Analysis 10-802 in Spring 2010.

  • #authors = 315688
  • #papers = 471514
  • Every row represents an author, every column represents a paper.
  • Elements of bipartite graph are either 0 or 1.
  • M(i,j) = 1 : represents ith author wrote jth paper.
  • On average, every author has 3 papers and every paper has 2 authors.
  • The distribution is very skewed as most of the authors have only 1 paper.

Relevant Papers