Difference between revisions of "Author-Paper dataset"
From Cohen Courses
Jump to navigationJump to search(One intermediate revision by one other user not shown) | |||
Line 7: | Line 7: | ||
* M(i,j) = 1 : represents ith author wrote jth paper. | * M(i,j) = 1 : represents ith author wrote jth paper. | ||
* On average, every author has 3 papers and every paper has 2 authors. | * On average, every author has 3 papers and every paper has 2 authors. | ||
− | * The | + | * The distribution is very skewed as most of the authors have only 1 paper. |
== Relevant Papers == | == Relevant Papers == |
Latest revision as of 02:43, 4 February 2011
This is one of the datasets discussed in Social Media Analysis 10-802 in Spring 2010.
- #authors = 315688
- #papers = 471514
- Every row represents an author, every column represents a paper.
- Elements of bipartite graph are either 0 or 1.
- M(i,j) = 1 : represents ith author wrote jth paper.
- On average, every author has 3 papers and every paper has 2 authors.
- The distribution is very skewed as most of the authors have only 1 paper.