Author-Paper dataset

From Cohen Courses
Latest revision as of 02:43, 4 February 2011

This is one of the datasets discussed in Social Media Analysis 10-802 in Spring 2010.

  • #authors = 315688
  • #papers = 471514
  • Every row represents an author, every column represents a paper.
  • The data form the adjacency matrix M of a bipartite author-paper graph; every element is either 0 or 1.
  • M(i,j) = 1 means that the ith author wrote the jth paper; otherwise M(i,j) = 0.
  • On average, each author has about 3 papers and each paper has about 2 authors.
  • The distribution of papers per author is very skewed: most authors have only 1 paper.
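
The summary statistics above can be recomputed from the matrix itself. The following is a minimal sketch, assuming the matrix is available as a list of (author index, paper index) pairs, one pair per nonzero entry; the page does not specify the file format, so the function name `bipartite_stats` and the toy data are hypothetical. Both averages come from the same count of nonzero entries, which is why #authors × (papers per author) and #papers × (authors per paper) should roughly agree (315688 × 3 ≈ 471514 × 2).

```python
from collections import Counter


def bipartite_stats(edges):
    """Summarize a 0/1 author-paper matrix given as (author, paper) pairs.

    Each pair (i, j) stands for M(i, j) = 1; every other entry is 0.
    Assumption: authors or papers with no nonzero entry are not counted.
    """
    edges = set(edges)  # a repeated pair does not change a 0/1 matrix
    papers_per_author = Counter(i for i, _ in edges)
    authors_per_paper = Counter(j for _, j in edges)
    n_authors = len(papers_per_author)
    n_papers = len(authors_per_paper)
    single = sum(1 for c in papers_per_author.values() if c == 1)
    return {
        "n_authors": n_authors,
        "n_papers": n_papers,
        "nonzeros": len(edges),
        "avg_papers_per_author": len(edges) / n_authors,
        "avg_authors_per_paper": len(edges) / n_papers,
        "share_single_paper_authors": single / n_authors,
    }


# Hypothetical toy data, not taken from the real dataset:
# author 0 wrote four papers, authors 1-3 wrote one paper each.
toy_edges = [(0, 0), (0, 1), (0, 2), (0, 3), (1, 0), (2, 1), (3, 2)]
stats = bipartite_stats(toy_edges)
print(stats)
```

In the toy data a single prolific author pulls the mean up to 1.75 papers per author even though three of the four authors have only one paper, which is the same kind of skew the list describes.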

Relevant Papers