ArXiv Comprehensive
The affiliation graphs are built from arXiv data covering April 1992 to March 2002, for the five largest arXiv categories (ASTRO–PH, HEP–TH, HEP–PH, COND–MAT and GR–QC). Each graph is bipartite: nodes are authors and papers, and an edge links an author to a paper they wrote. The smallest graph (GR–QC) has 19,309 nodes (5,855 authors, 13,454 papers) and 26,169 edges. ASTRO–PH is the largest, with 57,381 nodes (19,393 authors, 37,988 papers) and 133,170 edges. That works out to 6.87 papers per author (133,170 edges over 19,393 authors), or about 3.51 authors per paper (133,170 edges over 37,988 papers); most of the other categories have similarly high numbers of papers per author.
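The node and edge counts quoted above can be cross-checked with a few lines of arithmetic. The sketch below is illustrative only: the `AffiliationGraph` class and its property names are hypothetical and not part of the dataset, and only the two graphs whose counts are stated here are included. It assumes each edge joins one author to one paper, so edges divided by papers gives authors per paper and edges divided by authors gives papers per author. Under that assumption, the 6.87 figure for ASTRO–PH corresponds to edges divided by authors.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class AffiliationGraph:
    """Hypothetical container for the counts quoted in the description."""
    name: str
    authors: int
    papers: int
    edges: int  # assumed: one edge per (author, paper) authorship pair

    @property
    def nodes(self) -> int:
        # A bipartite affiliation graph has one node per author and per paper.
        return self.authors + self.papers

    @property
    def authors_per_paper(self) -> float:
        # Mean degree on the paper side.
        return self.edges / self.papers

    @property
    def papers_per_author(self) -> float:
        # Mean degree on the author side.
        return self.edges / self.authors


# Counts copied from the description; the other three categories are not listed.
GRAPHS = [
    AffiliationGraph("GR-QC", authors=5_855, papers=13_454, edges=26_169),
    AffiliationGraph("ASTRO-PH", authors=19_393, papers=37_988, edges=133_170),
]

for g in GRAPHS:
    print(
        f"{g.name}: {g.nodes} nodes, "
        f"{g.authors_per_paper:.2f} authors/paper, "
        f"{g.papers_per_author:.2f} papers/author"
    )
```

The node totals computed this way match the stated 19,309 and 57,381, which is a quick sanity check that the author and paper counts were transcribed consistently.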
Used as a part of the following paper:
Citation
Leskovec, J., Kleinberg, J., and Faloutsos, C. 2005. Graphs over time: densification laws, shrinking diameters and possible explanations. Proceedings of the eleventh ACM SIGKDD international conference on Knowledge discovery in data mining (KDD’05), 177–187.