Difference between revisions of "Cora network"

From Cohen Courses
Jump to navigationJump to search
 
m (1 revision)
 
(No difference)

Latest revision as of 11:42, 3 September 2010

The cora dataset is maintained by Andrew McCallum, and there are multiple versions, for different research problems like information extraction, correference resolution, and classification using network information. The network data set is described here.

The cora network consists of around 37000 papers and 715000 citations between them. Each paper also has a research-area classification label associated with it. Different subsets of this data have been used for different papers.

External Link