W3C Email Corpus

From Cohen Courses
Revision as of 01:43, 2 November 2011 by Manajs (talk | contribs)
Jump to navigationJump to search

The W3C mailing list corpus was crawled by Microsoft Research in 2004.
More information can be read from http://research.microsoft.com/en-us/um/people/nickcr/w3c-summary.html.
A set of parsed emails can be downloaded from http://tides.umiacs.umd.edu/webtrec/trecent/parsed_w3c_corpus.html