BC3 Corpus

From Cohen Courses
Revision as of 02:49, 2 November 2011 by Manajs (talk | contribs)

The BC3 corpus consists of 40 email threads (3222 sentences) from the W3C_Email_Corpus. Each thread has been annotated by three different annotators. The annotation consists of the following:
* Extractive Summaries
* Abstractive Summaries with linked sentences
* Labeled Sentences with the following labels
** Speech Acts: Propose, Request, Commit, Meeting
** Meta Sentences
** Subjectivity
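
As a rough illustration of how the annotation layers listed above fit together, one annotator's output for a thread could be modelled as in the sketch below. The class names, field names, and sentence-id strings are assumptions made for illustration; they do not reflect the corpus's actual file format, which is documented at the site linked below.

```python
from dataclasses import dataclass, field
from enum import Enum


class SpeechAct(Enum):
    # The four speech-act labels named in the corpus description.
    PROPOSE = "Propose"
    REQUEST = "Request"
    COMMIT = "Commit"
    MEETING = "Meeting"


@dataclass
class SentenceLabels:
    # Hypothetical container for the per-sentence labels.
    speech_acts: set = field(default_factory=set)
    is_meta: bool = False
    is_subjective: bool = False


@dataclass
class Annotation:
    # One annotator's work on one thread (field names are assumptions).
    annotator: str
    extractive_summary: list   # ids of the sentences the annotator selected
    abstractive_summary: list  # (summary sentence, [linked sentence ids]) pairs
    labels: dict               # sentence id -> SentenceLabels


def extractive_votes(annotations):
    """Count how many annotators selected each sentence id."""
    votes = {}
    for ann in annotations:
        # set() ensures an annotator is counted at most once per sentence.
        for sentence_id in set(ann.extractive_summary):
            votes[sentence_id] = votes.get(sentence_id, 0) + 1
    return votes
```

Because each thread has three annotators, a helper such as `extractive_votes` gives a simple per-sentence agreement count ranging from 1 to 3 for any sentence selected at least once.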

(Text taken from: http://www.cs.ubc.ca/nest/lci/bc3.html; more information can be obtained from this site)