BC3 Corpus

From Cohen Courses
Revision as of 01:49, 2 November 2011 by Manajs (talk | contribs)
Jump to navigationJump to search

The BC3 corpus consists of 40 email threads / 3222 sentences from the W3C email corpus. Each thread has been annotated by three different annotators. The annotation consists of the following:
Extractive Summaries
Abstractive Summaries with linked sentences
Labeled Sentences with the following labels
Speech Acts: Propose, Request, Commit, Meeting
Meta Sentences
Subjectivity

(Text taken from: http://www.cs.ubc.ca/nest/lci/bc3.html; more information can be obtained from this site)