Difference between revisions of "BC3 Corpus"

From Cohen Courses
 

Latest revision as of 01:49, 2 November 2011

The BC3 corpus consists of 40 email threads (3222 sentences) drawn from the W3C Email Corpus. Each thread has been annotated by three different annotators. The annotation consists of the following:
Extractive Summaries
Abstractive Summaries with linked sentences
Labeled Sentences with the following labels:
    Speech Acts: Propose, Request, Commit, Meeting
    Meta Sentences
    Subjectivity
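
The annotation layers listed above can be pictured as a small in-memory data model. The following is only a sketch under stated assumptions: the class names, field names, and sentence-ID format are hypothetical and do not reflect the corpus's actual file format, which is documented on the BC3 site. It shows one way to hold three annotators' extractive summaries, linked abstractive summaries, and sentence labels for a thread, and to count how many annotators selected each sentence.

```python
from dataclasses import dataclass, field
from enum import Enum
from typing import Dict, List, Set


class SpeechAct(Enum):
    # The four speech-act labels named in the corpus description.
    PROPOSE = "Propose"
    REQUEST = "Request"
    COMMIT = "Commit"
    MEETING = "Meeting"


@dataclass
class AbstractSentence:
    # One sentence of an abstractive summary, linked to thread sentences.
    text: str
    linked_sentence_ids: List[str] = field(default_factory=list)


@dataclass
class Annotation:
    # Everything one annotator produced for one thread (hypothetical layout).
    annotator: str
    extractive_ids: List[str] = field(default_factory=list)
    abstract: List[AbstractSentence] = field(default_factory=list)
    speech_acts: Dict[str, Set[SpeechAct]] = field(default_factory=dict)
    meta_ids: Set[str] = field(default_factory=set)
    subjective_ids: Set[str] = field(default_factory=set)


@dataclass
class Thread:
    # sentences maps a sentence ID (assumed format) to its text.
    thread_id: str
    sentences: Dict[str, str]
    annotations: List[Annotation] = field(default_factory=list)


def extractive_votes(thread: Thread) -> Dict[str, int]:
    """Count how many annotators put each sentence in their extractive summary."""
    votes = {sid: 0 for sid in thread.sentences}
    for ann in thread.annotations:
        # set() guards against an ID listed twice by the same annotator.
        for sid in set(ann.extractive_ids):
            if sid in votes:
                votes[sid] += 1
    return votes


if __name__ == "__main__":
    # Invented toy data, not taken from the corpus.
    demo = Thread("demo", {"1.1": "Can we meet Friday?", "1.2": "Thanks."})
    demo.annotations.append(Annotation("A1", extractive_ids=["1.1"]))
    demo.annotations.append(Annotation("A2", extractive_ids=["1.1", "1.2"]))
    print(extractive_votes(demo))
```

Keeping each annotator's output in a separate Annotation object, rather than merging labels per sentence, preserves disagreement between the three annotators so that agreement-based measures can be computed later.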

(Text taken from: http://www.cs.ubc.ca/nest/lci/bc3.html; more information can be obtained from this site)