Difference between revisions of "Metaphor Detection in Different Topics"
From Cohen Courses
Line 18:
  == Data ==
− * [http://u.cs.biu.ac.il/~koppel/BlogCorpus.htm The Blog Authorship Corpus]
−
− The Blog Authorship Corpus consists of the collected posts of 19,320 bloggers gathered from blogger.com in August 2004. The corpus incorporates a total of 681,288 posts and over 140 million words - or approximately 35 posts and 7250 words per person.
+ * [http://u.cs.biu.ac.il/~koppel/BlogCorpus.htm The Blog Authorship Corpus] <br> The Blog Authorship Corpus consists of the collected posts of 19,320 bloggers gathered from blogger.com in August 2004. The corpus incorporates a total of 681,288 posts and over 140 million words - or approximately 35 posts and 7250 words per person.
  * [http://www.ark.cs.cmu.edu/blog-data/ http://www.ark.cs.cmu.edu/blog-data/]
Revision as of 22:42, 8 October 2012
Contents
Team Members
Project Title
Metaphor Detection in Different Topics
Project Abstract
Task
Data
- The Blog Authorship Corpus
The Blog Authorship Corpus consists of the collected posts of 19,320 bloggers, gathered from blogger.com in August 2004. The corpus contains 681,288 posts and over 140 million words in total, or approximately 35 posts and 7,250 words per blogger.
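The per-blogger averages quoted above can be checked from the stated totals. The following is a minimal sketch, not part of the project itself. The variable names are illustrative, and the word count uses 140 million as a lower bound because the source says only "over 140 million", so the words-per-blogger figure is approximate.

```python
# Corpus totals as stated in the description above.
num_bloggers = 19_320
num_posts = 681_288
# Assumption: the source says "over 140 million" words, so this is a lower bound.
num_words_lower_bound = 140_000_000

# Average number of posts and words per blogger.
posts_per_blogger = num_posts / num_bloggers
words_per_blogger = num_words_lower_bound / num_bloggers

print(f"posts per blogger: {posts_per_blogger:.1f}")
print(f"words per blogger (lower bound): {words_per_blogger:.0f}")
```

Both values agree with the rounded figures in the corpus description: about 35 posts and roughly 7,250 words per blogger.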