Difference between revisions of "Topic Detection and Tracking"
m (Created page with 'This [[Category::Dataset|dataset]] is used for the Topic Detection and Tracking task hosted by NIST [http://www.itl.nist.gov/iad/mig/tests/tdt/]. This dataset contain == Releva…') |
m |
||
(7 intermediate revisions by the same user not shown) | |||
Line 1: | Line 1: | ||
This [[Category::Dataset|dataset]] is used for the Topic Detection and Tracking task hosted by NIST [http://www.itl.nist.gov/iad/mig/tests/tdt/]. | This [[Category::Dataset|dataset]] is used for the Topic Detection and Tracking task hosted by NIST [http://www.itl.nist.gov/iad/mig/tests/tdt/]. | ||
− | This dataset | + | Annotation guidelines are available [http://projects.ldc.upenn.edu/TDT5/Annotation/TDT2004V1.2.pdf here]. |
+ | |||
+ | This dataset contains 407,505 news articles in Arabic, Mandarin and English. The news articles are annotated for topics, events and activities. | ||
+ | |||
+ | From the annotation guidelines: | ||
+ | <blockquote>''A TDT event is defined as a particular thing that happens at a specific time and place, along with all necessary preconditions and unavoidable consequences. A TDT event might be a particular plane crash, or a single meeting, or a particular court hearing. An activity is a connected set of events that have a common focus or purpose, happening at a specific place and time; for instance, a campaign, or an investigation, or a disaster relief effort. For the purposes of TDT, a topic is defined as an event or activity, along with all directly related events and activities.''</blockquote> | ||
== Relevant Papers == | == Relevant Papers == |
Latest revision as of 19:20, 26 September 2012
This dataset is used for the Topic Detection and Tracking task hosted by NIST [1].
Annotation guidelines are available here.
This dataset contains 407,505 news articles in Arabic, Mandarin and English. The news articles are annotated for topics, events and activities.
From the annotation guidelines:
A TDT event is defined as a particular thing that happens at a specific time and place, along with all necessary preconditions and unavoidable consequences. A TDT event might be a particular plane crash, or a single meeting, or a particular court hearing. An activity is a connected set of events that have a common focus or purpose, happening at a specific place and time; for instance, a campaign, or an investigation, or a disaster relief effort. For the purposes of TDT, a topic is defined as an event or activity, along with all directly related events and activities.