Gigaword corpus

From Cohen Courses
Revision as of 00:08, 29 November 2011 by Dwijaya (talk | contribs) (Created page with 'The Gigaword [[Category::Dataset|corpus]] is a corpus of newswire text from 1994-2004 in which each text is tagged with the document creation time. The dataset is available at…')
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to navigationJump to search

The Gigaword corpus is a corpus of newswire text from 1994-2004 in which each text is tagged with the document creation time.

The dataset is available at the [1].

Relevant Papers