Difference between revisions of "Topic Segmentation"
From Cohen Courses
(3 intermediate revisions by the same user not shown)
Line 3:
  [http://en.wikipedia.org/wiki/Text_segmentation External Link]
− {{#ask: [[
− | ?
+ == Relevant Papers ==
+
+ {{#ask: [[AddressesProblem::Topic Segmentation]]
+ | ?UsesMethod
  | ?UsesDataset
  }}
Latest revision as of 01:20, 27 March 2011
Topic segmentation is the process of dividing written text into meaningful, topically coherent segments. In corpora such as transcripts of streaming audio, this task is nontrivial because the corpora have no explicit representation of a document, or even clear demarcations of where document breaks occur. Furthermore, a single document may contain multiple topics, in which case the task of automatic text segmentation is to discover these topics and segment the text accordingly.
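
As a rough illustration of the task described above, the following Python sketch places a segment boundary wherever the vocabulary of the preceding sentences and the following sentences barely overlaps. This is only one simple lexical-cohesion heuristic, not a method prescribed by this page; the function names and the `window` and `threshold` parameters are hypothetical choices made for the example, and it assumes the input has already been split into sentences.

```python
import math
import re
from collections import Counter


def bag_of_words(sentences):
    """Count lowercase word tokens across a list of sentences."""
    counts = Counter()
    for sentence in sentences:
        counts.update(re.findall(r"[a-z']+", sentence.lower()))
    return counts


def cosine(a, b):
    """Cosine similarity between two word-count vectors (0.0 if either is empty)."""
    dot = sum(a[w] * b[w] for w in a if w in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def segment(sentences, window=2, threshold=0.1):
    """Group sentences into segments (hypothetical parameters, chosen for illustration).

    Before each sentence i, compare the `window` sentences to its left with the
    `window` sentences starting at i; start a new segment when their cosine
    similarity falls below `threshold`.
    """
    segments, current = [], []
    for i, sentence in enumerate(sentences):
        if current:
            left = bag_of_words(sentences[max(0, i - window):i])
            right = bag_of_words(sentences[i:i + window])
            if cosine(left, right) < threshold:
                segments.append(current)
                current = []
        current.append(sentence)
    if current:
        segments.append(current)
    return segments


if __name__ == "__main__":
    text = [
        "The cat sat on the mat.",
        "The cat chased the mouse.",
        "Stocks fell sharply today.",
        "Investors sold stocks quickly.",
    ]
    for number, seg in enumerate(segment(text), start=1):
        print(number, seg)
```

The sketch does no stop-word removal or stemming, so on real transcripts the threshold would need tuning and common function words would likely have to be filtered out first.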