Difference between revisions of "Topic Segmentation"
From Cohen Courses
(Created page with 'Topic segmentation is the process of dividing written text into meaningful topics. In corpora such as transcripts of streaming audio, this task is non trivial as the corpora woul…')
Line 2:
  [http://en.wikipedia.org/wiki/Text_segmentation External Link]
+
+ {{#ask: [[UsesMethod::topic segmentation]]
+ | ?AddressesProblem
+ | ?UsesDataset
+ }}
Revision as of 01:14, 27 March 2011
Topic segmentation is the process of dividing written text into segments that each cover a meaningful topic. In corpora such as transcripts of streaming audio, this task is nontrivial because the corpora have no explicit representation of a document, or even clear demarcations of where document breaks occur. Furthermore, a single document may contain multiple topics, so the task of computerized text segmentation may be to discover these topics automatically and segment the text accordingly.
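
To make the task concrete, the following is a minimal sketch of one simple way to place topic boundaries in an unsegmented stream of sentences. It is not taken from this page: it assumes a lexical-cohesion heuristic (loosely in the spirit of TextTiling) in which a boundary is placed wherever the word overlap between the sentences before and after a gap is low. The function names `bag_of_words`, `cosine`, and `segment`, and the `window` and `threshold` parameters, are hypothetical choices made for illustration.

```python
import math
import re
from collections import Counter


def bag_of_words(sentences):
    """Count lowercase word tokens across a list of sentences."""
    counts = Counter()
    for s in sentences:
        counts.update(re.findall(r"[a-z']+", s.lower()))
    return counts


def cosine(a, b):
    """Cosine similarity between two Counters; 0.0 if either is empty."""
    dot = sum(a[w] * b[w] for w in a if w in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def segment(sentences, window=2, threshold=0.1):
    """Group sentences into segments (illustrative heuristic only).

    A new segment starts before sentence i when the cosine similarity
    between the `window` sentences before the gap and the `window`
    sentences after it falls below `threshold`. Both values are
    assumed defaults, not tuned settings.
    """
    segments, current = [], []
    for i, sentence in enumerate(sentences):
        if i > 0:
            left = bag_of_words(sentences[max(0, i - window):i])
            right = bag_of_words(sentences[i:i + window])
            if cosine(left, right) < threshold and current:
                segments.append(current)
                current = []
        current.append(sentence)
    if current:
        segments.append(current)
    return segments
```

In this sketch, shared function words such as "the" raise the similarity between unrelated passages, so a practical system would likely remove stop words and smooth the similarity scores before choosing boundaries.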