Difference between revisions of "Text summarization"

Latest revision as of 14:45, 30 November 2010

Summary

Text Summarization (also known as summarization, and automatic summarization) is a natural language processing problem which focuses on creating shortened versions of texts with computer algorithms/software that retain the important points of the original piece of text.

Common Approaches

Common approaches to text summarization can typically be classified into one of the following categories:

Extraction, extracts most important information (sentences or paragraphs) from original text and copies them to make summary
Abstraction, paraphrases sections in the original text and relies on language generation to make the summaries coherent

Evaluation

One commonly used evaluation metric in summarization is ROUGE, which is used in NIST's Document Understanding Conferences' summarization tasks. It is considered as an Automatic Evaluation Method.

Example Systems

References / Links

A bit outdated website with some references related to text summarization - [1]
Wikipedia article on automatic summarization - [2]

@@ Line 1: / Line 1: @@
 == Summary ==
-Text Summarization (also known as summarization, and automatic summarization) is a natural language processing task which focuses on creating shortened versions of texts with computer algorithms/software that retain the important points of the original piece of text.
+Text Summarization (also known as summarization, and automatic summarization) is a natural language processing [[category::problem]] which focuses on creating shortened versions of texts with computer algorithms/software that retain the important points of the original piece of text.
 == Common Approaches ==
-Common approaches to text summarization can typically be broken down into one of the following categories:
+Common approaches to text summarization can typically be classified into one of the following categories:
 * '''Extraction''', extracts most important information (sentences or paragraphs) from original text and copies them to make summary
 * '''Abstraction''', paraphrases sections in the original text and relies on language generation to make the summaries coherent
-== Challenges / Issues ==
-Some major challenges in text summarization
 == Evaluation ==
-One commonly used  evaluation metric in summarization is ROUGE, which is used in NIST's Document Understanding Conferences summarization tasks.
+One commonly used  evaluation metric in summarization is [[ROUGE]], which is used in NIST's Document Understanding Conferences' summarization tasks. It is considered as an [[Automatic Evaluation Method]].
 == Example Systems ==
@@ Line 26: / Line 22: @@
 * A bit outdated website with some references related to text summarization - [http://www.summarization.com/]
 * Wikipedia article on automatic summarization - [http://en.wikipedia.org/wiki/Automatic_summarization]
+== Relevant Papers ==
+{{#ask: [[AddressesProblem::Text summarization]]
+| ?UsesMethod
+| ?UsesDataset
+}}

Difference between revisions of "Text summarization"

Latest revision as of 14:45, 30 November 2010

Contents

Summary

Common Approaches

Evaluation

Example Systems

References / Links

Relevant Papers

Navigation menu

Page actions

Page actions

Personal tools

Navigation

Search

Tools