A survey of text summarization techniques

Abstract

Numerous approaches for identifying important content for automatic text summarization have been developed to date. Topic representation approaches first derive an intermediate representation of the text that captures the topics discussed in the input. Based on these topic representations, sentences in the input document are scored for importance. In contrast, indicator representation approaches represent the text by a diverse set of possible indicators of importance that do not aim at discovering topicality. These indicators are combined, very often using machine learning techniques, to score the importance of each sentence. Finally, a summary is produced either greedily, choosing the sentences that will go into the summary one by one, or by globally optimizing the selection, choosing the best set of sentences to form a summary. In this chapter we give a broad overview of existing approaches based on these distinctions, with particular attention to how representation, sentence scoring, and summary selection strategies affect the overall performance of the summarizer. We also point out some of the peculiarities of the summarization task that have posed challenges to machine learning approaches to the problem, and some of the suggested solutions.
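The pipeline described in the abstract (derive a topic representation, score sentences, then select greedily) can be illustrated with a minimal sketch. It is not taken from the chapter: it assumes a simple word-probability topic representation, averages those probabilities to score each sentence, and squares the probabilities of already-covered words to discourage redundancy. The stopword list, the regex-based sentence splitter, and all function names are hypothetical choices made for illustration.

```python
import re
from collections import Counter

# Tiny illustrative stopword list (an assumption, not from the chapter).
STOPWORDS = {"a", "an", "the", "of", "and", "or", "in", "to", "is", "are",
             "for", "on", "that", "this", "with"}


def split_sentences(text):
    """Naive splitter: break after '.', '!' or '?' followed by whitespace."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def tokenize(sentence):
    """Lowercase word tokens with stopwords removed."""
    return [w for w in re.findall(r"[a-z']+", sentence.lower())
            if w not in STOPWORDS]


def word_probabilities(sentences):
    """Topic representation: unigram probability of each content word."""
    counts = Counter(w for s in sentences for w in tokenize(s))
    total = sum(counts.values())
    return {w: c / total for w, c in counts.items()}


def score(sentence, probs):
    """Sentence importance: average probability of its content words."""
    words = tokenize(sentence)
    if not words:
        return 0.0
    return sum(probs.get(w, 0.0) for w in words) / len(words)


def greedy_summary(text, max_sentences=2):
    """Pick sentences one at a time, re-scoring after each pick."""
    sentences = split_sentences(text)
    probs = word_probabilities(sentences)
    chosen = []
    remaining = list(range(len(sentences)))
    while remaining and len(chosen) < max_sentences:
        best = max(remaining, key=lambda i: score(sentences[i], probs))
        chosen.append(best)
        remaining.remove(best)
        # Down-weight words already covered so later picks add new content.
        for w in set(tokenize(sentences[best])):
            probs[w] = probs[w] ** 2
    # Return the selected sentences in their original document order.
    return [sentences[i] for i in sorted(chosen)]


if __name__ == "__main__":
    print(greedy_summary("Cats chase mice. Cats sleep often. Dogs bark.", 2))
    # prints ['Cats chase mice.', 'Dogs bark.']
```

In this toy input the second sentence initially ties with the first, but once "cats" has been covered its weight drops and the sentence about dogs is chosen instead. A globally optimizing selector would replace the `while` loop with a search over whole sets of sentences rather than picking them one by one.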

Citation (APA)

Nenkova, A., & McKeown, K. (2012). A survey of text summarization techniques. In Mining Text Data (Vol. 9781461432234, pp. 43–76). Springer US. https://doi.org/10.1007/978-1-4614-3223-4_3
