A survey of text summarization techniques

Ani Nenkova; Kathleen McKeown

Book Chapter

A survey of text summarization techniques

Springer US, (2012), 43-76

DOI: 10.1007/978-1-4614-3223-4_3

458Citations

366Readers

Get full text

Abstract

Numerous approaches for identifying important content for automatic text summarization have been developed to date. Topic representation approaches first derive an intermediate representation of the text that captures the topics discussed in the input. Based on these representations of topics, sentences in the input document are scored for importance. In contrast, in indicator representation approaches, the text is represented by a diverse set of possible indicators of importance which do not aim at discovering topicality. These indicators are combined, very often using machine learning techniques, to score the importance of each sentence. Finally, a summary is produced by selecting sentences in a greedy approach, choosing the sentences that will go in the summary one by one, or globally optimizing the selection, choosing the best set of sentences to form a summary. In this chapter we give a broad overview of existing approaches based on these distinctions, with particular attention on how representation, sentence scoring or summary selection strategies alter the overall performance of the summarizer. We also point out some of the peculiarities of the task of summarization which have posed challenges to machine learning approaches for the problem, and some of the suggested solutions.

Author supplied keywords

References Powered by Scopus

View more at Scopus

Cited by Powered by Scopus

View more at Scopus

Cite

CITATION STYLE

APA

Nenkova, A., & McKeown, K. (2012). A survey of text summarization techniques. In Mining Text Data (Vol. 9781461432234, pp. 43–76). Springer US. https://doi.org/10.1007/978-1-4614-3223-4_3

Readers' Seniority

PhD / Post grad / Masters / Doc 165

73%

Professor / Associate Prof. 24

11%

Researcher 21

Lecturer / Post doc 17

Readers' Discipline

Computer Science 188

80%

Engineering 23

10%

Business, Management and Accounting 15

Social Sciences 10

Article Metrics

Social Media

Shares, Likes & Comments: 2

View details >

A survey of text summarization techniques

Abstract

Author supplied keywords

References Powered by Scopus

Indexing by latent semantic analysis

Term-weighting approaches in automatic text retrieval

A statistical interpretation of term specificity and its application in retrieval

Cited by Powered by Scopus

Automatic text summarization: A comprehensive survey

Assessing sentence scoring techniques for extractive text summarization

The emotional arcs of stories are dominated by six basic shapes

Register to see more suggestions

Cite

Readers' Seniority

Readers' Discipline

Article Metrics