Combining graph degeneracy and submodularity for unsupervised extractive summarization

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

We present a fully unsupervised, extractive text summarization system that leverages a submodularity framework introduced by past research. The framework allows summaries to be generated in a greedy way while preserving near-optimal performance guarantees. Our main contribution is the novel coverage reward term of the objective function optimized by the greedy algorithm. This component builds on the graph-of-words representation of text and the k-core decomposition algorithm to assign meaningful scores to words. We evaluate our approach on the AMI and ICSI meeting speech corpora, and on the DUC2001 news corpus. We reach state-of-the-art performance on all datasets. Results indicate that our method is particularly well-suited to the meeting domain.

Original languageEnglish
Title of host publicationEMNLP 2017 - Workshop on New Frontiers in Summarization, NFiS 2017 - Workshop Proceedings
PublisherAssociation for Computational Linguistics (ACL)
Pages48-58
Number of pages11
ISBN (Electronic)9781945626890
DOIs
Publication statusPublished - 1 Jan 2017
EventEMNLP 2017 Workshop on New Frontiers in Summarization, NFiS 2017 - Copenhagen, Denmark
Duration: 7 Sept 2017 → …

Publication series

NameEMNLP 2017 - Workshop on New Frontiers in Summarization, NFiS 2017 - Workshop Proceedings

Conference

ConferenceEMNLP 2017 Workshop on New Frontiers in Summarization, NFiS 2017
Country/TerritoryDenmark
CityCopenhagen
Period7/09/17 → …

Fingerprint

Dive into the research topics of 'Combining graph degeneracy and submodularity for unsupervised extractive summarization'. Together they form a unique fingerprint.

Cite this