TY - GEN
T1 - Combining graph degeneracy and submodularity for unsupervised extractive summarization
AU - Tixier, Antoine J.P.
AU - Meladianos, Polykarpos
AU - Vazirgiannis, Michalis
N1 - Publisher Copyright:
© EMNLP 2017.All right reserved.
PY - 2017/1/1
Y1 - 2017/1/1
N2 - We present a fully unsupervised, extractive text summarization system that leverages a submodularity framework introduced by past research. The framework allows summaries to be generated in a greedy way while preserving near-optimal performance guarantees. Our main contribution is the novel coverage reward term of the objective function optimized by the greedy algorithm. This component builds on the graph-of-words representation of text and the k-core decomposition algorithm to assign meaningful scores to words. We evaluate our approach on the AMI and ICSI meeting speech corpora, and on the DUC2001 news corpus. We reach state-of-the-art performance on all datasets. Results indicate that our method is particularly well-suited to the meeting domain.
AB - We present a fully unsupervised, extractive text summarization system that leverages a submodularity framework introduced by past research. The framework allows summaries to be generated in a greedy way while preserving near-optimal performance guarantees. Our main contribution is the novel coverage reward term of the objective function optimized by the greedy algorithm. This component builds on the graph-of-words representation of text and the k-core decomposition algorithm to assign meaningful scores to words. We evaluate our approach on the AMI and ICSI meeting speech corpora, and on the DUC2001 news corpus. We reach state-of-the-art performance on all datasets. Results indicate that our method is particularly well-suited to the meeting domain.
U2 - 10.18653/v1/w17-4507
DO - 10.18653/v1/w17-4507
M3 - Conference contribution
AN - SCOPUS:85059905096
T3 - EMNLP 2017 - Workshop on New Frontiers in Summarization, NFiS 2017 - Workshop Proceedings
SP - 48
EP - 58
BT - EMNLP 2017 - Workshop on New Frontiers in Summarization, NFiS 2017 - Workshop Proceedings
PB - Association for Computational Linguistics (ACL)
T2 - EMNLP 2017 Workshop on New Frontiers in Summarization, NFiS 2017
Y2 - 7 September 2017
ER -