Algorithm parallelism for improved extractive summarization

Villanueva, Arturo N., Jr., author; Simske, Steven J., author; ACM, publisher

Algorithm parallelism for improved extractive summarization

Files

FACF_ACMOA_3573128.3609350.pdf (474.26 KB)

Date

2023-08-22

Authors

Villanueva, Arturo N., Jr., author

Simske, Steven J., author

ACM, publisher

Abstract

While much work on abstractive summarization has been conducted in recent years, including state-of-the-art summarizations from GPT-4, extractive summarization's lossless nature continues to provide advantages, preserving the style and often key phrases of the original text as meant by the author. Libraries for extractive summarization abound, with a wide range of efficacy. Some do not perform much better or perform even worse than random sampling of sentences extracted from the original text. This study breathes new life to using classical algorithms by proposing parallelism through an implementation of a second order meta-algorithm in the form of the Tessellation and Recombination with Expert Decisioner (T&R) pattern, taking advantage of the abundance of already-existing algorithms and dissociating their individual performance from the implementer's biases. Resulting summaries obtained using T&R are better than any of the component algorithms.

Subject

natural language processing

extractive summarization

meta-algorithmics

machine learning

document summarization

tessellation and recombination

URI

https://hdl.handle.net/10217/239512

Collections

Publications

Full item page

Algorithm parallelism for improved extractive summarization

Files

Date

Authors

Journal Title

Journal ISSN

Volume Title

Abstract

Description

Rights Access

Subject

Citation

URI

Associated Publications

Collections