Optimizing text analytics and document automation with meta-algorithmic systems engineering
dc.contributor.author | Villanueva, Arturo N., Jr., author | |
dc.contributor.author | Simske, Steven J., advisor | |
dc.contributor.author | Hefner, Rick D., committee member | |
dc.contributor.author | Krishnaswamy, Nikhil, committee member | |
dc.contributor.author | Miller, Erika, committee member | |
dc.contributor.author | Roberts, Nicholas, committee member | |
dc.date.accessioned | 2023-08-28T10:29:11Z | |
dc.date.available | 2023-08-28T10:29:11Z | |
dc.date.issued | 2023 | |
dc.description.abstract | Natural language processing (NLP) has seen significant advances in recent years, but challenges remain in making algorithms both efficient and accurate. In this study, we examine three key areas of NLP and explore the potential of meta-algorithmics and functional analysis for improving analytic and machine learning performance and conclude with expansions for future research. The first area focuses on text classification for requirements engineering, where stakeholder requirements must be classified into appropriate categories for further processing. We investigate multiple combinations of algorithms and meta-algorithms to optimize the classification process, confirming the optimality of Naïve Bayes and highlighting a certain sensitivity to the Global Vectors (GloVe) word embeddings algorithm. The second area of focus is extractive summarization, which offers advantages to abstractive summarization due to its lossless nature. We propose a second-order meta-algorithm that uses existing algorithms and selects appropriate combinations to generate more effective summaries than any individual algorithm. The third area covers document ordering, where we propose techniques for generating an optimal reading order for use in learning, training, and content sequencing. We propose two main methods: one using document similarities and the other using entropy against topics generated through Latent Dirichlet Allocation (LDA). | |
dc.format.medium | born digital | |
dc.format.medium | doctoral dissertations | |
dc.identifier | VillanuevaJr_colostate_0053A_17799.pdf | |
dc.identifier.uri | https://hdl.handle.net/10217/236997 | |
dc.language | English | |
dc.language.iso | eng | |
dc.publisher | Colorado State University. Libraries | |
dc.relation.ispartof | 2020- | |
dc.rights | Copyright and other restrictions may apply. User is responsible for compliance with all applicable laws. For information about copyright law, please see https://libguides.colostate.edu/copyright. | |
dc.subject | extractive summarization | |
dc.subject | meta-algorithmics | |
dc.subject | text classification | |
dc.subject | functional analysis | |
dc.subject | document ordering | |
dc.subject | natural language processing | |
dc.title | Optimizing text analytics and document automation with meta-algorithmic systems engineering | |
dc.type | Text | |
dcterms.rights.dpla | This Item is protected by copyright and/or related rights (https://rightsstatements.org/vocab/InC/1.0/). You are free to use this Item in any way that is permitted by the copyright and related rights legislation that applies to your use. For other uses you need to obtain permission from the rights-holder(s). | |
thesis.degree.discipline | Systems Engineering | |
thesis.degree.grantor | Colorado State University | |
thesis.degree.level | Doctoral | |
thesis.degree.name | Doctor of Engineering (Eng.D.) |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- VillanuevaJr_colostate_0053A_17799.pdf
- Size:
- 2.11 MB
- Format:
- Adobe Portable Document Format