Schubert variety of best fit with applications and across domains sparse feature extraction
dc.contributor.author | Karimov, Karim, author | |
dc.contributor.author | Kirby, Michael, advisor | |
dc.contributor.author | Peterson, Chris, committee member | |
dc.contributor.author | Anderson, Charles, committee member | |
dc.contributor.author | Adams, Henry, committee member | |
dc.date.accessioned | 2024-12-23T12:00:21Z | |
dc.date.available | 2024-12-23T12:00:21Z | |
dc.date.issued | 2024 | |
dc.description.abstract | This dissertation presents two novel approaches in applied mathematics for data analysis and feature selection, addressing challenges in both geometric data representation and multi-domain biological data interpretation. The first part introduces the Schubert Variety of Best Fit (SVBF) as a new geometric framework for analyzing sets of datasets. Leveraging the structure of Grassmann manifolds and Schubert varieties, we develop the SVBF-Node, a computational unit for solving related optimization problems. We demonstrate the efficacy of this approach through three classification algorithms and a new clustering method, SVBF-LBG. These techniques are evaluated on various datasets, including synthetic data, image sets, video sequences, and hyperspectral remote sensing data, showing improved performance over existing similar methods, particularly for complex, high-dimensional data. The second part proposes a multi-domain, multi-task (MDMT) architecture for feature selection in biological data. This method integrates multi-domain learning with masked feature selection, specifically applied to gene expression data from multiple tissues. We demonstrate its ability to identify novel biomarkers in host immune responses to infection, which are not detectable through single-domain analyses. The approach is validated using bulk RNA sequences from different tissues, revealing its potential to uncover cross-domain biological insights. Both contributions offer interpretable, mathematically grounded approaches to data analysis, providing new tools for researchers in applied mathematics, machine learning, and bioinformatics. | |
dc.format.medium | born digital | |
dc.format.medium | doctoral dissertations | |
dc.identifier | karimov_colostate_0053A_18705.pdf | |
dc.identifier.uri | https://hdl.handle.net/10217/239879 | |
dc.language | English | |
dc.language.iso | eng | |
dc.publisher | Colorado State University. Libraries | |
dc.relation.ispartof | 2020- | |
dc.rights | Copyright and other restrictions may apply. User is responsible for compliance with all applicable laws. For information about copyright law, please see https://libguides.colostate.edu/copyright. | |
dc.subject | geometrical machine learning | |
dc.subject | Grassmann manifold | |
dc.subject | subspace analysis | |
dc.subject | GPU parallel | |
dc.subject | geometrical data science | |
dc.subject | Schubert variety | |
dc.title | Schubert variety of best fit with applications and across domains sparse feature extraction | |
dc.type | Text | |
dcterms.rights.dpla | This Item is protected by copyright and/or related rights (https://rightsstatements.org/vocab/InC/1.0/). You are free to use this Item in any way that is permitted by the copyright and related rights legislation that applies to your use. For other uses you need to obtain permission from the rights-holder(s). | |
thesis.degree.discipline | Mathematics | |
thesis.degree.grantor | Colorado State University | |
thesis.degree.level | Doctoral | |
thesis.degree.name | Doctor of Philosophy (Ph.D.) |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- karimov_colostate_0053A_18705.pdf
- Size:
- 7.88 MB
- Format:
- Adobe Portable Document Format