Repository logo
 

Schubert variety of best fit with applications and across domains sparse feature extraction

dc.contributor.authorKarimov, Karim, author
dc.contributor.authorKirby, Michael, advisor
dc.contributor.authorPeterson, Chris, committee member
dc.contributor.authorAnderson, Charles, committee member
dc.contributor.authorAdams, Henry, committee member
dc.date.accessioned2024-12-23T12:00:21Z
dc.date.available2024-12-23T12:00:21Z
dc.date.issued2024
dc.description.abstractThis dissertation presents two novel approaches in applied mathematics for data analysis and feature selection, addressing challenges in both geometric data representation and multi-domain biological data interpretation. The first part introduces the Schubert Variety of Best Fit (SVBF) as a new geometric framework for analyzing sets of datasets. Leveraging the structure of Grassmann manifolds and Schubert varieties, we develop the SVBF-Node, a computational unit for solving related optimization problems. We demonstrate the efficacy of this approach through three classification algorithms and a new clustering method, SVBF-LBG. These techniques are evaluated on various datasets, including synthetic data, image sets, video sequences, and hyperspectral remote sensing data, showing improved performance over existing similar methods, particularly for complex, high-dimensional data. The second part proposes a multi-domain, multi-task (MDMT) architecture for feature selection in biological data. This method integrates multi-domain learning with masked feature selection, specifically applied to gene expression data from multiple tissues. We demonstrate its ability to identify novel biomarkers in host immune responses to infection, which are not detectable through single-domain analyses. The approach is validated using bulk RNA sequences from different tissues, revealing its potential to uncover cross-domain biological insights. Both contributions offer interpretable, mathematically grounded approaches to data analysis, providing new tools for researchers in applied mathematics, machine learning, and bioinformatics.
dc.format.mediumborn digital
dc.format.mediumdoctoral dissertations
dc.identifierkarimov_colostate_0053A_18705.pdf
dc.identifier.urihttps://hdl.handle.net/10217/239879
dc.languageEnglish
dc.language.isoeng
dc.publisherColorado State University. Libraries
dc.relation.ispartof2020-
dc.rightsCopyright and other restrictions may apply. User is responsible for compliance with all applicable laws. For information about copyright law, please see https://libguides.colostate.edu/copyright.
dc.subjectgeometrical machine learning
dc.subjectGrassmann manifold
dc.subjectsubspace analysis
dc.subjectGPU parallel
dc.subjectgeometrical data science
dc.subjectSchubert variety
dc.titleSchubert variety of best fit with applications and across domains sparse feature extraction
dc.typeText
dcterms.rights.dplaThis Item is protected by copyright and/or related rights (https://rightsstatements.org/vocab/InC/1.0/). You are free to use this Item in any way that is permitted by the copyright and related rights legislation that applies to your use. For other uses you need to obtain permission from the rights-holder(s).
thesis.degree.disciplineMathematics
thesis.degree.grantorColorado State University
thesis.degree.levelDoctoral
thesis.degree.nameDoctor of Philosophy (Ph.D.)

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
karimov_colostate_0053A_18705.pdf
Size:
7.88 MB
Format:
Adobe Portable Document Format