Model selection based on expected squared Hellinger distance
dc.contributor.author | Cao, Xiaofan, author | |
dc.contributor.author | Iyer, Hariharan K., advisor | |
dc.contributor.author | Wang, Haonan, advisor | |
dc.date.accessioned | 2024-03-13T18:50:56Z | |
dc.date.available | 2024-03-13T18:50:56Z | |
dc.date.issued | 2007 | |
dc.description.abstract | This dissertation is motivated by a general model selection problem such that the true model is unknown and one or more approximating parametric families of models are given along with strategies for estimating the parameters using data. We develop model selection methods based on Hellinger distance that can be applied to a wide range of modeling problems without posing the typical assumptions for the true model to be within the approximating families or to come from a particular parametric family. We propose two estimators for the expected squared Hellinger distance as the model selection criteria. | |
dc.description.abstract | In particular, the use of expected squared Hellinger distance is studied in ANOVA model selection problems where approximating models are typically sub-models of the full factorial model. The properties of the expected squared Hellinger distance are explored under balanced model structure assuming independent and identically distributed normal error terms. A model selection strategy specific to ANOVA model selection problems based on one of the estimated expected squared Hellinger distance is proposed. This strategy is illustrated using a real data set and its performance is tested by simulation studies. An example of ANOVA model selection problem with non-normal error terms that follow two-parameter exponential distribution is discussed. | |
dc.description.abstract | Model selection method based on estimated expected squared Hellinger distance is also applied to modeling the p-values from the microarray data analysis. The problem of estimating false discovery rate (FDR) from the distribution of p-values arising from statistical tests of differential gene expression in a microarray experiment is considered. A finite mixture model is studied in which one component is uniform on [0,1] corresponding to equally expressed genes and one or more additional components correspond to differentially expressed genes. Two different mixture families are explicitly investigated for estimating false discovery rate-a mixture of Beta densities and a mixture of Uniform densities. In both cases, the Minimum Hellinger distance is used to provide robust estimates of the mixture components. For the Bela mixture model we choose the number of Beta components by comparing the estimated expected squared Hellinger distance. The performance of the proposed methods is illustrated through a case study involving data from a published microarray experiment. | |
dc.format.medium | born digital | |
dc.format.medium | doctoral dissertations | |
dc.identifier | ETDF_Cao_2007_3299801.pdf | |
dc.identifier.uri | https://hdl.handle.net/10217/237624 | |
dc.language | English | |
dc.language.iso | eng | |
dc.publisher | Colorado State University. Libraries | |
dc.relation.ispartof | 2000-2019 | |
dc.rights | Copyright and other restrictions may apply. User is responsible for compliance with all applicable laws. For information about copyright law, please see https://libguides.colostate.edu/copyright. | |
dc.rights.license | Per the terms of a contractual agreement, all use of this item is limited to the non-commercial use of Colorado State University and its authorized users. | |
dc.subject | ANOVA | |
dc.subject | false discovery rate | |
dc.subject | Hellinger distance | |
dc.subject | statistics | |
dc.title | Model selection based on expected squared Hellinger distance | |
dc.type | Text | |
dcterms.rights.dpla | This Item is protected by copyright and/or related rights (https://rightsstatements.org/vocab/InC/1.0/). You are free to use this Item in any way that is permitted by the copyright and related rights legislation that applies to your use. For other uses you need to obtain permission from the rights-holder(s). | |
thesis.degree.discipline | Statistics | |
thesis.degree.grantor | Colorado State University | |
thesis.degree.level | Doctoral | |
thesis.degree.name | Doctor of Philosophy (Ph.D.) |
Files
Original bundle
1 - 1 of 1