Accurate prediction of protein function using GOstruct
dc.contributor.author | Sokolov, Artem, author | |
dc.contributor.author | Ben-Hur, Asa, advisor | |
dc.contributor.author | Anderson, Chuck, committee member | |
dc.contributor.author | McConnell, Ross M., committee member | |
dc.contributor.author | Wang, Haonan, committee member | |
dc.date.accessioned | 2007-01-03T05:49:30Z | |
dc.date.available | 2007-01-03T05:49:30Z | |
dc.date.issued | 2011 | |
dc.description.abstract | With the growing number of sequenced genomes, automatic prediction of protein function is one of the central problems in computational biology. Traditional methods employ transfer of functional annotation on the basis of sequence or structural similarity and are unable to effectively deal with today's noisy high-throughput biological data. Most of the approaches based on machine learning, on the other hand, break the problem up into a collection of binary classification problems, effectively asking the question ''does this protein perform this particular function?''; such methods often produce a set of predictions that are inconsistent with each other. In this work, we present GOstruct, a structured-output framework that answers the question ''what function does this protein perform?'' in the context of hierarchical multilabel classification. We show that GOstruct is able to effectively deal with a large number of disparate data sources from multiple species. Our empirical results demonstrate that the framework achieves state-of-the-art accuracy in two of the recent challenges in automatic function prediction: Mousefunc and CAFA. | |
dc.format.medium | born digital | |
dc.format.medium | doctoral dissertations | |
dc.identifier | Sokolov_colostate_0053A_10688.pdf | |
dc.identifier.uri | http://hdl.handle.net/10217/52071 | |
dc.language | English | |
dc.language.iso | eng | |
dc.publisher | Colorado State University. Libraries | |
dc.relation.ispartof | 2000-2019 | |
dc.rights | Copyright and other restrictions may apply. User is responsible for compliance with all applicable laws. For information about copyright law, please see https://libguides.colostate.edu/copyright. | |
dc.subject | machine learning | |
dc.subject | protein function prediction | |
dc.title | Accurate prediction of protein function using GOstruct | |
dc.type | Text | |
dcterms.rights.dpla | This Item is protected by copyright and/or related rights (https://rightsstatements.org/vocab/InC/1.0/). You are free to use this Item in any way that is permitted by the copyright and related rights legislation that applies to your use. For other uses you need to obtain permission from the rights-holder(s). | |
thesis.degree.discipline | Computer Science | |
thesis.degree.grantor | Colorado State University | |
thesis.degree.level | Doctoral | |
thesis.degree.name | Doctor of Philosophy (Ph.D.) |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- Sokolov_colostate_0053A_10688.pdf
- Size:
- 1.39 MB
- Format:
- Adobe Portable Document Format
- Description: