Repository logo
 

The wisdom of the crowd: reliable deep reinforcement learning through ensembles of Q-functions

dc.contributor.authorElliott, Daniel L., author
dc.contributor.authorAnderson, Charles W., advisor
dc.contributor.authorDraper, Bruce, committee member
dc.contributor.authorKirby, Michael, committee member
dc.contributor.authorChong, Edwin, committee member
dc.date.accessioned2018-09-10T20:05:46Z
dc.date.available2018-09-10T20:05:46Z
dc.date.issued2018
dc.description.abstractReinforcement learning agents learn by exploring the environment and then exploiting what they have learned. This frees the human trainers from having to know the preferred action or intrinsic value of each encountered state. The cost of this freedom is reinforcement learning can feel too slow and unstable during learning: exhibiting performance like that of a randomly initialized Q-function just a few parameter updates after solving the task. We explore the possibility that ensemble methods can remedy these shortcomings and do so by investigating a novel technique which harnesses the wisdom of the crowds by bagging Q-function approximator estimates. Our results show that this proposed approach improves all tasks and reinforcement learning approaches attempted. We are able to demonstrate that this is a direct result of the increased stability of the action portion of the state-action-value function used by Q-learning to select actions and by policy gradient methods to train the policy. Recently developed methods attempt to solve these RL challenges at the cost of increasing the number of interactions with the environment by several orders of magnitude. On the other hand, the proposed approach has little downside for inclusion: it addresses RL challenges while reducing the number interactions with the environment.
dc.format.mediumborn digital
dc.format.mediumdoctoral dissertations
dc.identifierElliott_colostate_0053A_15075.pdf
dc.identifier.urihttps://hdl.handle.net/10217/191477
dc.languageEnglish
dc.language.isoeng
dc.publisherColorado State University. Libraries
dc.relation.ispartof2000-2019
dc.rightsCopyright and other restrictions may apply. User is responsible for compliance with all applicable laws. For information about copyright law, please see https://libguides.colostate.edu/copyright.
dc.subjectmachine learning
dc.subjectQ-learning
dc.subjectensemble
dc.subjectreinforcement learning
dc.subjectneural networks
dc.titleThe wisdom of the crowd: reliable deep reinforcement learning through ensembles of Q-functions
dc.typeText
dcterms.rights.dplaThis Item is protected by copyright and/or related rights (https://rightsstatements.org/vocab/InC/1.0/). You are free to use this Item in any way that is permitted by the copyright and related rights legislation that applies to your use. For other uses you need to obtain permission from the rights-holder(s).
thesis.degree.disciplineComputer Science
thesis.degree.grantorColorado State University
thesis.degree.levelDoctoral
thesis.degree.nameDoctor of Philosophy (Ph.D.)

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Elliott_colostate_0053A_15075.pdf
Size:
29.19 MB
Format:
Adobe Portable Document Format
Description: