Dataset associated with "Estimation of the state-value function for optimal reservoir operations using continuous action deep reinforcement learning"
dc.contributor.author | Peacock, Matthew E. | |
dc.contributor.author | Labadie, John W. | |
dc.coverage.spatial | Upper Russian River Basin, northern California | en_US |
dc.coverage.temporal | 1950-01-01 to 2010-12-31 | en_US |
dc.date.accessioned | 2020-06-30T20:52:54Z | |
dc.date.available | 2020-06-30T20:52:54Z | |
dc.date.issued | 2020 | |
dc.description | This dataset includes the source code of an implementation of the deep deterministic policy gradients algorithm to a reservoir operations problem. Also included are the input time series data of inflow and withdrawal at each node in the network and the evaporation table. | en_US |
dc.description | Department of Civil and Environmental Engineering | |
dc.description.abstract | The state-value function of a reservoir system provides information about the long-term rewards that can be accrued from any state which the system can occupy. This function can be used to determine optimal decisions and is also key piece of information needed when reservoir operators wish to incorporate real-time forecast information. Dynamic programming is the most popular method for calculating the state-value function but has well-known limitations. The "curse of dimensionality,'' which can lead to computational intractability, arises from the discrete nature of the formulation and the backwards recursive solution process precluding consideration of delayed rewards. Continuous action deep reinforcement learning (CADRL) is a recent development for estimating the state-value function when delayed rewards are present and avoids the difficulties associated with use of discrete methods. Since application of this technique to reservoir operation problems is not without its own challenges, presented herein is a computational implementation with refinements needed to provide a stable and reliable learning process. CADRL is applied to development of optimal operational strategies for Lake Mendocino in the Russian River basin of Northern California using two single-objective reward functions, along with a multi-objective reward function for verification purposes. Performance of the optimal policy functions developed from the learning process is evaluated through simulation, with results showing that the system is able to learn far-sighted strategies that outperform idealized policies with foresight. | en_US |
dc.description.sponsorship | The writers gratefully acknowledge the financial support provided by the National Oceanic and Atmospheric Administration (NOAA) Earth System Research Laboratory (ESRL), Physical Sciences Division (PSD), U.S. Department of Commerce, which was administered through the Sonoma County Water Agency (SCWA), Santa Rosa CA and the Cooperative Institute for Research in the Atmosphere (CIRA) at Colorado State University. Thanks also to Chris Delaney of the SCWA for providing important data for this project as well the HEC-ResSim model calibrated for Lake Mendocino and the Russian River basin. | en_US |
dc.format.medium | ZIP | |
dc.format.medium | TXT | |
dc.format.medium | CSV | |
dc.format.medium | PY | |
dc.format.medium | Source Code | |
dc.identifier.uri | https://hdl.handle.net/10217/208774 | |
dc.identifier.uri | http://dx.doi.org/10.25675/10217/208774 | |
dc.language | English | en_US |
dc.language.iso | eng | en_US |
dc.publisher | Colorado State University. Libraries | en_US |
dc.relation.ispartof | Research Data | |
dc.relation.isreferencedby | in review: Estimation of the State-Value Function for Optimal Reservoir Operations using Continuous Action Deep Reinforcement Learning | en_US |
dc.rights.license | This material is distributed under the terms and conditions of the GNU General Public License, version 3 (https://www.gnu.org/licenses/gpl-3.0.en.html). | |
dc.subject | reservoir operations | en_US |
dc.subject | reinforcement learning | en_US |
dc.subject | deep deterministic policy gradients | en_US |
dc.subject | continuous action deep reinforcement learning | en_US |
dc.subject | ensemble forecast | en_US |
dc.title | Dataset associated with "Estimation of the state-value function for optimal reservoir operations using continuous action deep reinforcement learning" | en_US |
dc.type | Dataset | en_US |