Solving MDPs with thresholded lexicographic ordering using reinforcement learning

Tercan, Alperen, author; Prabhu, Vinayak S., advisor; Anderson, Charles W., advisor; Chong, Edwin K. P., committee member

Solving MDPs with thresholded lexicographic ordering using reinforcement learning

Files

Tercan_colostate_0053N_17466.pdf (912.82 KB)

Date

2022

Authors

Tercan, Alperen, author

Prabhu, Vinayak S., advisor

Anderson, Charles W., advisor

Chong, Edwin K. P., committee member

Abstract

Multiobjective problems with a strict importance order over the objectives occur in many real-life scenarios. While Reinforcement Learning (RL) is a promising approach with a great potential to solve many real-life problems, the RL literature focuses primarily on single-objective tasks, and approaches that can directly address multiobjective with importance order have been scarce. The few proposed approach were noted to be heuristics without theoretical guarantees. However, we found that their practical applicability is very limited as they fail to find a good solution even in very common scenarios. In this work, we first investigate these shortcomings of the existing approaches and propose some solutions that could improve their practical performance. Finally, we propose a completely different approach based on policy optimization using our Lexicographic Projection Optimization (LPO) algorithm and show its performance on some benchmark problems.

URI

https://hdl.handle.net/10217/235936

Collections

2020-
Theses and Dissertations

Full item page

Solving MDPs with thresholded lexicographic ordering using reinforcement learning

Files

Date

Authors

Journal Title

Journal ISSN

Volume Title

Abstract

Description

Rights Access

Subject

Citation

URI

Associated Publications

Collections