Repository logo
 

Solving MDPs with thresholded lexicographic ordering using reinforcement learning

Date

2022

Authors

Tercan, Alperen, author
Prabhu, Vinayak S., advisor
Anderson, Charles W., advisor
Chong, Edwin K. P., committee member

Journal Title

Journal ISSN

Volume Title

Abstract

Multiobjective problems with a strict importance order over the objectives occur in many real-life scenarios. While Reinforcement Learning (RL) is a promising approach with a great potential to solve many real-life problems, the RL literature focuses primarily on single-objective tasks, and approaches that can directly address multiobjective with importance order have been scarce. The few proposed approach were noted to be heuristics without theoretical guarantees. However, we found that their practical applicability is very limited as they fail to find a good solution even in very common scenarios. In this work, we first investigate these shortcomings of the existing approaches and propose some solutions that could improve their practical performance. Finally, we propose a completely different approach based on policy optimization using our Lexicographic Projection Optimization (LPO) algorithm and show its performance on some benchmark problems.

Description

Rights Access

Subject

Citation

Associated Publications