Cornell University
Library
Cornell UniversityLibrary

eCommons

Help
Log In(current)
  1. Home
  2. Cornell University Graduate School
  3. Cornell Theses and Dissertations
  4. Principled Off-Policy Imitation Learning via Boosting

Principled Off-Policy Imitation Learning via Boosting

File(s)
Sreenivas_cornell_0058O_11870.pdf (40.42 MB)
Permanent Link(s)
https://doi.org/10.7298/hjpq-qh26
https://hdl.handle.net/1813/114489
Collections
Cornell Theses and Dissertations
Author
Sreenivas, Dhruv
Abstract

Imitation learning is a promising paradigm to learn policies to solve a variety of tasks given some expert data. In particular, off-policy imitation learning is particularly nice for practitioners, as it in principle allows the policy to use previously collected data to improve, similar to standard value-based off-policy reinforcement learning algorithms such as Deep Q Learning and actor-critic methods. However, this is generally ill-defined, as the policy improvement operator can generally only in principle be applied to data collected by the most recent policy, making the algorithm on-policy. To mitigate this while still remaining off-policy, we design an actor-critic method where we treat the replay buffer as a collection of data from a set of weak learners. Our algorithm more appropriately weights each weak learner’s data when it comes to sampling for policy optimization, offering a principled way to mitigate the above distribution mismatch problem in the off-policy setting. We apply this technique to both state and vision-based tasks in the DeepMind Control Suite domain and see that our method does indeed improve learning in terms of sample efficiency.

Description
40 pages
Date Issued
2023-08
Keywords
Adversarial Learning
•
Imitation Learning
•
Reinforcement Learning
Committee Chair
Sun, Wen
Committee Member
Kleinberg, Robert
Degree Discipline
Computer Science
Degree Name
M.S., Computer Science
Degree Level
Master of Science
Type
dissertation or thesis
Link(s) to Catalog Record
https://newcatalog.library.cornell.edu/catalog/16219185

Site Statistics | Help

About eCommons | Policies | Terms of use | Contact Us

copyright © 2002-2026 Cornell University Library | Privacy | Web Accessibility Assistance