CANOR COACH: Towards Noise-Robust Human-in-the-Loop Reinforcement Learning

dc.contributor.advisor: Matthew E. Taylor
dc.contributor.advisor: Srijita Das
dc.contributor.author: Li, Yuxuan
dc.date.accessioned: 2025-05-29T08:15:17Z
dc.date.available: 2025-05-29T08:15:17Z
dc.date.issued: 2024-11
dc.description.abstract: Reinforcement learning has been widely applied to control tasks, but it often suffers from low sample efficiency. Introducing human prior knowledge, for example through behaviour cloning, learning from advice, or inverse reinforcement learning, is often seen as a possible solution. Learning from feedback is one such approach: the agent learns from binary feedback that describes the teacher’s attitude towards the agent’s action. Compared to traditional learning from demonstration methods, learning from feedback does not require expert-level knowledge. This can also be a drawback, because non-expert feedback inevitably contains noise. In this thesis, we investigate how and to what extent noise impacts learning performance. We also propose a series of methods to de-noise the feedback data online and achieve noise-robust human-in-the-loop reinforcement learning with different amounts of prior knowledge.
dc.identifier.doi: https://doi.org/10.7939/r3-5han-c270
dc.language.iso: en
dc.rights: This thesis is made available by the University of Alberta Library with permission of the copyright owner solely for non-commercial purposes. This thesis, or any portion thereof, may not otherwise be copied or reproduced without the written consent of the copyright owner, except to the extent permitted by Canadian copyright law.
dc.subject: Reinforcement Learning
dc.subject: Machine Learning
dc.subject: Human-in-the-Loop Reinforcement Learning
dc.subject: Anomaly Detection
dc.title: CANOR COACH: Towards Noise-Robust Human-in-the-Loop Reinforcement Learning
dc.type: http://purl.org/coar/resource_type/c_46ec
thesis.degree.grantor: http://id.loc.gov/authorities/names/n79058482
thesis.degree.level: Master's
thesis.degree.name: Master of Science
ual.date.graduation: Fall 2024
ual.department: Department of Computing Science
ual.jupiterAccess: http://terms.library.ualberta.ca/public

Files

Original bundle

Name: Li_Yuxuan_202408_MSc.pdf
Size: 27.18 MB
Format: Adobe Portable Document Format