Towards Viable Imitation Learning

Cheng, Zhihao

Access status:

USyd Access

Field	Value	Language
dc.contributor.author	Cheng, Zhihao
dc.date.accessioned	2024-01-08T05:02:43Z
dc.date.available	2024-01-08T05:02:43Z
dc.date.issued	2023	en_AU
dc.identifier.uri	https://hdl.handle.net/2123/32074
dc.description	Includes publication
dc.description.abstract	Imitation learning (IL), a fundamental machine learning paradigm, has achieved remarkable success in various domains, including autonomous driving, games, and robot locomotion. However, current IL relies heavily on an adversarial training scheme, which results in training instability and high computational burdens. These limitations pose challenges, especially in practical applications. Therefore, the thesis focuses on addressing three crucial research problems: improving data flexibility, enhancing safety, and reducing computation. By tackling these challenges, the goal is to advance IL and overcome the obstacles that hinder its practicality in real-world applications. First, the thesis analyzes the difference between conducting IL with expert observations and demonstrations and establishes the almost equivalence between these two methods in deterministic robot environments or robot environments with bounded randomness, promoting the applicability of learning from observation (LfO) in solving real-world problems. Furthermore, the thesis addresses the challenge of handling expert data in the form of visual inputs and proposes an IL framework that can effectively and efficiently learn from visual inputs by extracting meaningful features with data augmentation and maximizing sample reuse with off-policy learning. Then, the thesis presents a two-stage optimization framework, which employs a Lagrange multiplier to model application-oriented safety constraints and can generate policies that satisfy the prescribed safety constraint with a theoretical guarantee. Finally, the thesis conducts pilot studies on how to empower IL algorithms with quantum computing and presents two quantum IL (QIL) algorithms that can be run on quantum computers to reap the quantum advantage, paving the way for the quantum era.	en_AU
dc.language.iso	en	en_AU
dc.subject	imitation learning	en_AU
dc.subject	reinforcement learning	en_AU
dc.subject	quantum machine learning	en_AU
dc.title	Towards Viable Imitation Learning	en_AU
dc.type	Thesis
dc.type.thesis	Doctor of Philosophy	en_AU
dc.rights.other	The author retains copyright of this thesis. It may only be used for the purposes of research and study. It must not be used for any other purposes and may not be transmitted or shared with others without prior permission.	en_AU
usyd.faculty	SeS faculties schools::Faculty of Engineering::School of Computer Science	en_AU
usyd.degree	Doctor of Philosophy Ph.D.	en_AU
usyd.awardinginst	The University of Sydney	en_AU
usyd.advisor	Tao, Dacheng
usyd.include.pub	Yes	en_AU