Towards Viable Imitation Learning
Field | Value | Language |
dc.contributor.author | Cheng, Zhihao | |
dc.date.accessioned | 2024-01-08T05:02:43Z | |
dc.date.available | 2024-01-08T05:02:43Z | |
dc.date.issued | 2023 | en_AU |
dc.identifier.uri | https://hdl.handle.net/2123/32074 | |
dc.description | Includes publication | |
dc.description.abstract | Imitation learning (IL), a fundamental machine learning paradigm, has achieved remarkable success in various domains, including autonomous driving, games, and robot locomotion. However, current IL relies heavily on an adversarial training scheme, which results in training instability and high computational burdens. These limitations pose challenges, especially in practical applications. Therefore, the thesis focuses on addressing three crucial research problems: improving data flexibility, enhancing safety, and reducing computation. By tackling these challenges, the goal is to advance IL and overcome the obstacles that hinder its practicality in real-world applications. First, the thesis analyzes the difference between conducting IL with expert observations and demonstrations and establishes the almost equivalence between these two methods in deterministic robot environments or robot environments with bounded randomness, promoting the applicability of learning from observation (LfO) in solving real-world problems. Furthermore, the thesis addresses the challenge of handling expert data in the form of visual inputs and proposes an IL framework that can effectively and efficiently learn from visual inputs by extracting meaningful features with data augmentation and maximizing sample reuse with off-policy learning. Then, the thesis presents a two-stage optimization framework, which employs a Lagrange multiplier to model application-oriented safety constraints and can generate policies that satisfy the prescribed safety constraint with a theoretical guarantee. Finally, the thesis conducts pilot studies on how to empower IL algorithms with quantum computing and presents two quantum IL (QIL) algorithms that can be run on quantum computers to reap the quantum advantage, paving the way for the quantum era. | en_AU |
dc.language.iso | en | en_AU |
dc.subject | imitation learning | en_AU |
dc.subject | reinforcement learning | en_AU |
dc.subject | quantum machine learning | en_AU |
dc.title | Towards Viable Imitation Learning | en_AU |
dc.type | Thesis | |
dc.type.thesis | Doctor of Philosophy | en_AU |
dc.rights.other | The author retains copyright of this thesis. It may only be used for the purposes of research and study. It must not be used for any other purposes and may not be transmitted or shared with others without prior permission. | en_AU |
usyd.faculty | SeS faculties schools::Faculty of Engineering::School of Computer Science | en_AU |
usyd.degree | Doctor of Philosophy Ph.D. | en_AU |
usyd.awardinginst | The University of Sydney | en_AU |
usyd.advisor | Tao, Dacheng | |
usyd.include.pub | Yes | en_AU |
Associated file/s
Associated collections