Show simple item record

FieldValueLanguage
dc.contributor.authorCai, Shizhe
dc.date.accessioned2026-01-22T23:22:26Z
dc.date.available2026-01-22T23:22:26Z
dc.date.issued2026en
dc.identifier.urihttps://hdl.handle.net/2123/34759
dc.description.abstractDeep Reinforcement Learning (DRL) has demonstrated remarkable success in continuous control tasks. However, it often requires extensive training data, struggles with complex long-horizon planning, and may fail to maintain safety constraints during operation. Meanwhile, Model Predictive Control (MPC) provides explainability and constraint satisfaction but typically leads to only locally optimal solutions and demands careful manual design of cost functions. To address these complementary limitations, this thesis develops and validates Q-guided Stein variational model pre- dictive Actor-Critic (Q-STAC), a novel framework that bridges these approaches by integrating Bayesian Model Predictive Control (Bayesian MPC) with actor-critic reinforcement learning through Stein Variational Gradient Descent (SVGD). A core innovation within this framework is the direct optimization of control sequences using learned Q-values as objectives, an approach that eliminates the need for explicit cost function design while leveraging the dynamics of the system to improve sample efficiency and forces that control signals remain within safe boundaries. Extensive experiments on 2D navigation, robotic manipulation tasks and real-world picking task demonstrate that Q-STAC achieves superior sample efficiency, robustness, and optimality compared to State-of-the-Art (SOTA) algorithms.en
dc.language.isoenen
dc.subjectReinforcement Learningen
dc.subjectModel Predictive Controlen
dc.subjectBayesian Inferenceen
dc.subjectStein Variational Gradient Descenten
dc.titleCombining Actor-Critic Methods with Model Predictive Control via Stein Variational Inferenceen
dc.typeThesis
dc.type.thesisMasters by Researchen
dc.rights.otherThe author retains copyright of this thesis. It may only be used for the purposes of research and study. It must not be used for any other purposes and may not be transmitted or shared with others without prior permission.en
usyd.facultySeS faculties schools::Faculty of Engineering::School of Computer Scienceen
usyd.degreeMaster of Philosophy M.Philen
usyd.awardinginstThe University of Sydneyen
usyd.advisorRamos, Fabio
usyd.include.pubNoen


Show simple item record

Associated file/s

Associated collections

Show simple item record

There are no previous versions of the item available.