3D Human Motion Recovery and Generation: From Noisy Pose to Language Description
Field | Value | Language |
dc.contributor.author | Ma, Sihan | |
dc.date.accessioned | 2024-11-05T23:13:10Z | |
dc.date.available | 2024-11-05T23:13:10Z | |
dc.date.issued | 2024 | en_AU |
dc.identifier.uri | https://hdl.handle.net/2123/33234 | |
dc.description | Includes publication | |
dc.description.abstract | Understanding and generating human behaviors are crucial and fundamental to numerous computer vision applications, including embodied AI, autonomous driving, and human-machine interaction. In the field of human behavior modeling, there are two primary components: perception and generation. Perception involves detecting 2D and 3D poses from sensor data to interpret and understand human behaviors in real-world scenarios. In contrast, generation focuses on creating realistic or desired human behaviors based on conditional signals, such as natural language directions, offering significant advantages in animation creation, filmmaking, and robot control. Within human behavior modeling, this thesis focuses on three critical aspects of perception and generation: robust pose estimation from images, realistic motion recovery from monocular videos and noisy inputs, and interactive motion synthesis conditioned on natural language descriptions. We address these three key areas in human behavior research, proposing solutions that lay the foundation for a comprehensive human motion framework in future work. It will pave the way for applications in VR/AR, animation, healthcare, and human-robot interaction. | en_AU |
dc.language.iso | en | en_AU |
dc.subject | human behavior modeling | en_AU |
dc.subject | motion capture | en_AU |
dc.subject | motion reconstruction | en_AU |
dc.subject | motion generation | en_AU |
dc.subject | pose estimation | en_AU |
dc.title | 3D Human Motion Recovery and Generation: From Noisy Pose to Language Description | en_AU |
dc.type | Thesis | |
dc.type.thesis | Doctor of Philosophy | en_AU |
dc.rights.other | The author retains copyright of this thesis. It may only be used for the purposes of research and study. It must not be used for any other purposes and may not be transmitted or shared with others without prior permission. | en_AU |
usyd.faculty | SeS faculties schools::Faculty of Engineering::School of Computer Science | en_AU |
usyd.degree | Doctor of Philosophy Ph.D. | en_AU |
usyd.awardinginst | The University of Sydney | en_AU |
usyd.advisor | Tao, Dacheng | |
usyd.include.pub | Yes | en_AU |
Associated file/s
Associated collections