An End-to-end Framework for Unconstrained Monocular 3D Hand Pose Estimation
Access status:
USyd Access
Type
ThesisThesis type
Masters by ResearchAuthor/s
Sanjeev, SanjeevAbstract
This thesis addresses the challenging problem of unconstrained 3D hand pose estimation using monocular RGB images. Most of the existing approaches assume some prior knowledge of hand (such as hand locations and side information) is available for 3D hand pose estimation. This restricts ...
See moreThis thesis addresses the challenging problem of unconstrained 3D hand pose estimation using monocular RGB images. Most of the existing approaches assume some prior knowledge of hand (such as hand locations and side information) is available for 3D hand pose estimation. This restricts their use in unconstrained environments. We, therefore, present an end-to-end framework that robustly predicts hand prior information and accurately infers 3D hand pose by learning ConvNet models while only using keypoint annotations. Unlike existing methods that suffer from background color confusion caused by using segmentation or detection-based technology, we achieve a more robust result by using a novel keypoint-based method to simultaneously predict hand regions and side labels. Moreover, inspired by the biological structure of the human hand, we introduce two geometric constraints directly into the 3D coordinates prediction that further improves its performance with a weakly-supervised training schedule. Experimental results show that our proposed framework not only performs robustly on unconstrained setting, but also outperforms the state-of-art methods on standard benchmark datasets.
See less
See moreThis thesis addresses the challenging problem of unconstrained 3D hand pose estimation using monocular RGB images. Most of the existing approaches assume some prior knowledge of hand (such as hand locations and side information) is available for 3D hand pose estimation. This restricts their use in unconstrained environments. We, therefore, present an end-to-end framework that robustly predicts hand prior information and accurately infers 3D hand pose by learning ConvNet models while only using keypoint annotations. Unlike existing methods that suffer from background color confusion caused by using segmentation or detection-based technology, we achieve a more robust result by using a novel keypoint-based method to simultaneously predict hand regions and side labels. Moreover, inspired by the biological structure of the human hand, we introduce two geometric constraints directly into the 3D coordinates prediction that further improves its performance with a weakly-supervised training schedule. Experimental results show that our proposed framework not only performs robustly on unconstrained setting, but also outperforms the state-of-art methods on standard benchmark datasets.
See less
Date
2020Publisher
University of SydneyRights statement
The author retains copyright of this thesis. It may only be used for the purposes of research and study. It must not be used for any other purposes and may not be transmitted or shared with others without prior permission.Faculty/School
Faculty of Engineering, School of Computer ScienceDepartment, Discipline or Centre
Computer ScienceAwarding institution
The University of SydneyShare