Show simple item record

FieldValueLanguage
dc.contributor.authorLu, Yan
dc.date.accessioned2024-04-09T03:59:41Z
dc.date.available2024-04-09T03:59:41Z
dc.date.issued2024en_AU
dc.identifier.urihttps://hdl.handle.net/2123/32441
dc.descriptionIncludes publication
dc.description.abstractMonocular 3D object detection is a key technology in image-based autonomous driving perception. Due to its challenging of the lack of depth. Existing methods solve it by introducing geometry prior such as estimating depth by perspective projection. This thesis explores the limitations of the previous geometry-based approaches, providing an error amplification problem suffered by such methods and concludes its specific influence on the error amplification problem as two negative effects: training stability and inference reliability. For the inference reliability problem, a Geometry Uncertainty Projection (GUP) module is proposed to model the perspective projection under the probabilistic modeling framework. It combines the uncertainty modeling and the geometry prior, which can provide a reliable uncertainty to reflect the quality for each projected depth and 3D object detection, making the monocular 3D detector more reliable. To solve the training instability, this thesis further proposes a heuristic Hierarchical Task Learning (HTL) scheme. It treats the learning of perspective projection as a cascaded multi-task learning problem, making the projection method learn more stably. With this, a GUPNet is proposed to combine that GUP module with the HTL scheme to increase the model inference reliability and stabilize the training process simultaneously. Finally, this thesis provides a Geometry Uncertainty Propagation Network (GUPNet++) to further utilize uncertainty in tackling the error amplification. The GUPNet++ introduces the uncertainty propagation theory and provides a new depth uncertainty. With the novel uncertainty, an uncertainty-based optimization and a novel uncertainty-based score scheme are proposed to make the model more reliable and stable than the GUPNet. The findings and extensive experiments in this thesis confirm the effectiveness of the proposed methods in tackling the error amplification problem of the monocular 3D object detection methods.en_AU
dc.language.isoenen_AU
dc.subjectDeep learningen_AU
dc.subjectcomputer visionen_AU
dc.subject3D visionen_AU
dc.subject3D from a single imageen_AU
dc.subjectuncertaintyen_AU
dc.subjectobject detectionen_AU
dc.titleUncertainty-based Geometry Modeling for Monocular 3D Object Detectionen_AU
dc.typeThesis
dc.type.thesisDoctor of Philosophyen_AU
dc.rights.otherThe author retains copyright of this thesis. It may only be used for the purposes of research and study. It must not be used for any other purposes and may not be transmitted or shared with others without prior permission.en_AU
usyd.facultySeS faculties schools::Faculty of Engineering::School of Electrical and Information Engineeringen_AU
usyd.degreeDoctor of Philosophy Ph.D.en_AU
usyd.awardinginstThe University of Sydneyen_AU
usyd.advisorLi, Yonghui
usyd.include.pubYesen_AU


Show simple item record

Associated file/s

Associated collections

Show simple item record

There are no previous versions of the item available.