Show simple item record

FieldValueLanguage
dc.contributor.authorHu, Zeke Zexi
dc.date.accessioned2025-07-24T05:14:35Z
dc.date.available2025-07-24T05:14:35Z
dc.date.issued2025en
dc.identifier.urihttps://hdl.handle.net/2123/34144
dc.descriptionIncludes publication
dc.description.abstractLight field (LF) imaging captures both spatial and angular information of light rays, offering a four-dimensional representation of a scene that enables applications such as refocusing, depth estimation, and immersive media. However, practical deployment of LF imaging faces two critical challenges: low spatial resolution due to inherent hardware constraints, and high data volume impeding efficient transmission. This thesis addresses these challenges through novel contributions in light field super-resolution (LFSR) and light field transmission. To enhance spatial resolution, we first propose the Many-to-Many Transformer (M2MT), which mitigates the subspace isolation problem common in existing deep learning approaches. M2MT encodes unexposed LF dimensions as channel embeddings, enabling global spatial-angular modelling and achieving state-of-the-art LFSR performance. We then introduce SkimLFSR, a lightweight yet effective network that alleviates disparity entanglement by selectively incorporating structurally informative subviews, following a “less is more” strategy that improves both accuracy and efficiency. In the context of LF transmission, we develop a user-centric framework that integrates angular attention modelling with video compression. To support this, we construct LF-EMT12, the first eye-tracking dataset for LF viewing, and design LF3A-Net to estimate user attention over subviews. Our approach enables selective subview transmission based on predicted user focus, significantly reducing bandwidth requirements while maintaining perceptual quality. Collectively, these contributions advance the state-of-the-art in LF image processing and transmission. They provide new insights into structured representation learning and user-adaptive LF systems, laying a foundation for future research and practical applications in computational photography, immersive media, and beyond.en
dc.language.isoenen
dc.subjectlight fielden
dc.subjectsuper-resolutionen
dc.subjectimage processingen
dc.subjectself-attentionen
dc.subjectTransformeren
dc.titleLight Field Image Processing and Applications with Deep Learningen
dc.typeThesis
dc.type.thesisDoctor of Philosophyen
dc.rights.otherThe author retains copyright of this thesis. It may only be used for the purposes of research and study. It must not be used for any other purposes and may not be transmitted or shared with others without prior permission.en
usyd.facultySeS faculties schools::Faculty of Engineering::School of Computer Scienceen
usyd.degreeDoctor of Philosophy Ph.D.en
usyd.awardinginstThe University of Sydneyen
usyd.advisorChung, Vera
usyd.include.pubYesen


Show simple item record

Associated file/s

Associated collections

Show simple item record

There are no previous versions of the item available.