Self-Supervised Visual Representation Learning in Distributed Systems - Generalisation Analysis and Training Framework Design
| Field | Value | Language |
| dc.contributor.author | Chen, Xuanyu | |
| dc.date.accessioned | 2026-03-30T03:12:01Z | |
| dc.date.available | 2026-03-30T03:12:01Z | |
| dc.date.issued | 2026 | en |
| dc.identifier.uri | https://hdl.handle.net/2123/35050 | |
| dc.description.abstract | This thesis investigates distributed self-supervised learning as a paradigm for training visual representation models directly from distributed and unlabelled data. While large-scale supervised datasets have driven advances in visual artificial intelligence, their centralised collection and annotation are costly, and real-world data is inherently fragmented across edge devices, institutions, and sensors. Distributed self-supervised learning aims to leverage such data without labels or central coordination, raising key challenges including robustness under heterogeneous client distributions, feasibility on resource-limited devices, and whether distributed training can approach centralised performance. This thesis makes four main contributions. First, it shows how decentralisation reshapes scaling laws, proving that the compute-optimal model size decreases as data becomes more distributed, explaining the effectiveness of lightweight models on edge devices. Second, it demonstrates that distributed training exhibits a generalisation gap compared to centralised training under equal compute, which can be reduced by increasing data through more clients or larger local datasets. Third, it provides a systematic theoretical analysis under heterogeneous data, showing that Masked Image Modelling is more robust than contrastive learning, with robustness improving with network connectivity, and introduces MAR loss to enhance robustness. Finally, it proposes DeNAV, a decentralised framework that removes server dependence and incorporates informed client selection, staleness-aware aggregation, and lightweight masked autoencoder pre-training for efficient communication, with theoretical guarantees and strong empirical performance. Overall, this thesis establishes distributed self-supervised learning as a feasible and effective paradigm and shows how theoretical insights on scaling laws, generalisation, and robustness guide the design of practical distributed learning systems. | en |
| dc.language.iso | en | en |
| dc.subject | Distributed Self-Supervised Learning | en |
| dc.subject | Federated Learning | en |
| dc.subject | Decentralized Learning | en |
| dc.subject | Heterogeneous Data | en |
| dc.subject | Scaling Laws | en |
| dc.subject | PAC-Bayesian Generalisation Analysis | en |
| dc.title | Self-Supervised Visual Representation Learning in Distributed Systems - Generalisation Analysis and Training Framework Design | en |
| dc.type | Thesis | |
| dc.type.thesis | Doctor of Philosophy | en |
| dc.rights.other | The author retains copyright of this thesis. It may only be used for the purposes of research and study. It must not be used for any other purposes and may not be transmitted or shared with others without prior permission. | en |
| usyd.faculty | SeS faculties schools::Faculty of Engineering::School of Electrical and Information Engineering | en |
| usyd.degree | Doctor of Philosophy Ph.D. | en |
| usyd.awardinginst | The University of Sydney | en |
| usyd.advisor | Yuan, Dong | |
| usyd.include.pub | No | en |
Associated file/s
Associated collections