Self-Supervised Visual Representation Learning in Distributed Systems - Generalisation Analysis and Training Framework Design

Chen, Xuanyu

Access status:

Open Access

Field	Value	Language
dc.contributor.author	Chen, Xuanyu
dc.date.accessioned	2026-03-30T03:12:01Z
dc.date.available	2026-03-30T03:12:01Z
dc.date.issued	2026	en
dc.identifier.uri	https://hdl.handle.net/2123/35050
dc.description.abstract	This thesis investigates distributed self-supervised learning as a paradigm for training visual representation models directly from distributed and unlabelled data. While large-scale supervised datasets have driven advances in visual artificial intelligence, their centralised collection and annotation are costly, and real-world data is inherently fragmented across edge devices, institutions, and sensors. Distributed self-supervised learning aims to leverage such data without labels or central coordination, raising key challenges including robustness under heterogeneous client distributions, feasibility on resource-limited devices, and whether distributed training can approach centralised performance. This thesis makes four main contributions. First, it shows how decentralisation reshapes scaling laws, proving that the compute-optimal model size decreases as data becomes more distributed, explaining the effectiveness of lightweight models on edge devices. Second, it demonstrates that distributed training exhibits a generalisation gap compared to centralised training under equal compute, which can be reduced by increasing data through more clients or larger local datasets. Third, it provides a systematic theoretical analysis under heterogeneous data, showing that Masked Image Modelling is more robust than contrastive learning, with robustness improving with network connectivity, and introduces MAR loss to enhance robustness. Finally, it proposes DeNAV, a decentralised framework that removes server dependence and incorporates informed client selection, staleness-aware aggregation, and lightweight masked autoencoder pre-training for efficient communication, with theoretical guarantees and strong empirical performance. Overall, this thesis establishes distributed self-supervised learning as a feasible and effective paradigm and shows how theoretical insights on scaling laws, generalisation, and robustness guide the design of practical distributed learning systems.	en
dc.language.iso	en	en
dc.subject	Distributed Self-Supervised Learning	en
dc.subject	Federated Learning	en
dc.subject	Decentralized Learning	en
dc.subject	Heterogeneous Data	en
dc.subject	Scaling Laws	en
dc.subject	PAC-Bayesian Generalisation Analysis	en
dc.title	Self-Supervised Visual Representation Learning in Distributed Systems - Generalisation Analysis and Training Framework Design	en
dc.type	Thesis
dc.type.thesis	Doctor of Philosophy	en
dc.rights.other	The author retains copyright of this thesis. It may only be used for the purposes of research and study. It must not be used for any other purposes and may not be transmitted or shared with others without prior permission.	en
usyd.faculty	SeS faculties schools::Faculty of Engineering::School of Electrical and Information Engineering	en
usyd.degree	Doctor of Philosophy Ph.D.	en
usyd.awardinginst	The University of Sydney	en
usyd.advisor	Yuan, Dong
usyd.include.pub	No	en