Show simple item record

Field | Value | Language
dc.contributor.author | Smith, Connor James |
dc.date.accessioned | 2022-02-09T04:14:49Z |
dc.date.available | 2022-02-09T04:14:49Z |
dc.date.issued | 2022 | en_AU
dc.identifier.uri | https://hdl.handle.net/2123/27428 |
dc.description.abstract | Variable selection is a key component of regression modelling, but slight changes to the initial data can change which models are identified. In this thesis, we identify and examine several problems in variable selection for which statistical frameworks are currently lacking, and show how stability-based approaches can be used to construct solutions. At its core, this thesis tackles complex data in a generalized linear model (GLM) framework, in both robust and higher dimensional settings. We target three main aspects: (i) the inability to use exhaustive variable selection approaches within the robust generalized linear model space; (ii) the struggles of stable variable selection for omics micro-array data, where the number of variables is significantly larger than the total number of observations; (iii) extracting information from multiple penalized regression solution paths to classify variables into different classes through both automated and visual classification. In Chapter 1, we provide an overview of variable selection methods, with the main focus placed on GLMs. In Chapter 2, we bring variable selection methods in a robust GLM space closer to the gold standard of the exhaustive search through the new RobStab (Robust Stability) framework. In Chapter 3, we propose a novel stability-based variable selection method, VIVID (VIsualisation of Variable Importance Differences), for omics GLM data. In Chapter 4, we expand upon the use of a single tuning parameter within penalized regression for variable selection with the new method ParSPaS. In Chapter 5, we make some final remarks and conclude the thesis. For all proposed methods, we provide publicly available computational implementations in R. | en_AU
dc.language.iso | en | en_AU
dc.subject | Variable Selection | en_AU
dc.subject | Stability Selection | en_AU
dc.subject | Statistics | en_AU
dc.title | Resampling Based Model Selection for Correlated and Complex Data | en_AU
dc.type | Thesis |
dc.type.thesis | Doctor of Philosophy | en_AU
dc.rights.other | The author retains copyright of this thesis. It may only be used for the purposes of research and study. It must not be used for any other purposes and may not be transmitted or shared with others without prior permission. | en_AU
usyd.faculty | SeS faculties schools::Faculty of Science::School of Mathematics and Statistics | en_AU
usyd.degree | Doctor of Philosophy Ph.D. | en_AU
usyd.awardinginst | The University of Sydney | en_AU
usyd.advisor | Mueller, Samuel |



There are no previous versions of the item available.