Analysing the False Discovery Proportion in Competition-Based Multiple Hypothesis Testing
| Field | Value | Language |
| dc.contributor.author | Ebadi, Arya | |
| dc.date.accessioned | 2023-11-20T01:24:02Z | |
| dc.date.available | 2023-11-20T01:24:02Z | |
| dc.date.issued | 2023 | en |
| dc.identifier.uri | https://hdl.handle.net/2123/31879 | |
| dc.description.abstract | The commonly practised modern approach to analysing a large number of hypotheses is to control the false discovery rate (FDR), the expected value of the false discovery proportion (FDP). Several such popular procedures are available when p-values can be assigned to each hypothesis, but the mass spectrometry community had encountered a problem where such p-values were not readily available. To resolve this issue, they developed target-decoy competition (TDC), a competition-based approach to controlling the FDR where, instead of a p-value, each observation (referred to as a target score) is paired with a competing null-generated score (referred to as a decoy score). The paired target and decoy scores are each competed and only the winning score, as well as a target/decoy-win label, is kept. Such scores and labels allow us to estimate, and hence control, the FDR. Controlling the FDR among a set of rejected hypotheses, however, only controls the FDP in that set in an average sense. While we can enforce the FDR to be appropriately bounded, the actual FDP may undesirably exceed our desired tolerance. Arguably, it makes more sense to gauge the FDP in our list of discoveries instead of an expected value that could only be obtained through an infinite number of experiment repetitions. Hence, we interest ourselves in the analysis of the FDP for competition-based multiple hypothesis testing. We examine in this thesis three major topics. First, we introduce ways to give an upper confidence bound on the FDP in the list of discoveries generated using the FDR as the chosen error metric. Second, we introduce ways to derive a list of discoveries in which the FDP is directly controlled with some pre-specified confidence. Third, and finally, we introduce ways to derive a similar list in cases where the set of hypotheses can be partitioned into groups possessing distinct characteristics. | en |
| dc.language.iso | en | en |
| dc.subject | False discovery proportion (FDP) | en |
| dc.subject | Target-decoy competition (TDC) | en |
| dc.subject | Peptide detection | en |
| dc.subject | Sequential hypothesis testing | en |
| dc.subject | Knockoffs. | en |
| dc.title | Analysing the False Discovery Proportion in Competition-Based Multiple Hypothesis Testing | en |
| dc.type | Thesis | |
| dc.type.thesis | Masters by Research | en |
| dc.rights.other | The author retains copyright of this thesis. It may only be used for the purposes of research and study. It must not be used for any other purposes and may not be transmitted or shared with others without prior permission. | en |
| usyd.faculty | SeS faculties schools::Faculty of Science | en |
| usyd.degree | Master of Philosophy (Science) | en |
| usyd.awardinginst | The University of Sydney | en |
| usyd.advisor | Keich, Uri |
Associated file/s
Associated collections