Please use this identifier to cite or link to this item:
|Title:||Human Promoter Recognition Based on Principal Component Analysis|
Transcription Start Sites
Principal Component Analysis
|Publisher:||University of Sydney.|
School of Electrical and Information Engineering
|Abstract:||This thesis presents an innovative human promoter recognition model HPR-PCA. Principal component analysis (PCA) is applied on context feature selection DNA sequences and the prediction network is built with the artificial neural network (ANN). A thorough literature review of all the relevant topics in the promoter prediction field is also provided. As the main technique of HPR-PCA, the application of PCA on feature selection is firstly developed. In order to find informative and discriminative features for effective classification, PCA is applied on the different n-mer promoter and exon combined frequency matrices, and principal components (PCs) of each matrix are generated to construct the new feature space. ANN built classifiers are used to test the discriminability of each feature space. Finally, the 3 and 5-mer feature matrix is selected as the context feature in this model. Two proposed schemes of HPR-PCA model are discussed and the implementations of sub-modules in each scheme are introduced. The context features selected by PCA are III used to build three promoter and non-promoter classifiers. CpG-island modules are embedded into models in different ways. In the comparison, Scheme I obtains better prediction results on two test sets so it is adopted as the model for HPR-PCA for further evaluation. Three existing promoter prediction systems are used to compare to HPR-PCA on three test sets including the chromosome 22 sequence. The performance of HPR-PCA is outstanding compared to the other four systems.|
|Description:||Master of Engineering|
|Rights and Permissions:||The author retains copyright of this thesis.|
|Type of Work:||Masters Thesis|
|Appears in Collections:||Sydney Digital Theses (Open Access)|
This work is protected by Copyright. All rights reserved. Access to this work is provided for the purposes of personal research and study. Except where permitted under the Copyright Act 1968, this work must not be copied or communicated to others without the express permission of the copyright owner. Use the persistent URI in this record to enable others to access this work.
|Final_version_of_Xiaomeng_Li's_thesis.pdf||804.32 kB||Adobe PDF||View/Open|
Items in Sydney eScholarship Repository are protected by copyright, with all rights reserved, unless otherwise indicated.