Sangam: A Confluence of Knowledge Streams

Building an Essential Gene Classification Framework

Show simple item record

dc.contributor Dr. Dennis Bahler, Committee Member
dc.contributor Dr. Xiaosong Ma, Committee Member
dc.contributor Dr. Steffen Heber, Committee Chair
dc.creator Saha, Soma
dc.date 2010-04-02T18:15:06Z
dc.date 2010-04-02T18:15:06Z
dc.date 2006-01-05
dc.date.accessioned 2023-02-24T07:32:34Z
dc.date.available 2023-02-24T07:32:34Z
dc.identifier etd-01032006-230402
dc.identifier http://www.lib.ncsu.edu/resolver/1840.16/2546
dc.identifier.uri http://localhost:8080/xmlui/handle/CUHPOERS/258870
dc.description The analysis of gene deletions is a fundamental approach for investigating gene function. We applied machine learning techniques to predict phenotypic effects of gene deletions in yeast. We created a dataset containing features that potentially have predictive power and then used feature processing techniques to improve the dataset and identify features that are important for our classification problem. We evaluated four different classification algorithms, K-Nearest Neighbors, Support Vector Machine, Decision Tree, and Random Forest, with respect to this problem. We used our framework to complement the set of experimentally determined essential yeast genes produced by the Saccharomyces Genome Deletion Project and produce more than 2000 annotations for genes that might cause morphological alterations in yeast.
dc.rights I hereby certify that, if appropriate, I have obtained and attached hereto a written permission statement from the owner(s) of each third party copyrighted matter to be included in my thesis, dissertation, or project report, allowing distribution as specified below. I certify that the version I submitted is the same as that approved by my advisory committee. I hereby grant to NC State University or its agents the non-exclusive license to archive and make accessible, under the conditions specified below, my thesis, dissertation, or project report in whole or in part in all forms of media, now or hereafter known. I retain all other ownership rights to the copyright of the thesis, dissertation or project report. I also retain the right to use in future works (such as articles or books) all or part of this thesis, dissertation, or project report.
dc.subject machine learning
dc.subject classification
dc.subject yeast
dc.subject morphological alterations
dc.subject essential genes
dc.title Building an Essential Gene Classification Framework


Files in this item

Files Size Format View
etd.pdf 522.6Kb application/pdf View/Open

This item appears in the following Collection(s)

Show simple item record

Search DSpace


Advanced Search

Browse