Addressing Sources of Bias in Genetic Association Studies

Miclaus, Kelci Jo

dc.contributor	Dahlia Nielsen, Committee Member
dc.contributor	Lexin Li, Committee Member
dc.contributor	Russ Wolfinger, Committee Chair
dc.contributor	Jason Osborne, Committee Co-Chair
dc.creator	Miclaus, Kelci Jo
dc.date	2010-04-02T19:12:10Z
dc.date	2009-11-01
dc.date.accessioned	2023-02-28T17:09:46Z
dc.date.available	2023-02-28T17:09:46Z
dc.identifier	etd-05062009-115841
dc.identifier	http://www.lib.ncsu.edu/resolver/1840.16/5350
dc.identifier.uri	http://localhost:8080/xmlui/handle/CUHPOERS/265879
dc.description	Genome-wide association studies (GWAS) have become a popular method for the discovery of genetic variants associated with complex diseases or traits. As the size and scope of these studies increase in order to obtain higher power for determining significant associations, careful consideration of population structure becomes paramount. If individuals in a study come from different ethnic or ancestral backgrounds, variation in allele frequencies and disproportionate ancestry representation in cases and controls can lead to inflated Type I error rates. Over the years, several methods for controlling population stratification have been introduced, many of which rely on the use of multivariate dimension reduction methods. An important aspect of population stratification is to determine which loci exhibit evidence of population allele frequency differences. We introduce a method based on Hardy-Weinberg Disequilibrium to find substructure-informative markers coupled with the use of nonmetric Multidimensional Scaling (NMDS) in order to visualize population structure in a sample. We extend the use of NMDS in conjunction with nonparametric clustering to develop a test for association that corrects for population stratification. We show that NMDS is a preferable visualization technique for detecting multiple levels of relatedness within a set of individuals and that the subsequent test correction model is a more powerful test under realistic scenarios. Recent research has shown that technical bias due to differential genotyping errors between cases and controls can also inflate the Type I error rate, possibly an even more severe source of bias in GWAS. Current genotype calling algorithms rely on processing samples in batches due to computational constraints as well as concerns about differences in DNA collection, lab preparation and heterogeneous samples that can skew results of genotype calls.
This thesis also addresses possible bias caused by differential genotyping due to batch size and composition effects for the widely used BRLMM algorithm recommended for the Affymetrix GeneChip Human Mapping 500K array set. Samples obtained from the Wellcome Trust Case Control Consortium are utilized to determine differential results due to genotype calling batch differences.
dc.rights	I hereby certify that, if appropriate, I have obtained and attached hereto a written permission statement from the owner(s) of each third party copyrighted matter to be included in my thesis, dissertation, or project report, allowing distribution as specified below. I certify that the version I submitted is the same as that approved by my advisory committee. I hereby grant to NC State University or its agents the non-exclusive license to archive and make accessible, under the conditions specified below, my thesis, dissertation, or project report in whole or in part in all forms of media, now or hereafter known. I retain all other ownership rights to the copyright of the thesis, dissertation or project report. I also retain the right to use in future works (such as articles or books) all or part of this thesis, dissertation, or project report.
dc.subject	genotype calling discordance
dc.subject	population stratification
dc.subject	nonmetric multidimensional scaling
dc.subject	genome-wide association studies
dc.title	Addressing Sources of Bias in Genetic Association Studies

Files in this item

Files	Size	Format
etd.pdf	2.264Mb	application/pdf

This item appears in the following Collection(s)

NC State Theses and Dissertations [7248]
