Browsing by Subject "Single Nucleotide Polymorphisms"

Now showing 1 - 2 of 2

Context-Driven Prior Distributions in Genome–Wide Association Studies, Medical Device Adaptive Clinical Trials, and Genetic Fine-Mapping
(2021-02) Kaplan , Adam
Present research has gravitated towards making inferences from high-dimensional data, the scenario when we have considerably more variables than the number of observations we have to estimate their effects on a given outcome, and standard statistical methodology cannot be used here. Models assuming that a majority of these variables do not associate with the outcome, or shrinkage models, have been used for such situations. However, most standard shrinkage models tend to ignore the natural dependencies within high-dimensional data. For instance, geneticists have well understood that rare mutations along the chromosome are correlated, and this correlation decreases as the spatial distance between the mutations increases. As a result, treating these variants as independent from one another misses supplementing the estimation of their relationships with the outcome, with this contextual information. In this dissertation, instead, we present models assuming that the variables’ effects are dependent on each other which appends the shortage in observations. Specifically, we introduce this version of shrinkage models in the contexts of a clinical trial for optimizing a medical device for one patient and detecting genetic variants that are associated with a disease.
Maize Mo17 SNPs
(2018-07-11) Zhou, Peng; zhoux379@umn.edu; Zhou, Peng; University of Minnesota Springer Lab
Genome resequencing of Mo17 was done as part of the bioMAP project (REF). 477 million 100bp paired-end reads were generated for Mo17 giving an average of 95x coverage. Reads were first trimmed by Trimmomatic (Bolger et al. 2014) and mapped to the maize B73 genome AGPv4 (Jiao et al. 2017) using BWA-MEM (Li and Durbin 2010). PCR duplicates were marked and removed using GATK (McKenna et al. 2010). Variants were called by GATK haplotypecaller and filtered using different filters for SNPs: (QD > 2, FS < 60, MQ > 40, MQRankSum > -12.5, ReadPosRankSum > -8, SOR < 4) and for InDels: (QD > 2, FS < 200, ReadPosRankSum > -20, SOR < 10). Moreover, variants located in regions with unusually high coverage (DP > Mean + 2*SD) and heterozygous calls (GT == '0/1') were also removed. The final variant file containing 8.04 million variants with 164 thousand CDS variants was deposited in DRUM.

University Digital Conservancy

Browse by Subject

Browsing by Subject "Single Nucleotide Polymorphisms"