Supporting data for The Fate of Deleterious Variants in a Barley Genomic Prediction Population
2019-10-21
Loading...
Persistent link to this item
Statistics
View StatisticsKeywords
Collection period
2006
2014
2014
Date completed
Date updated
Time period coverage
Geographic coverage
Source information
Journal Title
Journal ISSN
Volume Title
Title
Supporting data for The Fate of Deleterious Variants in a Barley Genomic Prediction Population
Published Date
2019-10-21
Author Contact
Morrell, Peter L
pmorrell@umn.edu
pmorrell@umn.edu
Type
Dataset
Genomics Data
Genomics Data
Abstract
Targeted identification and purging of segregating deleterious genetic variants has been proposed as a novel approach to plant breeding. This approach is motivated in part by the observation that demographic events and strong selection associated with cultivated species pose a “cost of domestication.” This includes an increase in the proportion of genetic variants at phylogenetically-constrained sites where a mutation is likely to reduce fitness. Recent advances in DNA resequencing technology and sequence constraint-based approaches to predict the functional impact of a mutation now permit the identification of putatively deleterious SNPs (dSNPs) on a genome-wide scale. Using exome capture resequencing of 21 barley 6-row spring breeding lines, we identify 3,855 dSNPs among 497,754 total SNPs. The dSNPs are more frequent in portions of the genome with a higher recombination rate, as measured by cM/Mb, than in pericentromeric regions with lower recombination rate and gene density. Using 5,215 progeny from a genomic prediction experiment, we examine the fate of dSNPs over three breeding cycles. Average derived allele frequency is lower for dSNPs than any other class of variants. Adjusting for frequency, derived alleles at dSNPs reduce in frequency or are lost more frequently than other classes of SNPs. Using a linear mixed model applied to 677 lines phenotyped at 5 year-locations, we find that a genomic region with the strongest association with a fungal disease resistance trait that was selected for in the population also negatively impacts yield. Finally, the highest yielding lines in the experiment, as chosen by standard genomic prediction approaches, carry fewer homozygous dSNPs than randomly selected lines.
Description
Referenced by
Related to
Replaces
item.page.isreplacedby
Publisher
Collections
Funding information
item.page.sponsorshipfunderid
item.page.sponsorshipfundingagency
item.page.sponsorshipgrant
Previously Published Citation
Other identifiers
Suggested citation
Morrell, Peter L; Smith, Kevin P; Vonderharr, Emily E; Kono, Thomas John Y; Fay, Justin C; Koenig, Daniel. (2019). Supporting data for The Fate of Deleterious Variants in a Barley Genomic Prediction Population. Retrieved from the Data Repository for the University of Minnesota (DRUM), https://doi.org/10.13020/d6w990.
View/Download File
File View/Open
Description
Size
README.txt
(17.42 KB)
50x_Capture.bed
BED file that describes regions that are covered by the liquid-phase exome capture probe set designed by Mascher et al. (2013)
(3.24 MB)
ALCHEMY_Raw_Data_and_Calls.tar.bz2
Raw intensities and genotype probabilities for a subset of the CAP lines that contain the founders of the experimental population
(15.69 MB)
Adjusted_Phenotypic_Data_800Lines.csv
Spatially adjusted best linear unbiased estimates (BLUEs) for yield (kg/ha), DON concentration (ppm), and plant height (cm)
(63.87 KB)
GP_AP.bed
Genotypes for exome capture variants imputed onto progeny with AlphaPeel. Files are in PLINK format and the bed/bim/fam must be kept together.
(611.76 MB)
GP_AP.bim
Genotypes for exome capture variants imputed onto progeny with AlphaPeel. Files are in PLINK format and the bed/bim/fam must be kept together.
(22.03 MB)
GP_AP.fam
Genotypes for exome capture variants imputed onto progeny with AlphaPeel. Files are in PLINK format and the bed/bim/fam must be kept together.
(273.78 KB)
BOPA_cMMb.txt.gz
LOWESS-smoothed recombination rate estimates across the Morex draft assembly
(33.42 KB)
DON_LMM.assoc.txt.gz
SNP-by-SNP regression coefficients and P-values for the linear mixed model association implemented in GEMMA. The trait used for association was DON concentration.
(11.73 MB)
ExomeCaptureTargets_per_Mb.txt.gz
The number of exome capture targets in non-overlapping 1Mb-windows.
(23.64 KB)
GATK_Capture_WithID.vcf.gz
The VCF of variants identified by exome capture resequencing of the 21 founder lines
(76.44 MB)
GP_BOPA_Physical.bed
PLINK files that contain the 377 SNPs genotyped on the fixed SNP platform
(480.65 KB)
GP_BOPA_Physical.bim
PLINK files that contain the 377 SNPs genotyped on the fixed SNP platform
(12.8 KB)
GP_BOPA_Physical.fam
PLINK files that contain the 377 SNPs genotyped on the fixed SNP platform
(273.23 KB)
GP_ExomeCap_Functional_Annotation.gz
Functional annotation for each variant identified in exome capture resequencing of the founder lines
(11.73 MB)
Height_LMM.assoc.txt.gz
SNP-by-SNP regression coefficients and P-values for the linear mixed model association implemented in GEMMA. The trait used for association was plant height.
(11.78 MB)
Representative_Transcript_IDs.txt.gz
Transcript IDs of the representative transcripts from each barley gene
(118.83 KB)
Representative_Transcripts.gtf.gz
Position and strand annotation in a GTF for the representative transcripts.
(4.67 MB)
T3_Full_Pedigree.csv
Pedigree information for each family in this population
(7.51 KB)
Yield_LMM.assoc.txt.gz
SNP-by-SNP regression coefficients and P-values for the linear mixed model association implemented in GEMMA. The trait used for association was grain yield.
(11.78 MB)
Content distributed via the University Digital Conservancy may be subject to additional license and use restrictions applied by the depositor. By using these files, users agree to the Terms of Use. Materials in the UDC may contain content that is disturbing and/or harmful. For more information, please see our statement on harmful content in digital repositories.