Supporting data for The Fate of Deleterious Variants in a Barley Genomic Prediction Population

Loading...
Thumbnail Image
Statistics
View Statistics

Keywords

Collection period

2006
2014

Date completed

Date updated

Time period coverage

Geographic coverage

Source information

Journal Title

Journal ISSN

Volume Title

Title

Supporting data for The Fate of Deleterious Variants in a Barley Genomic Prediction Population

Published Date

2019-10-21

Author Contact

Morrell, Peter L
pmorrell@umn.edu

Type

Dataset
Genomics Data

Abstract

Targeted identification and purging of segregating deleterious genetic variants has been proposed as a novel approach to plant breeding. This approach is motivated in part by the observation that demographic events and strong selection associated with cultivated species pose a “cost of domestication.” This includes an increase in the proportion of genetic variants at phylogenetically-constrained sites where a mutation is likely to reduce fitness. Recent advances in DNA resequencing technology and sequence constraint-based approaches to predict the functional impact of a mutation now permit the identification of putatively deleterious SNPs (dSNPs) on a genome-wide scale. Using exome capture resequencing of 21 barley 6-row spring breeding lines, we identify 3,855 dSNPs among 497,754 total SNPs. The dSNPs are more frequent in portions of the genome with a higher recombination rate, as measured by cM/Mb, than in pericentromeric regions with lower recombination rate and gene density. Using 5,215 progeny from a genomic prediction experiment, we examine the fate of dSNPs over three breeding cycles. Average derived allele frequency is lower for dSNPs than any other class of variants. Adjusting for frequency, derived alleles at dSNPs reduce in frequency or are lost more frequently than other classes of SNPs. Using a linear mixed model applied to 677 lines phenotyped at 5 year-locations, we find that a genomic region with the strongest association with a fungal disease resistance trait that was selected for in the population also negatively impacts yield. Finally, the highest yielding lines in the experiment, as chosen by standard genomic prediction approaches, carry fewer homozygous dSNPs than randomly selected lines.

Description

Referenced by

Related to

Replaces

item.page.isreplacedby

Publisher

Funding information

item.page.sponsorshipfunderid

item.page.sponsorshipfundingagency

item.page.sponsorshipgrant

Previously Published Citation

Other identifiers

Suggested citation

Morrell, Peter L; Smith, Kevin P; Vonderharr, Emily E; Kono, Thomas John Y; Fay, Justin C; Koenig, Daniel. (2019). Supporting data for The Fate of Deleterious Variants in a Barley Genomic Prediction Population. Retrieved from the Data Repository for the University of Minnesota (DRUM), https://doi.org/10.13020/d6w990.

View/Download File

File View/Open
Description
Size
README.txt
(17.42 KB)

50x_Capture.bed
BED file that describes regions that are covered by the liquid-phase exome capture probe set designed by Mascher et al. (2013)
(3.24 MB)

ALCHEMY_Raw_Data_and_Calls.tar.bz2
Raw intensities and genotype probabilities for a subset of the CAP lines that contain the founders of the experimental population
(15.69 MB)

Adjusted_Phenotypic_Data_800Lines.csv
Spatially adjusted best linear unbiased estimates (BLUEs) for yield (kg/ha), DON concentration (ppm), and plant height (cm)
(63.87 KB)

GP_AP.bed
Genotypes for exome capture variants imputed onto progeny with AlphaPeel. Files are in PLINK format and the bed/bim/fam must be kept together.
(611.76 MB)

GP_AP.bim
Genotypes for exome capture variants imputed onto progeny with AlphaPeel. Files are in PLINK format and the bed/bim/fam must be kept together.
(22.03 MB)

GP_AP.fam
Genotypes for exome capture variants imputed onto progeny with AlphaPeel. Files are in PLINK format and the bed/bim/fam must be kept together.
(273.78 KB)

BOPA_cMMb.txt.gz
LOWESS-smoothed recombination rate estimates across the Morex draft assembly
(33.42 KB)

DON_LMM.assoc.txt.gz
SNP-by-SNP regression coefficients and P-values for the linear mixed model association implemented in GEMMA. The trait used for association was DON concentration.
(11.73 MB)

ExomeCaptureTargets_per_Mb.txt.gz
The number of exome capture targets in non-overlapping 1Mb-windows.
(23.64 KB)

GATK_Capture_WithID.vcf.gz
The VCF of variants identified by exome capture resequencing of the 21 founder lines
(76.44 MB)

GP_BOPA_Physical.bed
PLINK files that contain the 377 SNPs genotyped on the fixed SNP platform
(480.65 KB)

GP_BOPA_Physical.bim
PLINK files that contain the 377 SNPs genotyped on the fixed SNP platform
(12.8 KB)

GP_BOPA_Physical.fam
PLINK files that contain the 377 SNPs genotyped on the fixed SNP platform
(273.23 KB)

GP_ExomeCap_Functional_Annotation.gz
Functional annotation for each variant identified in exome capture resequencing of the founder lines
(11.73 MB)

Height_LMM.assoc.txt.gz
SNP-by-SNP regression coefficients and P-values for the linear mixed model association implemented in GEMMA. The trait used for association was plant height.
(11.78 MB)

Representative_Transcript_IDs.txt.gz
Transcript IDs of the representative transcripts from each barley gene
(118.83 KB)

Representative_Transcripts.gtf.gz
Position and strand annotation in a GTF for the representative transcripts.
(4.67 MB)

T3_Full_Pedigree.csv
Pedigree information for each family in this population
(7.51 KB)

Yield_LMM.assoc.txt.gz
SNP-by-SNP regression coefficients and P-values for the linear mixed model association implemented in GEMMA. The trait used for association was grain yield.
(11.78 MB)

Content distributed via the University Digital Conservancy may be subject to additional license and use restrictions applied by the depositor. By using these files, users agree to the Terms of Use. Materials in the UDC may contain content that is disturbing and/or harmful. For more information, please see our statement on harmful content in digital repositories.