Constrained Likelihood Inference in Instrumental Variable Regression with Invalid Instruments and Its Application to GWAS Summary Data
2021-05
Loading...
View/Download File
Persistent link to this item
Statistics
View StatisticsJournal Title
Journal ISSN
Volume Title
Title
Constrained Likelihood Inference in Instrumental Variable Regression with Invalid Instruments and Its Application to GWAS Summary Data
Authors
Published Date
2021-05
Publisher
Type
Thesis or Dissertation
Abstract
There has been increasing interest in instrumental variables regression for causal inference. In genetics, transcriptome-wide association studies (TWAS), also known as PrediXcan, have recently emerged as a widely applied tool to discover causal/target genes by integrating an outcome GWAS dataset with another gene expression/ transcriptome GWAS (called eQTL) dataset; they can not only boost statistical power but also offer biological insights by identifying (putative) causal genes for a GWAS trait, e.g. low-density lipoprotein cholesterol (LDL). Statistically TWAS apply (two-sample) two-stage least squares (2SLS) with multiple correlated SNPs as instrumental variables (IVs) to predict/impute gene expression, in contrast to typical (two-sample) Mendelian randomization (MR) approaches using independent SNPs as IVs, which are expected to be lower-powered. However, some of the SNPs used may not be valid IVs as a result of their (horizontal) pleiotropic/direct effects on the trait not mediated through the gene of interest, leading to false conclusions by TWAS (or MR). We propose a general inferential method for possibly high-dimensional data to account for confounding and invalid IVs while selecting valid IVs simultaneously via two-stage constrained maximum likelihood; we develop a theory for the likelihood method subject to a truncated L1-constraint approximating the L0-constraint for asymptotically valid and efficient statistical inference on causal effects. We demonstrate both theoretically and numerically the superior performance of the proposed method over the standard 2SLS/TWAS and other methods. We apply the methods to identify causal genes for LDL by integrating GWAS summary data with eQTL data.
Description
University of Minnesota Ph.D. dissertation. May 2021. Major: Statistics. Advisors: Xiaotong Shen, Wei Pan. 1 computer file (PDF); viii, 102 pages.
Related to
Replaces
License
Collections
Series/Report Number
Funding information
Isbn identifier
Doi identifier
Previously Published Citation
Other identifiers
Suggested citation
Xue, Haoran. (2021). Constrained Likelihood Inference in Instrumental Variable Regression with Invalid Instruments and Its Application to GWAS Summary Data. Retrieved from the University Digital Conservancy, https://hdl.handle.net/11299/223167.
Content distributed via the University Digital Conservancy may be subject to additional license and use restrictions applied by the depositor. By using these files, users agree to the Terms of Use. Materials in the UDC may contain content that is disturbing and/or harmful. For more information, please see our statement on harmful content in digital repositories.