Between Dec 19, 2024 and Jan 2, 2025, datasets can be submitted to DRUM but will not be processed until after the break. Staff will not be available to answer email during this period, and will not be able to provide DOIs until after Jan 2. If you are in need of a DOI during this period, consider Dryad or OpenICPSR. Submission responses to the UDC may also be delayed during this time.
 

Integrative Statistical Methods in Genomics and Neuroimaging

2021-07
Loading...
Thumbnail Image

Persistent link to this item

Statistics
View Statistics

Journal Title

Journal ISSN

Volume Title

Title

Integrative Statistical Methods in Genomics and Neuroimaging

Authors

Published Date

2021-07

Publisher

Type

Thesis or Dissertation

Abstract

High-throughput data with complex structures are becoming common with advancing technologies. Multiple types of data are often collected within and across studies. In genomics, many genome-wide association studies (GWAS) have been conducted to identify genetic variants associated with disease. Most GWAS are conducted in deeply phenotyped cohorts with many correlated traits measured. In neuroimaging studies, for a sample of subjects, data are often measured over multiple dimensions, such as multiple tissues, brain regions, and time points. The variety and complexity of these data create an unprecedented demand for new statistical methods that can capture the underlying essence of the data by integrating related information across multiple facets. The first part of this dissertation focuses on developing integrative statistical models to identify novel disease-gene associations by exploiting the joint power of multiple GWASs. I first propose a statistical method that can integrate multiple related traits to identify novel genetic variants that are associated with various disease and traits. This work is then extended to a joint model that can integrate both multiple traits and multiple variants, which are expected to have improved power to identify disease-associated genes. These two proposed methods are developed based on publicly available GWAS summary data without accessing individual-level genotype and phenotype data. We have found many novel and interesting genes that may help to advance our understanding of genetic architecture of human diseases and traits. The second part of this dissertation is on integrative analysis of multi-dimensional neuroimaging data. A common problem in this area is to distinguish groups of subjects based on a large number of characteristics. Most existing classification methods are designed for two-way data matrix, and are not feasible to use or have poor performance when the data have multi-dimensional structure. I propose a general framework for multiway classification which is applicable to any number of dimensions and is able to enforce sparsity in the model. Promising future directions of this research include integrating multiple related traits to improve genetic risk prediction and integrating multi-dimensional neuroimaging data from multiple modalities to further improve classification accuracy.

Keywords

Description

University of Minnesota Ph.D. dissertation. July 2021. Major: Biostatistics. Advisors: Baolin Wu, Lynn Eberly. 1 computer file (PDF); xxv 150 pages.

Related to

Replaces

License

Collections

Series/Report Number

Funding information

Isbn identifier

Doi identifier

Previously Published Citation

Other identifiers

Suggested citation

Guo, Bin. (2021). Integrative Statistical Methods in Genomics and Neuroimaging. Retrieved from the University Digital Conservancy, https://hdl.handle.net/11299/224582.

Content distributed via the University Digital Conservancy may be subject to additional license and use restrictions applied by the depositor. By using these files, users agree to the Terms of Use. Materials in the UDC may contain content that is disturbing and/or harmful. For more information, please see our statement on harmful content in digital repositories.