Methodologies in Dimension Reduction and Error Variance Estimation of High-dimensional Regressions

Loading...
Thumbnail Image

Persistent link to this item

Statistics
View Statistics

Published Date

Publisher

Abstract

This thesis discusses several methodologies in dimension reduction and error variance estimation of high-dimensional regression models. The models of interest mainly include the principal components regression, high-dimensional additive regression, and high-dimensional varying-coefficients regression. For dimension reduction, we address the ``blind selection" issue in the conventional principal components regression and provide a simple regression method to solve this issue. The conventional principal components regression retains the principal components with large variances and discards those with small variances. This principal components selection process can easily lead to a poor prediction when the response variable is highly correlated to principal components with small variances. We propose a simple remedy named response-guided principal components regression (RgPCR) that selects principal components for regression based on both the variance of principal components and the goodness of fit to the response. RgPCR is easy to implement without using any optimization algorithms and works naturally for both low dimensional and high dimensional data. We derive a Cp type statistic for selecting the tuning parameter in RgPCR. For high-dimensional regressions, we focus on the error variance estimation of high-dimensional additive regressions and high-dimensional varying-coefficient regressions. The error variance estimation serves a crucial role in the statistical inference of regression models, and it is challenging under high-dimensional settings. We propose a novel error variance estimation procedure for high-dimensional additive regressions and high-dimensional varying-coefficient regressions. As part of the estimation procedure, we propose the scaled group-Lasso regression, which accomplishes the variable selection, coefficients estimation, and error variance estimation jointly in one stage. We provide a fast computing algorithm of the scaled group-Lasso regression through groupwise coordinate descent. We also discuss other candidate error variance estimators of high-dimensional additive regressions and high-dimensional varying-coefficient regressions. We present extensive simulations and real data studies to evaluate the performance of error variance estimators under various model settings.

Keywords

Description

University of Minnesota Ph.D. dissertation. October 2021. Major: Statistics. Advisor: Hui Zou. 1 computer file (PDF); xi, 107 pages.

Related to

item.page.replaces

License

Collections

Series/Report Number

Funding Information

item.page.isbn

DOI identifier

Previously Published Citation

Other identifiers

Suggested Citation

Lang, Wenjun. (2021). Methodologies in Dimension Reduction and Error Variance Estimation of High-dimensional Regressions. Retrieved from the University Digital Conservancy, https://hdl.handle.net/11299/257113.

Content distributed via the University Digital Conservancy may be subject to additional license and use restrictions applied by the depositor. By using these files, users agree to the Terms of Use. Materials in the UDC may contain content that is disturbing and/or harmful. For more information, please see our statement on harmful content in digital repositories.