Datta, Abhirup2018-08-142018-08-142016-05https://hdl.handle.net/11299/199089University of Minnesota Ph.D. dissertation. 2016. Major: Biostatistics. Advisors: Sudipto Banerjee, Hui Zou. 1 computer file (PDF); 175 pages.Modern technological advancements have enabled massive-scale collection, processing and storage of information triggering the onset of the `big data' era where in every two days now we create as much data as we did in the entire twentieth century. This thesis aims at developing novel statistical methods that can efficiently analyze a variety of large complex datasets. Underlying the umbrella theme of big data modeling, we present statistical methods for two different classes of large complex datasets. The first half of the thesis focuses on the 'large n' problem for large spatial or spatio-temporal datasets where observations exhibit strong dependencies across space and time. In the second half of this thesis we present methods for high-dimensional regression in the `large p small n' setting for datasets that contain measurement errors or change points.enBig dataHigh dimensional dataLarge spatial dataStatistical Methods for Large Complex DatasetsThesis or Dissertation