Clustering Methods for Correlated Data

Loading...
Thumbnail Image

Persistent link to this item

Statistics
View Statistics

Journal Title

Journal ISSN

Volume Title

Title

Clustering Methods for Correlated Data

Published Date

2022-08

Publisher

Type

Thesis or Dissertation

Abstract

Hierarchical Clustering is one of the most popular unsupervised clustering methods.Using a simple agglomerative algorithm, it iteratively combines similar clusters together forming cohesive groups of observations. This work focuses on Hierarchical Clustering and how it may be adapted to accommodate correlated observations. Chapter 2 investigates how to develop a statistical framework for Hierarchical Clustering so we may derive statistical properties from the clustering method. In Chapter 3, a new method, Hierarchical Cohesion Clustering is proposed. This method is a modification of the traditional methods which aims to accommodate correlated observations. This approach explores how repeated measurements may be preprocessed into intermediate clusters to improve clustering outcomes. The method is applied to a sequence-based time use dataset about how people spend their time throughout the day. In Chapter 4, we focus on how to incorporate spatial adjacency data when clustering. We continue to investigate Hierarchical Clustering methods, with a special focus on Hierarchical Cohesion Clustering. Applying the collection of methods to COVID-19 case rate data within counties, a comparison of the methods is performed with summaries of their respective strengths and weaknesses. Spatial simulations are included to better determine each approach’s efficacy and when certain approaches are preferable.

Description

University of Minnesota Ph.D. dissertation. August 2022. Major: Biostatistics. Advisor: Julian Wolfson. 1 computer file (PDF); 183 pages.

Related to

Replaces

License

Collections

Series/Report Number

Funding information

Isbn identifier

Doi identifier

Previously Published Citation

Other identifiers

Suggested citation

Becker, Andrew. (2022). Clustering Methods for Correlated Data. Retrieved from the University Digital Conservancy, https://hdl.handle.net/11299/243081.

Content distributed via the University Digital Conservancy may be subject to additional license and use restrictions applied by the depositor. By using these files, users agree to the Terms of Use. Materials in the UDC may contain content that is disturbing and/or harmful. For more information, please see our statement on harmful content in digital repositories.