A Monte Carlo Study of the Effects of Number of Clusters and Level-2 Residual Distributions on Multilevel Models
2021-11
Type
Thesis or Dissertation
Abstract
Hierarchical Linear Modeling (HLM) has become an important approach to analyzing hierarchically structured data, which are common in educational research. However, the accuracy of HLM estimators and the precision of the resulting statistical inferences rely heavily on a sufficiently large number of clusters (J), as well as on the normality assumption for the residual distributions. The current study had two purposes. The first was to synthesize the existing Monte Carlo research literature and identify gaps in the recommended number of clusters. This synthesis prompted two research questions with important implications for educational data analyses involving HLM: 1) What is the minimum required number of clusters for accurate estimation of level-2 parameters when assumptions are satisfied? 2) What is the minimum required number of clusters for accurate estimation of level-2 parameters when assumptions are violated? Identifying a minimum J for realistic data conditions matters because clusters often require significant resources. The second purpose was to answer these research questions with a Monte Carlo study providing comprehensive recommendations for the minimum required sample size at level-2 of a two-level model for cross-sectional data. To fill the gaps in the previous literature, the study adopted Latin Hypercube Sampling in the design of the simulation so that the sample sizes at both levels (J and n_j) were randomly sampled from a wide range to mirror conditions commonly found in educational research. A total of 40 combinations of J and n_j × 3 levels of intraclass correlation (ICC) × 4 level-2 residual distributions × 4 covariate correlates = 1,920 combinations of conditions were studied. Bias in estimating fixed effects and variance components, assessed via ABs, ARBs, and ln(RMSE)s, as well as Type I error rates and statistical power for the corresponding statistical tests of those parameters, was investigated.
The results showed that the fixed effects estimates were unbiased and became more accurate as the number of clusters increased. A larger J was required for accurate Type I error rates of tests of fixed effects. In general, tests of the fixed effects had sufficiently large statistical power. In contrast, J > 75 was required for accurate variance components estimates and J > 100 was required for acceptable Type I error rates. Additionally, tests of variance components were underpowered unless the sample sizes at both levels were large (J > 100 and n_j > 30) and ICC was greater than .10. Finally, the study provided guidance on the minimum required sample size for future empirical research.
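The design arithmetic in the abstract (40 × 3 × 4 × 4 = 1,920) can be illustrated with a minimal sketch. This is not the dissertation's code: the ranges for J and n_j, the random seed, and all condition labels below are hypothetical placeholders, since the abstract does not report them. The sketch uses only the Python standard library, drawing 40 (J, n_j) pairs by a simple Latin hypercube scheme and then crossing them with the remaining factors.

```python
import itertools
import random


def latin_hypercube(n_points, bounds, rng):
    """Draw n_points integer-valued points, one per stratum in each dimension.

    bounds is a list of (low, high) inclusive integer ranges.
    """
    columns = []
    for low, high in bounds:
        # Split [0, 1) into n_points equal strata and draw once inside each.
        strata = [(i + rng.random()) / n_points for i in range(n_points)]
        # Shuffle so the strata are paired at random across dimensions.
        rng.shuffle(strata)
        # Map each uniform draw onto the inclusive integer range.
        columns.append([low + int(u * (high - low + 1)) for u in strata])
    return list(zip(*columns))


rng = random.Random(2021)  # arbitrary seed, for reproducibility only

# Hypothetical ranges; the abstract says only "a wide range".
J_RANGE = (10, 200)   # number of clusters, J
N_RANGE = (5, 60)     # units per cluster, n_j

sample_sizes = latin_hypercube(40, [J_RANGE, N_RANGE], rng)

# Placeholder labels; the abstract gives the counts, not the actual levels.
icc_levels = ["icc_1", "icc_2", "icc_3"]
residual_dists = ["dist_1", "dist_2", "dist_3", "dist_4"]
covariate_settings = ["cov_1", "cov_2", "cov_3", "cov_4"]

conditions = list(
    itertools.product(sample_sizes, icc_levels, residual_dists, covariate_settings)
)
print(len(conditions))  # 1920
```

Sampling (J, n_j) pairs this way spreads the 40 sample-size combinations across the whole range of each dimension instead of fixing a small grid of values, which is the motivation the abstract gives for using Latin Hypercube Sampling.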
Description
University of Minnesota Ph.D. dissertation. November 2021. Major: Educational Psychology. Advisor: Michael Harwell. 1 computer file (PDF); 442 pages.
Suggested citation
Jia, Hao. (2021). A Monte Carlo Study of the Effects of Number of Clusters and Level-2 Residual Distributions on Multilevel Models. Retrieved from the University Digital Conservancy, https://hdl.handle.net/11299/226401.