Browsing by Subject "Longitudinal Data"
Now showing 1 - 2 of 2
- Results Per Page
- Sort Options
Item Misspecification of the covariance matrix in the linear mixed model: a Monte Carlo simulation(2013-02) LeBeau, Brandon C.The linear mixed model has become a popular method for analyzing longitudinal and cross sectional data due to its ability to overcome many of the limitations found using classical methods such as repeated measures analysis of variance or multivariate analysis of variance. Although the linear mixed model allows for flexible modeling of clustered data, the simulation research literature is not nearly as extensive as classical methods. This current study looks to add to this literature and the statistical properties associated with the linear mixed model under longitudinal data conditions. Historically when using the linear mixed model to analyze longitudinal data, researchers have allowed the random effects to solely account for the dependency due to repeated measurements. This dependency arises in this case, from repeated measurements on the same individual and measurements taken closer in time would be more correlated than measurements taken further apart in time. If measurements are taken close in time (i.e. every hour, daily, weekly, etc.), the random effects alone may not adequately account for the dependency due to repeated measurements. In this case serial correlation may be present and need to be modeled. Previous simulation work exploring the effects of misspecification of serial correlation have shown that the fixed effects tend to be unbiased, however evidence of bias show up in the variance of the random components of the model. In addition, some evidence of bias was found in the standard errors of the fixed effects. These simulation studies were done with all other model conditions being "perfect," including normally distributed random effects and larger sample size. The current simulation study looks to generalize to a wider variety of data conditions. The current simulation study used a factorial design with four simulation conditions manipulated. These included: covariance structure, random effect distribution, number of subjects, and number of measurement occasions. Relative bias of the fixed and random components were explored descriptively and inferentially. In addition, the type I error rate was explored to examine any impact the simulation conditions had on the robustness of hypothesis testing. A second smaller study was also conducted that explicitly misspecified the random slope for time to see if serial correlation could overcome the misspecification of that random effect. Results for the larger simulation study found no bias in the fixed effects. There was however evidence of bias in the random components of the model. The fitted and generated serial correlation structures as well as their interaction explained significant variation in the bias of the random components. The largest amounts of bias were found when the fitted structure was underspecified as independent. Type I error rates for the five fixed effects were just over 0.05, with many around 0.06. Many of the simulation conditions explained significant variation in the empirical type I error rates. Study two again found no bias in the fixed effects. Just as in study one, bias was found in the random components and the fitted and generated serial correlation structures as well as the interaction between the two explaining significant variation in the relative bias statistics. Of most concern were the severely inflated type I error rates for the fixed effects associated with the slope terms. The average type I error rate was on average twice what would be expected and ranged as high as 0.25. The fitted serial correlation structure and the interaction between the fitted and generated serial correlation structure explained significant variation in these terms. More specifically, when the serial correlation was underspecified as independent in conjunction with a missing random effect for time, the type I error rate can become severely inflated. Serial correlation does not appear to bias the fixed effects, therefore if point estimates are all that are desired serial correlation does not need to be modeled. However, if estimates of the random components or inference are concerned care needs to be taken to at least include serial correlation in the model when it is found in the data. In addition, if serial correlation is present and the model is misspecified without the random effect for time serious distortions of the empirical type I error rate occur. This would lead to rejecting many more true null hypotheses which would make conclusions extremely uncertain.Item Model selection with the linear mixed effects model for longitudinal data.(2010-05) Ryoo, Ji HoonModel building or model selection with linear mixed models (LMM) is complicated by the presence of both fixed effects and random effects. The fixed effects structure and random effects structure are co-dependent, so selection of one influences the other. Most presentations of LMM in psychology and education are based on a multi-level or hierarchical approach in which the variance-covariance matrix of the random effects is assumed to be positive definite with non-zero values for the variances. When the number of fixed effects and random effects is not known, the predominant approach to model building is a step-up procedure in which one starts with a limited model (e.g., few fixed and random intercepts) and then additional fixed effects and random effects are added based on statistical tests. A procedure that has received less attention in psychology and education is top-down model building. In the top-down procedure, the initial model has a single random intercept but is loaded with fixed effects (also known as an ”over-elaborate” model). Based on the over-elaborate fixed effects model, the need for additional random effects is determined. Once the number of random effects is selected, the fixed effects are tested to see if any can be omitted from the model. There has been little if any examination of the ability of these procedures to identify a true population model (i.e., identifying a model that generated the data). The purpose of this dissertation is to examine the performance of the various model building procedures for exploratory longitudinal data analysis. Exploratory refers to the situation in which the correct number of fixed effects and random effects is unknown before the analysis.