Model Selection in Estimation of Fitness Landscapes
2009-07-06
Loading...
View/Download File
Persistent link to this item
Statistics
View StatisticsJournal Title
Journal ISSN
Volume Title
Title
Model Selection in Estimation of Fitness Landscapes
Authors
Published Date
2009-07-06
Publisher
School of Statistics, University of Minnesota
Type
Report
Abstract
A solution to the problem of estimating fitness landscapes was proposed by Lande
and Arnold (1983). Another solution, which avoids problematic aspects of the
Lande-Arnold methodology, was proposed by Shaw, Geyer, Wagenius, Hangelbroek, and Etterson (2008), who also provided an illustrative example involving real data.
An earlier technical report (Geyer and Shaw, 2008) gave an example that was simpler in some ways (the data are simulated from the aster model so there are no issues making the
data fit the model one has with real data) and much more complicated in others (each
individual has five measured components of fitness over four time periods, 20 variables
in all) and illustrates the full richness possible in aster analysis of fitness landscapes. The one issue that technical report did not deal with is model selection. When many phenotypic variables are measured, one often does not know which to put in the model. Lande and Arnold (1983) proposed using principal components regression as a method of dimension reduction, but this method is known to have no theoretical basis. Much of late 20th century and 21st century statistics is about model selection and model averaging, and we apply some of this methodology (which does have strong theoretical basis) to estimation of fitness landscapes using another simulated data set.
All analyses are done in R (R Development Core Team, 2008) using the aster contributed
package described by Geyer, Wagenius and Shaw (2007) except for analyses in
the style of Lande and Arnold (1983), which use ordinary least squares regression. Furthermore, all analyses are done using the Sweave function in R, so this entire technical report and all of the analyses reported in it are completely reproducible by anyone who has R with the aster package installed and the R noweb file specifying the document.
This revision corrects major errors in the frequentist model averaging calculations
(Section 8) in the first version of the technical report.
Keywords
Description
Related to
Replaces
License
Collections
Series/Report Number
Technical Report
671 revised
671 revised
Funding information
Isbn identifier
Doi identifier
Previously Published Citation
Other identifiers
Suggested citation
Geyer, Charles J.; Shaw, Ruth G.. (2009). Model Selection in Estimation of Fitness Landscapes. Retrieved from the University Digital Conservancy, https://hdl.handle.net/11299/56219.
Content distributed via the University Digital Conservancy may be subject to additional license and use restrictions applied by the depositor. By using these files, users agree to the Terms of Use. Materials in the UDC may contain content that is disturbing and/or harmful. For more information, please see our statement on harmful content in digital repositories.