This codebook.txt file was generated on 2023 09 08 by Shyam Thomas ------------------- GENERAL INFORMATION ------------------- Recommended citation for the data: Thomas, Shyam. (2021). Invasive Eurasian watermilfoil data from Minnesota lake point-intercept surveys between 1995 and 2019. Retrieved from the Data Repository for the University of Minnesota, https://doi.org/10.13020/yrds-h783. 1. Title of Dataset: Invasive eurasian watermilfoil data from Minnesota lake point-intercept surveys between 1995 and 2019 2. Author Information Principal Investigator Contact Information Name: Shyam Thomas Institution: University of Minnesota Email: thom7552@umn.edu ORCID: 0000-0003-0816-8601 3. Date of data collection: 1995 - 2019 4. Geographic location of data collection (where was data collected?): Minnesota 5. Information about funding sources that supported the collection of the data: Minnesota Aquatic Invasive Species Research Center (MAISRC) -------------------------- SHARING/ACCESS INFORMATION -------------------------- 1. Licenses/restrictions placed on the data: CC0 1.0 Universal 2. Links to publications that cite or use the data: Thomas, S.M. et al. (2021) Species distribution models for invasive Eurasian watermilfoil highlight the importance of data quality and limitations of discrimination accuracy metrics. Ecology & Evolution. https://doi.org/10.1002/ece3.8002 3. Links to other publicly accessible locations of the data: None 4. Links/relationships to ancillary data sets: 5. Was data derived from another source? Yes If yes, list source(s): MAISRC point intercept vegetation survey: Access link will be updated once published. MN DNR's Infested waters list: (https://www.dnr.state.mn.us/invasives/ais/infested.html) --------------------- DATA & FILE OVERVIEW --------------------- 1. File List A. Filename: EWM_prsabs_RF_SDMdata Short description: Eurasian watermilfoil occurrence dataset for surveyed and unsurveyed MN lakes B. Filename: EWM_abun_RF_SDMdata Short description: Eurasian watermilfoil abundance dataset for surveyed MN lakes 2. Relationship between files: File B contains a subset of data contained in File A. File B contains data with the additional field "MYS_relfrq" that denotes the abundance of EWM in invaded locations. 3. Additional related data collected that was not included in the current data package: None. 4. Are there multiple versions of the dataset? No -------------------------- METHODOLOGICAL INFORMATION -------------------------- 1. Description of methods used for collection/generation of data: Publication link: https://doi.org/10.1002/ece3.8002 Github link: https://github.com/ShyamThomas/Watermilfoil_RF_SDMs 2. Methods for processing the data: Multiple sources of data was merged and joined using R data manipulation packages.Details of the data manipulation can be found in the Methods section of the above publication / github documentation. 3. Instrument- or software-specific information needed to interpret the data: All files can be opened and read using MS Excel 4. Standards and calibration information, if appropriate: None. 5. Environmental/experimental conditions: Observational data collected across the state of MN. 6. Describe any quality-assurance procedures performed on the data: NA 7. People involved with sample collection, processing, analysis and/or submission: All authors of the assoicated publication (link above) ----------------------------------------- DATA-SPECIFIC INFORMATION FOR: EWM_prsabs_RF_SDMdata ----------------------------------------- 1. Number of variables: 15 2. Number of cases/rows: 1870 3. Missing data codes: None. 4. Variable List A. Name: LON Description: Longitude in degree decimals B. Name: LAT Description: Latitude in degree decimals C. Name: EWMSTATUS_corrRelFrq Description: EWM invasion status (binary field) indicating presence (1) or absence (0) in lakes D. Name: max_depth Description: Maximum depth of lake (meters) E. Name: size_acres Description: Size of lake in acres F. Name: avg_ph Description: average lake pH G. Name: avg_secchi Description: Lake Secchi depth (meters) H. Name: avg_conductance Description: average conductance of lake water (uS/cm) I. Name: avg_phosphorus Description: average phospurus levels in lake (mg/L) J. Name: avg_chlorophylla Description: average levels of cholorphyll a in lakes (ug/L) K. Name: a440_cdom Description: absorbance at 440 nm of colored dissolved organic matter (m^-1) L. Name: mean.gdd Description: Mean growing degree days (base 10 degrees Celsius, degr) M. Name: allstreams_density_mperha Description: stream density in 500 m buffer (meters / ha) N. Name: Model Description: Name of the model type. Categorical variable with 4 levels (PrsAbs, random, Proximal, Distant) ----------------------------------------- DATA-SPECIFIC INFORMATION FOR: EWM_abun_RF_SDMdata ----------------------------------------- 1. Number of variables:14 2. Number of cases/rows: 393 3. Missing data codes: None. 4. Variable List A. Name: LON Description: Longitude in degree decimals B. Name: LAT Description: Latitude in degree decimals C. Name: EWM_relfrq Description: EWM abundance (contunuous field) is realtive measure of EWM occurrence with values that can range from complete absence (0) to dominating presence (1.0) D. Name: max_depth Description: Maximum depth of lake (meters) E. Name: size_acres Description: Size of lake in acres F. Name: avg_ph Description: average lake pH G. Name: avg_secchi Description: Lake Secchi depth (meters) H. Name: avg_conductance Description: average conductance of lake water (uS/cm) I. Name: avg_phosphorus Description: average phospurus levels in lake (mg/L) J. Name: avg_chlorophylla Description: average levels of cholorphyll a in lakes (ug/L) K. Name: a440_cdom Description: absorbance at 440 nm of colored dissolved organic matter (m^-1) L. Name: mean.gdd Description: Mean growing degree days (base 10 degrees Celsius, degr) M. Name: allstreams_density_mperha Description: stream density in 500 m buffer (meters / ha)