This ReadMe.txt file was generated on 2024-03-11 by Katelin M. Goebel

-------------------
GENERAL INFORMATION
-------------------

1. Title of dataset: Data and R code supporting: Effects of insecticide spray drift on arthropod prey resources of birds in grasslands in Minnesota

2. Authors:

   Katelin M. Goebel, Minnesota Cooperative Fish and Wildlife Research Unit, Department of Fisheries, Wildlife, and Conservation Biology, University of Minnesota; ORCID iD 0000-0001-7464-8676

   David E. Andersen, U.S. Geological Survey, Minnesota Cooperative Fish and Wildlife Research Unit, Department of Fisheries, Wildlife, and Conservation Biology, University of Minnesota; ORCID iD 0000-0001-9535-3404

   Pamela J. Rice, Agricultural Research Service, U.S. Department of Agriculture, University of Minnesota

   Nicole M. Davros, Farmland Wildlife Populations and Research Group, Minnesota Department of Natural Resources; ORCID iD 0000-0003-2560-8891

3. Date published or finalized for release: 2024-03-11

4. Date of data collection: 2017-2019

5. Geographic location of data collection: Minnesota, USA

6. Funding sources that supported the collection of the data: This project was funded by the Environment and Natural Resources Trust Fund as recommended by the Legislative-Citizen Commission on Minnesota Resources (ML 2016, Ch. 186, Sec. 2, Subd. 03n). The Minnesota Department of Natural Resources provided logistical support and contributed funding through the Federal Aid in Wildlife Restoration Act (Grant F19AF00431/ W-71-R-7). This research was also supported by the Minnesota Cooperative Fish and Wildlife Research Unit (cooperators include the U.S. Geological Survey, Minnesota Department of Natural Resources, the Wildlife Management Institute, the U.S. Fish and Wildlife Service, and the University of Minnesota).

7. Overview of the data (abstract): This repository contains the data and R code used to conduct the analyses in the article "Effects of insecticide spray drift on arthropod prey resources of birds in grasslands in Minnesota" published in The Journal of Wildlife Management.

--------------------------
SHARING/ACCESS INFORMATION
--------------------------

1. Licenses/restrictions placed on the data: Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International

2. Publication that cites or uses the data: Goebel, K. M., D. E. Andersen, P. J. Rice, and N. M. Davros. 2024. Effects of insecticide spray drift on arthropod prey resources of birds in grasslands in Minnesota. The Journal of Wildlife Management.

3. Were data derived from another source? No

4. Terms of Use: Data Repository for the U of Minnesota (DRUM). By using these files, users agree to the Terms of Use. https://conservancy.umn.edu/pages/drum/policies/#terms-of-use

---------------------
DATA & FILE OVERVIEW
---------------------

1. File list:

   A. Filename: 20230303_combined_id_meas_manually_cleaned.csv
      Short description: Arthropod ID and measuring data

   B. Filename: 20190610_all_veg_cc.csv
      Short description: Vegetation data

   C. Filename: 20240311_JWM_model_abund_sqrt.R
      Short description: R script for total abundance models

   D. Filename: 20240311_JWM_model_bio_sqrt.R
      Short description: R script for total biomass models

   E. Filename: 20240311_JWM_model_bp_abund_sqrt.R
      Short description: R script for bird prey abundance models

   F. Filename: 20240311_JWM_model_bp_bio_sqrt.R
      Short description: R script for bird prey biomass models

   G. Filename: 20240311_JWM_model_fam_ara_poisson.R
      Short description: R script for family richness of Araneae models

   H. Filename: 20240311_JWM_model_fam_col_poisson.R
      Short description: R script for family richness of Coleoptera models

   I. Filename: 20240311_JWM_model_fam_hem_poisson.R
      Short description: R script for family richness of Hemiptera models

   J. Filename: 20240311_JWM_model_fam_ort_poisson.R
      Short description: R script for family richness of Orthoptera models

   K. Filename: 20240218_biomass_abund_figs_JWM.R
      Short description: R script for figure of arthropod abundance, biomass, and family richness (untransformed data)

   L. Filename: 20240219_ttest_abund_order.R
      Short description: R script for Welch’s 2-sample t-tests comparing total abundance of arthropods

   M. Filename: 20240219_ttest_biomass_order.R
      Short description: R script for Welch’s 2-sample t-tests comparing total biomass of arthropods

   N. Filename: 20240219_insect_sample_percentages.R
      Short description: R script for percentages of total arthropod abundance and biomass by taxon

   O. Filename: 20240219_veg_summary_JWM.R
      Short description: R script for means and standard deviations of vegetation measurements

   P. Filename: 20240219_insect_abundance_tables.R
      Short description: R script for arthropod abundance means and standard deviations on primary and supplementary transects

   Q. Filename: 20240219_insect_biomass_tables.R
      Short description: R script for arthropod biomass means and standard deviations on primary and supplementary transects

2. Relationship between files:

   A & B are used in C-J. These model results are presented in Table 3 in the manuscript.
   A is used in K. The output is presented as Figure 3 in the manuscript.
   A is used in L. The output is presented in Appendix A: Table A1 in the manuscript.
   A is used in M. The output is presented in Appendix A: Table A2 in the manuscript.
   A is used in N. The output is presented in Table S1, available as Supporting Information to the manuscript.
   B is used in O. The output is presented in Table S2, available as Supporting Information to the manuscript.
   A is used in P. The output is presented in Tables S3 & S4, available as Supporting Information to the manuscript.
   A is used in Q. The output is presented in Tables S5 & S6, available as Supporting Information to the manuscript.

3. File structure: R scripts are set up to have the data (A & B) in a subfolder named "data". The user should create 2 empty subfolders: "output" for the exported csv files and "figures" for the exported figure files.

--------------------------
METHODOLOGICAL INFORMATION
--------------------------

Methods are described in: Goebel, K. M., D. E. Andersen, P. J. Rice, and N. M. Davros. 2024. Effects of insecticide spray drift on arthropod prey resources of birds in grasslands in Minnesota. The Journal of Wildlife Management.

-----------------------------------------------------------------------------
DATA-SPECIFIC INFORMATION FOR: 20230303_combined_id_meas_manually_cleaned.csv
-----------------------------------------------------------------------------

1. Number of variables: 37

2. Number of cases/rows: 8165

3. Variable list:

   key = concatenation of these columns: site, timing, transect, distance, ID
   code = code written on the insect measuring data sheet: 2-digit year – IM (insect measuring) – number of data sheet
   ent_init = initials of the individual that entered the insect measuring data sheet
   ent_date = date that the insect measuring data sheet was entered
   date = date that the insects were measured
   init = initials of the individual that measured the insects
   year = year the sample was collected (2 digits)
   site = 2-letter site code
   timing = timing of sample collection (P = pre-spraying, 3 = 3-5 days post-spraying, 20 = 19-21 days post-spraying)
   transect = 1-letter transect code (X, Y, Z, or T); X, Y, and Z denote primary transects; T denotes supplementary transect
   distance = distance from field edge (m)
   ID = vial ID number
   count = total count of insects in the vial
   mm_1 = length of insect(s) (mm)
   ct_1 = count of insect(s) with length recorded in column mm_1
   mm_2 = length of insect(s) (mm)
   ct_2 = count of insect(s) with length recorded in column mm_2
   mm_3 = length of insect(s) (mm)
   ct_3 = count of insect(s) with length recorded in column mm_3
   mm_4 = length of insect(s) (mm)
   ct_4 = count of insect(s) with length recorded in column mm_4
   mm_5 = length of insect(s) (mm)
   ct_5 = count of insect(s) with length recorded in column mm_5
   mm_6 = length of insect(s) (mm)
   ct_6 = count of insect(s) with length recorded in column mm_6
   mm_7 = length of insect(s) (mm)
   ct_7 = count of insect(s) with length recorded in column mm_7
   mm_8 = length of insect(s) (mm)
   ct_8 = count of insect(s) with length recorded in column mm_8
   mm_9 = length of insect(s) (mm)
   ct_9 = count of insect(s) with length recorded in column mm_9
   mm_10 = length of insect(s) (mm)
   ct_10 = count of insect(s) with length recorded in column mm_10
   note = notes from insect measuring data sheet
   ID_family = family identification
   ID_order = order identification
   ID_notes = notes from insect identification data sheet

------------------------------------------------------
DATA-SPECIFIC INFORMATION FOR: 20190610_all_veg_cc.csv
------------------------------------------------------

1. Number of variables: 51

2. Number of cases/rows: 984

3. Variable list:

   code = code written on the vegetation data sheet: 2-digit year – V (vegetation) – 2-letter site code – number of data sheet
   ent_init = initials of the individual that entered the vegetation data sheet
   ent_date = date that the vegetation data sheet was entered
   date = date that the vegetation data were collected
   site = 2-letter site code
   obs_1 = initials of observer 1
   obs_2 = initials of observer 2
   timing = timing of sample collection (P = pre-spraying, 3 = 3-5 days post-spraying, 20 = 19-21 days post-spraying)
   transect = 1-letter transect code (X, Y, Z, or T); X, Y, and Z denote primary transects; T denotes supplementary transect
   distance = distance from field edge (m)
   easting = easting of plot (UTM Zone 15N, NAD83)
   northing = northing of plot (UTM Zone 15N, NAD83)
   dom_grass_1 = species of grass that comprised the largest portion of canopy cover in the plot
   dom_grass_2 = species of grass that comprised the second largest portion of canopy cover in the plot
   dom_grass_3 = species of grass that comprised the third largest portion of canopy cover in the plot
   dom_forb_1 = species of forb that comprised the largest portion of canopy cover in the plot
   dom_forb_2 = species of forb that comprised the second largest portion of canopy cover in the plot
   dom_forb_3 = species of forb that comprised the third largest portion of canopy cover in the plot
   gc_bg = count of ground cover points comprising bare ground (out of 30)
   gc_lit = count of ground cover points comprising litter (out of 30)
   gc_oth = count of ground cover points comprising other cover (neither bare ground nor litter; out of 30)
   lit = litter depth (cm)
   mhl = maximum height of live vegetation (dm)
   mhd = maximum height of dead vegetation (dm)
   sp_grass = number of grass species in plot
   sp_forb = number of forb species in plot
   camera = ID of the camera used to take photos for canopy cover analysis
   sd = ID of the SD card used to store photos for canopy cover analysis
   pic_1 = number of the first photo taken for canopy cover analysis
   pic_2 = number of the second photo taken for canopy cover analysis
   pic_3 = number of the third photo taken for canopy cover analysis
   vor_n = visual obstruction reading from north of the Robel pole (dm)
   vor_e = visual obstruction reading from east of the Robel pole (dm)
   vor_s = visual obstruction reading from south of the Robel pole (dm)
   vor_w = visual obstruction reading from west of the Robel pole (dm)
   veg_notes = notes from the vegetation data sheet
   plot = plot location; C = center, L = left, R = right; plots were located at the passive sampling device post (C), 20 m left of the post (L), and 30 m right of the post (R; parallel to the field edge, while facing the field) on the day of spraying; on other sampling dates, plots were located at the start and end of each insect sampling transect (L or R while facing the field)
   cc_pic = filename suffix of the photo used to calculate canopy cover in SamplePoint
   cc_image_file = filename of the photo used to calculate canopy cover in SamplePoint
   grass = number of points comprising live grasses in the plot photo in SamplePoint
   forb = number of points comprising live forbs in the plot photo in SamplePoint
   dead = number of points comprising dead vegetation in the plot photo in SamplePoint
   woody = number of points comprising woody vegetation in the plot photo in SamplePoint
   other = number of points comprising anything that did not belong in any other classification in SamplePoint (grass, forb, dead, or woody)
   total = total number of points used to calculate percent canopy cover from SamplePoint outputs; sum of grass, forb, dead, woody, and other columns; 64 points were initially selected by SamplePoint for identification, but points classified as shadow, ground, or the Daubenmire frame were excluded from canopy cover analysis
   grass.p.adj = percent canopy cover of live grasses (grass/total)
   forb.p.adj = percent canopy cover of live forbs (forb/total)
   dead.p.adj = percent canopy cover of dead vegetation (dead/total)
   woody.p.adj = percent canopy cover of woody vegetation (woody/total)
   other.p.adj = percent canopy cover of anything that did not belong in any other canopy cover classification (other/total)
   cc_comment = comments made while determining canopy cover in SamplePoint

-----------------------
R & PACKAGE INFORMATION
-----------------------

R version 4.3.0 (2023-04-21 ucrt)
Platform: x86_64-w64-mingw32/x64 (64-bit)
Running under: Windows 10 x64 (build 19045)

Matrix products: default

locale:
[1] LC_COLLATE=English_United States.utf8  LC_CTYPE=English_United States.utf8  LC_MONETARY=English_United States.utf8
[4] LC_NUMERIC=C  LC_TIME=English_United States.utf8

time zone: America/Chicago
tzcode source: internal

attached base packages:
[1] stats  graphics  grDevices  utils  datasets  methods  base

other attached packages:
[1] purrr_1.0.1  broom_1.0.5  MuMIn_1.47.5  nlme_3.1-162  r2glmm_0.1.2  MASS_7.3-58.4  tibble_3.2.1  ggpubr_0.6.0
[9] tidyselect_1.2.0  forcats_1.0.0  ggplot2_3.4.2  tidyr_1.3.0  dplyr_1.1.2

loaded via a namespace (and not attached):
[1] Matrix_1.5-4  gtable_0.3.3  compiler_4.3.0  ggsignif_0.6.4  scales_1.2.1  lattice_0.21-8  R6_2.5.1  generics_0.1.3
[9] backports_1.4.1  car_3.1-2  munsell_0.5.0  pillar_1.9.0  rlang_1.1.1  utf8_1.2.3  cli_3.6.1  withr_2.5.0
[17] magrittr_2.0.3  grid_4.3.0  rstudioapi_0.14  lifecycle_1.0.3  vctrs_0.6.2  rstatix_0.7.2  glue_1.6.2  stats4_4.3.0
[25] abind_1.4-5  carData_3.0-5  fansi_1.0.4  colorspace_2.1-0  tools_4.3.0  pkgconfig_2.0.3
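
The folder layout described under "File structure" (a "data" subfolder holding files A & B, plus "output" and "figures" subfolders) can be prepared from R before running the scripts. The following is a minimal sketch in base R, not part of the published scripts. It assumes the working directory is the folder that holds the R scripts, and the helper name setup_project_dirs is hypothetical.

```r
# Minimal sketch (not part of the published scripts): create the subfolders
# the scripts expect. `setup_project_dirs` is a hypothetical helper name.
setup_project_dirs <- function(root = ".") {
  dirs <- file.path(root, c("data", "output", "figures"))
  for (d in dirs) {
    # Existing folders are left untouched
    if (!dir.exists(d)) dir.create(d, recursive = TRUE)
  }
  invisible(dirs)
}

dirs <- setup_project_dirs()

# Files A and B must be copied into "data" by the user; read them only if present.
insect_path <- file.path("data", "20230303_combined_id_meas_manually_cleaned.csv")
veg_path    <- file.path("data", "20190610_all_veg_cc.csv")

if (file.exists(insect_path) && file.exists(veg_path)) {
  insects <- read.csv(insect_path, stringsAsFactors = FALSE)
  veg     <- read.csv(veg_path, stringsAsFactors = FALSE)
  # Compare against the variable and row counts documented in this ReadMe
  print(dim(insects))
  print(dim(veg))
}
```

If the printed dimensions differ from the variable and row counts listed above, the files may have been altered or read with different import settings.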
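
The paired mm_1/ct_1 through mm_10/ct_10 columns in the arthropod file store one length class per pair, which can be awkward to summarize directly. The sketch below shows one possible way to stack the pairs into long format with base R. It uses made-up toy values and only 3 of the 10 pairs, and the helper name lengths_long is hypothetical. Whether the ct_ columns are expected to sum to the count column in every vial is an assumption that is not stated in this ReadMe, so the final step only compares the two.

```r
# Toy rows mimicking the documented columns (values are made up for
# illustration; only 3 of the 10 mm_/ct_ pairs are included).
toy <- data.frame(
  ID    = c(101, 102),
  count = c(5, 2),
  mm_1  = c(3, 7),             ct_1 = c(2, 2),
  mm_2  = c(4, NA_real_),      ct_2 = c(3, NA_real_),
  mm_3  = c(NA_real_, NA_real_), ct_3 = c(NA_real_, NA_real_)
)

# Hypothetical helper: stack each mm_i/ct_i pair into one long table
# with one row per vial and length class.
lengths_long <- function(df, n_pairs) {
  pieces <- lapply(seq_len(n_pairs), function(i) {
    data.frame(
      ID        = df$ID,
      length_mm = df[[paste0("mm_", i)]],
      n         = df[[paste0("ct_", i)]]
    )
  })
  long <- do.call(rbind, pieces)
  # Drop unused pairs (empty cells in the data sheet)
  long[!is.na(long$length_mm) & !is.na(long$n), ]
}

long <- lengths_long(toy, n_pairs = 3)

# Number of measured individuals per vial, for comparison with `count`
measured <- tapply(long$n, long$ID, sum)
comparison <- data.frame(
  ID       = toy$ID,
  count    = toy$count,
  measured = as.vector(measured[as.character(toy$ID)])
)
print(comparison)
```

For the full file, n_pairs would be 10 and the data frame would come from reading file A.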
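
The canopy cover columns in the vegetation file are defined as ratios of SamplePoint point counts (for example, grass.p.adj = grass/total, where total is the sum of the grass, forb, dead, woody, and other columns). The sketch below reproduces that arithmetic on made-up toy values. It computes the ratio exactly as written in the variable list; whether the stored values are additionally scaled (for example, multiplied by 100) is not stated here and should be checked against the data.

```r
# Toy point counts for a single plot photo (values are made up for illustration)
toy_veg <- data.frame(grass = 30, forb = 10, dead = 15, woody = 0, other = 5)

# total = sum of the five classification columns, as described in the variable list
toy_veg$total <- with(toy_veg, grass + forb + dead + woody + other)

# Ratios as written in the variable list (class count / total)
for (cls in c("grass", "forb", "dead", "woody", "other")) {
  toy_veg[[paste0(cls, ".p.adj")]] <- toy_veg[[cls]] / toy_veg$total
}

print(toy_veg)
```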