This codebook.txt file was generated on <20170104> by K. VanderWaal

-------------------
GENERAL INFORMATION
-------------------

1. Title of Dataset: Database of publications on swine diseases, 1966-2016

2. Author Information

   Principal Investigator Contact Information
      Name: Kimberly VanderWaal
      Institution: University of Minnesota
      Department: Veterinary Population Medicine
      Address: 1365 Gortner Avenue, St. Paul, MN 55108
      Email: kvw@umn.edu

3. Date of data collection (single date, range, approximate date): 19660101 to 20161231

4. Geographic location of data collection (where was data collected?): N/A

5. Information about funding sources that supported the collection of the data: N/A

--------------------------
SHARING/ACCESS INFORMATION
--------------------------

1. Licenses/restrictions placed on the data: None

2. Links to publications that cite or use the data: VanderWaal and Deen. “Global trends in infectious diseases of swine.” As of 20180117, publication not yet submitted.

3. Links to other publicly accessible locations of the data: N/A

4. Links/relationships to ancillary data sets: N/A

5. Was data derived from another source? Yes.
   If yes, list source(s): Web of Science (all databases; webofknowledge.com); Scopus (www.scopus.com); NCBI PubMed (https://www.ncbi.nlm.nih.gov/pubmed/).

6. Recommended citation for the data: VanderWaal, Kimberly. (2018). Database of publications on swine diseases, 1966-2016. Retrieved from the University of Minnesota Digital Conservancy, http://hdl.handle.net/11299/192486.

---------------------
DATA & FILE OVERVIEW
---------------------

1. File List

   A. Filename: data.swine.csv
      Short description: Spreadsheet of publication meta-data on swine diseases from 1966-2016.

--------------------------
METHODOLOGICAL INFORMATION
--------------------------

1. Description of methods used for collection/generation of data: Database was generated from literature searches for 40 priority swine pathogens using ISI's Web of Science, Scopus, and PubMed.
   Terminology searched:
      african swine fever
      ascaris suum
      brucella
      campylobacter
      classical swine fever
      echinococcus
      echinococcus granulosus
      erysipelothrix rhusiopathiae
      escherichia coli
      fasciolopsis buski
      foot and mouth disease
      haemophilus parasuis
      hepatitis e virus
      influenza
      japanese encephalitis virus
      lawsonia intracellularis
      leptospira
      menangle virus
      metastrongylus salmi
      mycobacteria
      mycoplasma hyopneumoniae
      nipah virus
      Pasteurella multocida
      Pleuropneumonia
      porcine circovirus type 2
      porcine epidemic diarrhea
      porcine parvovirus
      porcine reproductive and respiratory syndrome
      pseudorabies
      rotavirus group a
      salmonella
      serpulina hyodysenteriae
      streptococcus suis
      taenia solium
      toxoplasma gondii
      transmissible gastroenteritis virus
      trichinella
      trichuris suis
      trypanosoma
      vesicular stomatitis virus
      staphylococcus aureus

2. Methods for processing the data: Searches from each of the databases were compiled in a spreadsheet with a common format using the RISmed and bibliometrix packages in R Statistical Software v3.2.3.

3. Describe any quality-assurance procedures performed on the data: Data were cleaned and error-checked manually.
   First, because titles across databases were not always identical, we randomly selected 385 articles and manually checked whether our data-mining scripts had correctly combined publications that appeared in searches from more than one database (PubMed, Scopus, ISI) into a single record, or whether they had been duplicated because of slight variations in the title. We found that 3% and 1% of publications were not appropriately combined into a single record and appeared as two or three unique records, respectively, in our assembled database.
   We also randomly selected 385 publications to manually verify that country assignments were correct. We found an error rate of 3.44% (publications appeared with an improper country assignment, though they were also cross-listed with the appropriate country).
   The error rate for country assignments when using the author affiliation field (n = 385) was 2.0%. Finally, we checked 385 publications to ensure the pathogen details were correct. This yielded an error rate of 3.3 ± 0.009% (S.E.).

4. People involved with sample collection, processing, analysis and/or submission: K. VanderWaal; J. Deen

-----------------------------------------
DATA-SPECIFIC INFORMATION FOR: data.swine.csv
-----------------------------------------

1. Number of variables: 19

2. Number of cases/rows: 211,437

3. Missing data codes: N/A

4. Variable List

   A. Name: barcode
      Description: A unique identifier for each publication

   B. Name: affiliation
      Description: listed affiliation of author/s in publication record

   C. Name: title
      Description: title of publication

   D. Name: abstract
      Description: publication abstract

   E. Name: country
      Description: country where work was performed/author was affiliated. All publications are listed once as "all" and a duplicate record is listed for each country that is associated with the publication (country was extracted from title, abstract, or author affiliation).

   F. Name: cont
      Description: continent in which the country is located

   G. Name: au.loc
      Description: country/s listed in author affiliation (publication is listed once for each country that appears)

   H. Name: year
      Description: year of publication

   I. Name: journal
      Description: publication journal

   J. Name: volume
      Description: journal volume

   K. Name: first
      Description: surname of the first author

   L. Name: second
      Description: surname of the second author

   M. Name: last
      Description: surname of the last author

   N. Name: pubmed
      Description: 0/1 indicator of whether this publication was found in literature searches using PubMed

   O. Name: scopus
      Description: 0/1 indicator of whether this publication was found in literature searches using Scopus

   P. Name: isi
      Description: 0/1 indicator of whether this publication was found in literature searches using ISI Web of Science

   Q. Name: loc.type
      Description: Indicates whether the country was assigned based on the affiliation of authors or on mentions in the title/abstract

   R. Name: tag2
      Description: Pathogen described in the publication. One record occurs for each pathogen described in the publication.

   S. Name: region
      Description: Geographic region associated with the country (e.g., Northern Europe, Southern Asia, etc.)

   T. Name: develop
      Description: Whether a country is listed as "developed" or "developing" by the United Nations.
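
Note on counting publications: because each publication is listed once with country "all", again for every associated country, and once per pathogen in tag2, raw row counts overstate the number of publications. The following is a minimal sketch of one way to count unique publications per year. It is not part of the original workflow (which used R). It assumes that the variable names listed above appear as the header row of data.swine.csv and that the catch-all value is the literal lowercase string "all". The function name and the sample rows are hypothetical.

```python
import csv
import io
from collections import defaultdict


def unique_publications_per_year(rows):
    """Count distinct barcodes per year among rows whose country is "all"."""
    barcodes_by_year = defaultdict(set)
    for row in rows:
        # Country-specific rows repeat a publication once per associated
        # country, so only the catch-all "all" rows are considered here.
        if row["country"] != "all":
            continue
        # A publication can still repeat (one row per pathogen in tag2),
        # so barcodes go into a set and each is counted once.
        barcodes_by_year[row["year"]].add(row["barcode"])
    return {year: len(codes) for year, codes in sorted(barcodes_by_year.items())}


# Hypothetical sample rows using a subset of the codebook's variable names.
SAMPLE = """barcode,country,year,tag2
p1,all,2015,influenza
p1,all,2015,salmonella
p1,China,2015,influenza
p1,China,2015,salmonella
p2,all,2016,brucella
p2,Spain,2016,brucella
"""

sample_counts = unique_publications_per_year(csv.DictReader(io.StringIO(SAMPLE)))
print(sample_counts)  # {'2015': 1, '2016': 1}
```

To apply the same function to the real file, pass it a csv.DictReader opened on data.swine.csv. The file's text encoding is not stated in this codebook, so it may need to be specified explicitly when opening the file.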