Data Curation Network Primers
Persistent link for this collection
Archived primers from the 2018-2020 Specialized Data Curation Workshops presented by the Data Curation Network and funded by a grant from the Institute for Museum and Library Services (IMLS RE-85-18-0040-18). Data curation primers are interactive, living documents that detail a specific subject, disciplinary area or curation task and that can be used as a reference to curate research data.
Interactive primers available for download and derivatives at: https://github.com/DataCurationNetwork/data-primers
Browse
Browsing Data Curation Network Primers by Issue Date
Now showing 1 - 20 of 42
Results Per Page
Sort Options
Item SPSS Data Curation Primer(Data Curation Network, 2019) Deng, Sai; Dull, Joshua; Finn, Jeanine; Khair, ShahiraThis data curation primer primarily discusses .sav and .por files. SPSS Statistics (.sav): Data files saved in IBM SPSS Statistics format. Portable (.por): Portable format that can be read by other versions of IBM SPSS Statistics and versions on other operating systems.Item Wordpress.com (hosted) Data Curation Primer(Data Curation Network, 2019) James, HeatherWordPress.com is the hosted version of the open source WordPress.org software (https://en.support.wordpress.com/com-vs-org/; https://dailypost.wordpress.com/2013/11/14/com-or-org/) offering a free online publishing platform with optional features, plans, and custom domains available for additional cost (https://wordpress.com/about/). This primer will focus exclusively on the WordPress.com free site export and archiving process. In the future additional primers and/or additions to this primer may be beneficial in order to cover the variations with WordPress.com Business Plan sites and WordPress.org software.Item Tutorial for using the netCDF Data Curation Primer(Data Curation Network, 2019) Hou, SophieThis document is a supplemental primer to the main IMLS-Data-CurationFormat Profile-netCDF primer (http://hdl.handle.net/2027.42/145724). Within this primer, the NCAR Global Climate Four-Dimensional Data Assimilation (CFDDA) Hourly 40 km Reanalysis dataset from the Research Data Archive (RDA) at the National Center for Atmospheric Research (NCAR) is used to demonstrate how to assess a netCDF-based dataset according to the main primer’s instructions. In particular, Panoply, a curation review tool that is recommended by the main primer, is used to examine the dataset in order to help answer the questions outlined in the “Key Questions for Curation Review” section of the main primer.Item Microsoft Excel Data Curation Primer(Data Curation Network, 2019) Janée, Greg; Sawchuk, Sandra; Yoo, Ho JungMicrosoft Excel’s widespread adoption in the corporate sector is well known, but the application has also found use in many areas of scholarship. Despite the ubiquity of tabular data in CSV (comma-separated values) format, and the availability of many tools and analysis platforms that operate on CSV files, Microsoft Excel continues to be used widely in the natural sciences and social sciences. As a consequence, Excel files are routinely deposited in data repositories and curators are likely to encounter them.Item Jupyter Notebooks: A Primer for Data Curators(Data Curation Network, 2019) Bouquin, Daina; Hou, Sophie; Benzing, Matthew; Wilson, LeeJupyter Notebooks are composite digital objects used to develop, share, view, and execute interspersed, interlinked, and interactive documentation, equations, visualizations, and code. Researchers seeking to deposit software, in this case Jupyter Notebooks, in repositories do so with the expectation that repositories will provide documentation explaining “what you can deposit, the supported file formats for deposits, what metadata you may need to provide, how to provide this metadata and what happens after you make your deposit” (Jackson, 2018a). This expectation is not necessarily met by repositories that currently accept software deposits and complex objects like Jupyter Notebooks. This guide is meant to both inform curatorial practices around Jupyter Notebooks, and support the development of resources that meet researchers’ expectations to ensure long-term availability of software in curated archival repositories. Guidance provided by Jisc and the Software Sustainability Institute outlines three different kinds of software deposits: a minimal deposit, a runnable deposit, and a comprehensive deposit (Jackson, 2018b). This primer follows this same conceptual framework in dealing with Jupyter Notebooks, which even in their static, non-executable form, can be used to document how scientific research was carried out or be used as teaching models among many other use cases.Item Microsoft Access Data Curation Primer(Data Curation Network, 2019) Rios, Fernando; Fearon, DaveThis primer assumes a conceptual familiarity with relational databases (and associated terminology) and a basic level of experience with Microsoft Access.Item GeoDatabase (.gdb) Data Curation Primer(Data Curation Network, 2019) Battista, Andrew; Brittnacher, Tom; Garrett, Zenobie; Moore, Jennifer; Pirmann, CarrieThe geodatabase is a container for geospatial datasets that can also provide relational functionality between the files. Although the term geodatabase can be used more widely, this primer describes the ArcGIS geodatabase designed by Esri.Item Human Subjects Data Essentials Data Curation Primer(Data Curation Network, 2020) Darragh, Jen; Hofelich Mohr, Alicia; Hunt, Shanda; Woodbrook, Rachel; Fearon, Dave; Moore, Jennifer; Hadley, HannahItem NVivo Data Curation Primer(Data Curation Network, 2020) Hadley, HannahItem SAS Data Curation Primer(Data Curation Network, 2020) Xu, QiongItem General Databases Data Curation Primer(Data Curation Network, 2020) Xin, XuyingItem Google Docs Data Curation Primer(Data Curation Network, 2020) Dixson, Nadia; Kriesberg, AdamItem GeoTiff Data Curation Primer(Data Curation Network, 2020) Kearney, Courtney; Ruhs, Nick; Sedlins, Mara; Tien, Tracy; Trelogan, Jessica; Watts, JohnItem ISO Disk Images Data Curation Primer(Data Curation Network, 2020) Barron, Kate; Bohan, JonathanItem Twitter Data Curation Primer(Data Curation Network, 2020) Kalt, Marley; Scott, DorrisItem Neuroimaging DICOM and NIfTI Data Curation Primer(Data Curation Network, 2020) Moore, Michael; Patterson, Brandon; Samuel, Sara; Sheridan, Helenmary; Sorensen, ChrisItem Confocal Microscopy Data Curation Primer(Data Curation Network, 2020-01-03) Ivey, Susan; Koshoffer, Amy; Sneff, Gretchen; Wang, HuajinThe purpose of this primer is to guide a data curator through the curation process for confocal images. It describes the image specifics, as well as what details and metadata from the instrumentation and experiment is needed to understand the images and use them for further research or educational purposes.Item Atlas.ti Data Curation Primer(Data Curation Network, 2020-01-03) Corral, MargaritaAltas.ti is a software application that allows researchers to analyze qualitative data in a systematic and transparent way, increasing the validity of results (Friese 2019). ATLAS.ti handles different types of data that are kept in a project. The project files can contain text documents, images, audio recordings, videos, pdf files, geo data, Twitter data, citations from Evernote and reference managers, and survey data. The purpose of this primer is to guide a data curator through the curation process for Altas.ti files.Item R Data Curation Primer(Data Curation Network, 2020-01-03) Kellam, Lynda; Koziar, Katherine; Pejša, StanislavThe purpose of this primer is to guide a data curator through the curation process for text files with a “.R” extension that contain code for executing programs in the R language.Item GeoJSON Data Curation Primer(Data Curation Network, 2020-01-03) Dixson, Nadia; Milliken, Genevieve; Mukunda, Keshav; Murray, Reina; Starry, RachelGeoJSON is a geospatial data interchange format for encoding vector geographical data structures, such as point, line, and polygon geometries, as well as their non-spatial attributes. The purpose of this primer is to guide a data curator through the curation process for GeoJSON files.
- «
- 1 (current)
- 2
- 3
- »