This codebook.txt file was generated on 2020/02/03 by wilsonkm, and updated on 2020/06/18 by collinsv ------------------- GENERAL INFORMATION ------------------- 1. Title of Dataset Hard fescue (Festuca brevipila) reference transcriptome generated by PabBio Isoform sequencing 2. Author Information Principal Investigator Contact Information Name: Yinjie Qiu Institution: University of Minnesota Email: qiuxx221@umn.edu Associate or Co-investigator Contact Information Name: Ya Yang Institution: University of Minnesota Email: yangya@umn.edu Associate or Co-investigator Contact Information Name: Cory D. Hirsch Institution: University of Minnesota Email: cdhirsch@umn.edu Associate or Co-investigator Contact Information Name: Eric Watkins Institution: University of Minnesota Email: ewatkins@umn.edu 3. Date of data collection: February 2019 4. Geographic location of data collection: Plant Growth Facility, St. Paul, MN. 5. Information about funding sources that supported the collection of the data: Sponsorship: National Institute of Food and Agriculture, U.S. Department of Agriculture, Specialty Crop Research Initiative under award number 2017-51181-27222 -------------------------- SHARING/ACCESS INFORMATION -------------------------- 1. Licenses/restrictions placed on the data: Attribution 3.0 United States 2. Links to publications that cite or use the data: https://www.biorxiv.org/content/10.1101/2020.02.26.966952v1.abstract 3. Links to other publicly accessible locations of the data: 4. Links/relationships to ancillary data sets: 5. Was data derived from another source? If yes, list source(s): 6. Recommended citation for the data: Qiu, Yinjie; Yang, Ya; Hirsch, Cory D.; Watkins, Eric. (2020). Hard fescue (Festuca brevipila) reference transcriptome generated by PabBio Isoform sequencing. Retrieved from the Data Repository for the University of Minnesota, http://hdl.handle.net/11299/211413. OR Qiu, Yinjie, et al. "Building a Reference Transcriptome for the Hexaploid Hard Fescue Turfgrass (Festuca brevipila) Using a Combination of PacBio IsoSeq and Illumina Sequencing." BioRxiv (2020). bioRxiv 2020.02.26.966952; doi: https://doi.org/10.1101/2020.02.26.966952 --------------------- DATA & FILE OVERVIEW --------------------- 1. File List A. Filename: F_brevipila_isoseq_reference_transcriptome.fasta Short description: This is the PacBio Isoform sequencing generated reference transcriptome for the hard fescue SPHD-3. B. Filename: Deposit_annotation.zip Short description: Contains the following annotation files: Filename: KEGG_Annotation.txt Annotation using Kyoto Encyclopedia of Genes and Genomes database NCBI_NR_Annotation.txt Annotation using NCBI Non-Redundant protein database Pfam_Annotation.txt Annotation generated using Pfam protein database SwissProt_Annotation.txt Annotation file using SwissProt protein database Uniref_Annotation.out Annotation file using UniRef protein database C. Filename: polished_mesabi2.hq.fasta.zip Short description: This file contains PacBio Isoform sequencing before performing reads correction and transcript collapsing. It might contains material or viral sequencing contamination as the contaminants removal was performed after reads correction. -------------------------- METHODOLOGICAL INFORMATION -------------------------- All the information could be found in the manuscript released on bioRxiv. It is currently pending final acceptance at Crop Science Journal.