Marlowe, Jillian L.Durward-Akhurst, Sian A.McCue, Molly E.2024-09-262024-09-262024-09-26https://hdl.handle.net/11299/265733Simulated data is an inexpensive and timely alternative to the generation of novel whole genome sequence (WGS) data. We generated artificial DNA sequence reads representing simulated equine genomes which contain predetermined genetic variants that can be used as a benchmark dataset to evaluate genomic processing tools. The VCFs contained in this repository contain the benchmark set of genetic variants for each of the simulated genomes that were created. These VCFs can be used in conjunction with the simulated sequences which are also publicly available to evaluate WGS processing tools.CC0 1.0 Universalhttp://creativecommons.org/publicdomain/zero/1.0/equineequus caballussimulationwhole genome sequencinggenomic analysisVCF truth sets of variants inserted into simulated equine genomes (90 VCFs)Datasethttps://doi.org/10.13020/NM3A-W471