--------------------------------------------------------------------------------
SUMMARY
--------------------------------------------------------------------------------
- Sun Jan 27 20:15:51 2019
- [M92_220]
- software release = 2.1.0(cdf484b8f)
- likely sequencers = could not detect
- assembly checksum = -1,472,506,490,896,885,638
--------------------------------------------------------------------------------
INPUT
- 410.68 M  = READS                 = number of reads; ideal 800M-1200M for human
- 139.50 b  = MEAN READ LEN         = mean read length after trimming; ideal 140
-  53.11 x  = RAW COV               = raw coverage; ideal ~56
-  41.50 x  = EFFECTIVE COV         = effective read coverage; ideal ~42 for raw 56x
-  87.60 %  = READ TWO Q30          = fraction of Q30 bases in read 2; ideal 75-85
- 291.00 b  = MEDIAN INSERT         = median insert size; ideal 350-400
-  90.50 %  = PROPER PAIRS          = fraction of proper read pairs; ideal >= 75
-   1.00    = BARCODE FRACTION      = fraction of barcodes used; between 0 and 1
-   1.17 Gb = EST GENOME SIZE       = estimated genome size
-  18.61 %  = REPETITIVE FRAC       = genome repetitivity index
-   0.95 %  = HIGH AT FRACTION      = high AT index
-  34.55 %  = ASSEMBLY GC CONTENT   = GC content of assembly
-   0.58 %  = DINUCLEOTIDE FRACTION = dinucleotide content
-  17.33 Kb = MOLECULE LEN          = weighted mean molecule size; ideal 50-100
-  26.59    = P10                   = molecule count extending 10 kb on both sides
-  14.43 Kb = HETDIST               = mean distance between heterozygous SNPs
-   5.04 %  = UNBAR                 = fraction of reads that are not barcoded
- 304.00    = BARCODE N50           = N50 reads per barcode
-  10.58 %  = DUPS                  = fraction of reads that are duplicates
-  66.29 %  = PHASED                = nonduplicate and phased reads; ideal 45-50
--------------------------------------------------------------------------------
OUTPUT
-   6.79 K  = LONG SCAFFOLDS        = number of scaffolds >= 10 kb
-  12.11 Kb = EDGE N50              = N50 edge size
-  48.88 Kb = CONTIG N50            = N50 contig size
-  23.61 Kb = PHASEBLOCK N50        = N50 phase block size
- 370.94 Kb = SCAFFOLD N50          = N50 scaffold size
-   4.96 %  = MISSING 10KB          = % of base assembly missing from scaffolds >= 10 kb
- 840.04 Mb = ASSEMBLY SIZE         = assembly size (only scaffolds >= 10 kb)
--------------------------------------------------------------------------------
ALARMS
- The median insert size of the sequencing library is 291. Ideally, this metric
  is between 350 and 400 base pairs. This could affect the quality of the
  assembly.
- The length-weighted mean molecule length is 17328.78 bases. The molecule
  length estimation was successful, however, ideally we would expect a larger
  value. Standard methods starting from blood can yield 100 kb or larger DNA,
  but it can be difficult to obtain long DNA from other sample types. Short
  molecules may reduce the scaffold and phase block N50 length, and could result
  in misassemblies. We have observed assembly quality to improve with longer
  DNA.
--------------------------------------------------------------------------------