- Split View
-
Views
-
Cite
Cite
Arthur T O Melo, Iago Hale, Expanded functionality, increased accuracy, and enhanced speed in the de novo genotyping-by-sequencing pipeline GBS-SNP-CROP, Bioinformatics, Volume 35, Issue 17, September 2019, Page 3215, https://doi.org/10.1093/bioinformatics/bty1073
- Share Icon Share
Bioinformatics (2019) doi: 10.1093/bioinformatics/bty873, 35, 1783–1785.
In the original article, there was an error in the formatting of Table 1.
Pipelinea . | MR genob . | Time (min)c . | Variants calledd . | Type I errore . | Type II errorf . | Accuracyg . |
---|---|---|---|---|---|---|
UNEAK | NA | 8.5 | 2642 | 0.9% | 92.5% | 7.5% |
GSC v.1.0 | 1 | 370.8 | 23 395 | 1.3% | 34.1% | 65.4% |
GSC v.4.0 | 1 | 121.7 | 29 738 | 0.6% | 15.6% | 84.0% |
5 | 156.9 | 26 885 | 0.6% | 23.6% | 76.0% | |
10 | 171.5 | 26 854 | 0.5% | 23.7% | 76.1% | |
15 | 179.1 | 26 897 | 0.5% | 23.6% | 76.1% | |
20 | 183.0 | 26 892 | 0.5% | 23.6% | 76.1% | |
25 | 163.2 | 26 901 | 0.5% | 23.5% | 76.2% |
Pipelinea . | MR genob . | Time (min)c . | Variants calledd . | Type I errore . | Type II errorf . | Accuracyg . |
---|---|---|---|---|---|---|
UNEAK | NA | 8.5 | 2642 | 0.9% | 92.5% | 7.5% |
GSC v.1.0 | 1 | 370.8 | 23 395 | 1.3% | 34.1% | 65.4% |
GSC v.4.0 | 1 | 121.7 | 29 738 | 0.6% | 15.6% | 84.0% |
5 | 156.9 | 26 885 | 0.6% | 23.6% | 76.0% | |
10 | 171.5 | 26 854 | 0.5% | 23.7% | 76.1% | |
15 | 179.1 | 26 897 | 0.5% | 23.6% | 76.1% | |
20 | 183.0 | 26 892 | 0.5% | 23.6% | 76.1% | |
25 | 163.2 | 26 901 | 0.5% | 23.5% | 76.2% |
Note: In total, 25 000 SNPs and 10 000 indels were simulated across a genomic space of 100 000 GBS fragments. A total of 60 002 165 single-end reads were simulated for a population of 25 individuals (average of 2.4 million reads per genotype), with a sequencing error rate of 1.1%. See Supplementary Table S1 for more details
UNEAK = TASSEL-UNEAK; GSC = GBS-SNP-CROP.
The number of genotypes used for mock reference (MR) assembly.
Computation time (minutes) required to run the full analysis on a Unix workstation with 16 GB RAM and a 2.6 GHz Dual Intel processor.
Number of variants called by a pipeline (Note: a total of 35 000 variants were simulated, consisting of 25 000 SNPs and 10 000 indels).
Percentage of called variants that could not be validated (false positives).
Percentage of true, simulated variants that were not detected by the pipeline.
Overall accuracy: 100 * [number of validated variants/(total number of simulated variants + number of non-validated variants)].
Pipelinea . | MR genob . | Time (min)c . | Variants calledd . | Type I errore . | Type II errorf . | Accuracyg . |
---|---|---|---|---|---|---|
UNEAK | NA | 8.5 | 2642 | 0.9% | 92.5% | 7.5% |
GSC v.1.0 | 1 | 370.8 | 23 395 | 1.3% | 34.1% | 65.4% |
GSC v.4.0 | 1 | 121.7 | 29 738 | 0.6% | 15.6% | 84.0% |
5 | 156.9 | 26 885 | 0.6% | 23.6% | 76.0% | |
10 | 171.5 | 26 854 | 0.5% | 23.7% | 76.1% | |
15 | 179.1 | 26 897 | 0.5% | 23.6% | 76.1% | |
20 | 183.0 | 26 892 | 0.5% | 23.6% | 76.1% | |
25 | 163.2 | 26 901 | 0.5% | 23.5% | 76.2% |
Pipelinea . | MR genob . | Time (min)c . | Variants calledd . | Type I errore . | Type II errorf . | Accuracyg . |
---|---|---|---|---|---|---|
UNEAK | NA | 8.5 | 2642 | 0.9% | 92.5% | 7.5% |
GSC v.1.0 | 1 | 370.8 | 23 395 | 1.3% | 34.1% | 65.4% |
GSC v.4.0 | 1 | 121.7 | 29 738 | 0.6% | 15.6% | 84.0% |
5 | 156.9 | 26 885 | 0.6% | 23.6% | 76.0% | |
10 | 171.5 | 26 854 | 0.5% | 23.7% | 76.1% | |
15 | 179.1 | 26 897 | 0.5% | 23.6% | 76.1% | |
20 | 183.0 | 26 892 | 0.5% | 23.6% | 76.1% | |
25 | 163.2 | 26 901 | 0.5% | 23.5% | 76.2% |
Note: In total, 25 000 SNPs and 10 000 indels were simulated across a genomic space of 100 000 GBS fragments. A total of 60 002 165 single-end reads were simulated for a population of 25 individuals (average of 2.4 million reads per genotype), with a sequencing error rate of 1.1%. See Supplementary Table S1 for more details
UNEAK = TASSEL-UNEAK; GSC = GBS-SNP-CROP.
The number of genotypes used for mock reference (MR) assembly.
Computation time (minutes) required to run the full analysis on a Unix workstation with 16 GB RAM and a 2.6 GHz Dual Intel processor.
Number of variants called by a pipeline (Note: a total of 35 000 variants were simulated, consisting of 25 000 SNPs and 10 000 indels).
Percentage of called variants that could not be validated (false positives).
Percentage of true, simulated variants that were not detected by the pipeline.
Overall accuracy: 100 * [number of validated variants/(total number of simulated variants + number of non-validated variants)].
This has been corrected and the corrected table appears below.