You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardexpand all lines: README.md
+12-12
Original file line number
Diff line number
Diff line change
@@ -1,9 +1,9 @@
1
1
## QuasiModo - Quasispecies Metric Determination on Omics
2
-
> #### Strain-level assembly and SNP calling benchmarking based on sequencing data of mixed strain samples for HCMV
2
+
> #### Strain-level assembly and variant calling benchmarking based on sequencing data of mixed strain samples for HCMV
3
3
4
-
This repository contains the scripts and pipeline that reproduces the results of the HCMV benchmarking study. In this study we evaluated genome assemblers and variant callers on 6 in vitro generated, mixed strain HCMV sequence samples, each consisting of two lab strains in different abundance ratios. This tool can also be used to evaluate assemblies and SNP calling results on other similar datasets.
4
+
This repository contains the scripts and pipeline that reproduces the results of the HCMV benchmarking study. In this study we evaluated genome assemblers and variant callers on 10 in vitro generated, mixed strain HCMV sequence samples, each consisting of two lab strains in different abundance ratios. This tool can also be used to evaluate assemblies and variant calling results on other similar datasets.
5
5
6
-
In this benchmarking study: variants callers `BCFtools` (v1.9), `VarScan` (v2.4.3), `Freebayes` (v1.2.0), `LoFreq` (v2.1.3.1), `CLC Genomics Workbench` (v11.0.1) were evaluated. For the assembly benchmarking, `ABySS` (v2.1.4), `megahit` (v1.1.3) , `IDBA` (v1.1.3), `SPAdes` (v3.12.0), `Ray` (v2.3.1), `tadpole` (v37.99) were assessed. The haplotype reconstruction program `Savage` (v0.4.0) was also evaluated.
6
+
In this benchmarking study: variants callers `BCFtools` (v1.9), `VarScan` (v2.4.3), `Freebayes` (v1.2.0), `LoFreq` (v2.1.3.1), `CLC Genomics Workbench` (v11.0.1) were evaluated. For the assembly benchmarking, `ABySS` (v2.1.4), `megahit` (v1.1.3) , `IDBA` (v1.1.3), `SPAdes` (v3.12.0), `Ray` (v2.3.1), `Tadpole` (v37.99) were assessed. The haplotype reconstruction program `Savage` (v0.4.0) was also evaluated.
Please modify the paths to the sequencing files which you have downloaded accordingly. In this example, the `<your project path>` is `../HCMV_benchmark_output` and the reads are in the `../HCMV_benchmark_output/data/seqs/reads`.
44
44
45
45
46
-
#### ! Due to the high computational and time cost, by default this program do not run the whole benchmark for HCMV dataset from scratch (based on reads), instead it benchmarks the SNP call and assembly based on the VCF files and scaffolds provided within this program under `data` directory.
46
+
#### ! Due to the high computational and time cost, by default this program do not run the whole benchmark for HCMV dataset from scratch (based on reads), instead it benchmarks the variant call and assembly based on the VCF files and scaffolds provided within this program under `data` directory.
47
47
48
48
### Adapt the configuration file
49
49
All the paths must be either relative path to the parent directory of `config` folder or absolute path.
@@ -78,11 +78,11 @@ Options:
78
78
79
79
Commands:
80
80
hcmv Benchmarking for HCMV dataset
81
-
snpeval SNP calling benchmark for customized dataset
81
+
vareval Variant calling benchmark for customized dataset
82
82
asmeval Assembly benchmark for customized dataset
83
83
```
84
84
85
-
This program consists of three subcommands: `hcmv`, `snpeval`, `asmeval`. The first one is used for the benchmarking on our HCMV datasets. And the other two are for the SNP call and assembly evaluation on customized datasets.
85
+
This program consists of three subcommands: `hcmv`, `vareval`, `asmeval`. The first one is used for the benchmarking on our HCMV datasets. And the other two are for the variant call and assembly evaluation on customized datasets.
86
86
87
87
The argumentrs and options in the `hcmv` command:
88
88
```
@@ -99,7 +99,7 @@ Options:
99
99
-t, --threads INTEGER The number of threads to use. [default: 2]
100
100
-d, --dryrun Print the details without run the pipeline.
101
101
[default: False]
102
-
-e, --evaluation [all|snpcall|assembly]
102
+
-e, --evaluation [all|variantcall|assembly]
103
103
The evaluation to run. [required]
104
104
-s, --slow Run the evaluation based on reads, which is
105
105
very slow. By default, the evaluation will
@@ -125,7 +125,7 @@ If you expect to the benchmarking based on the reads, you need to specify the `-
125
125
#### Assess variant callers and analyze the mutation context of identified variants
If you wish to the benchmarking based on the reads, you need to specify the `--slow` or `-s` option which allows you to generate the variant calling results from reads.
0 commit comments