You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: CHANGELOG.md
+1-1Lines changed: 1 addition & 1 deletion
Original file line number
Diff line number
Diff line change
@@ -11,7 +11,7 @@ Deployed on September 23rd, 2024
11
11
* Initial changes in `qiita_client` to have more accurate variable names: `QIITA_SERVER_CERT` -> `QIITA_ROOTCA_CERT`. Thank you @charles-cowart!
12
12
* Added `get_artifact_html_summary` to `qiita_client` to retrieve the summary file of an artifact.
13
13
* Re-added github actions to `https://github.com/qiita-spots/qiita_client`.
14
-
*`Woltka v0.1.4, paired-end` superseded `Woltka v0.1.4` in `qp-woltka`; [more information](https://qiita.ucsd.edu/static/doc/html/processingdata/woltka_pairedend.html). Thank you to @qiyunzhu for the benchmarks!
14
+
*`Woltka v0.1.6, paired-end` superseded `Woltka v0.1.6` in `qp-woltka`; [more information](https://qiita.ucsd.edu/static/doc/html/processingdata/woltka_pairedend.html). Thank you to @qiyunzhu for the benchmarks!
15
15
* Other general fixes, like [#3424](https://github.com/qiita-spots/qiita/pull/3424), [#3425](https://github.com/qiita-spots/qiita/pull/3425).
Copy file name to clipboardExpand all lines: qiita_pet/support_files/doc/source/processingdata/woltka_pairedend.rst
+6-7Lines changed: 6 additions & 7 deletions
Original file line number
Diff line number
Diff line change
@@ -6,16 +6,16 @@ Benchmarks created by Qiyun Zhu (@qiyunzhu) on Aug 1, 2024.
6
6
Summary
7
7
-------
8
8
9
-
I tested alternative read pairing schemes in the analysis of shotgun metagenomic sequencing data. Sequencing reads were aligned against a reference microbial genome database as unpaired or paired, with or without singleton and/or discordant alignments suppressed. A series of synthetic datasets were used in the analysis.
9
+
I tested alternative read pairing schemes in the analysis of shotgun metagenomic sequencing data. Sequencing reads were aligned against a reference microbial genome database as unpaired or paired. A series of synthetic datasets were used in the analysis.
10
10
11
-
The results reveal that treating reads as paired is always advantageous over unpaired. Suppressing singleton alignments further increases the accuracy of results, despite the cost of lower mapping rate. Suppressing discordant alignments has no obvious impact on the result. Regardless of accuracy, the downstream community ecology analyses are not obviously impacted by the choice of parameters.
11
+
The results reveal that treating reads as paired is always advantageous over unpaired. Regardless of accuracy, the downstream community ecology analyses are not obviously impacted by the choice of parameters.
12
12
13
-
Therefore, I recommend the general adoption of paired alignments as a standard procedure. I also endorse suppressing singleton and discordant alignments, but note the favor of further tests on whether they may reduce sensitivity with complex communities.
13
+
Therefore, I recommend the general adoption of paired alignments as a standard procedure.
14
14
15
15
Alignment parameters
16
16
--------------------
17
17
18
-
Sequencing data were aligned using Bowtie2 v2.5.1 in the “very sensitive” mode against the WoL2 database. They were treated as either unpaired or paired-end:
18
+
Sequencing data were aligned using Bowtie2 v2.5.1 in the "very sensitive" mode against the WoL2 database. They were treated as either unpaired or paired-end:
19
19
20
20
- SE: Reads are treated as unpaired (Bowtie2 input: -U merged.fq)
21
21
- PE: Reads are treated as paired (Bowtie2 input: -1 fwd.fq, -2 rev.fq)
@@ -30,11 +30,10 @@ Five synthetic datasets were generated with 25 samples each consisting of random
30
30
31
31
The results of the five Bowtie2 parameter sets were compared using nine metrics:
32
32
33
-
Three metrics that only rely on each result.
33
+
Two metrics that only rely on each result.
34
34
35
35
- Mapping rate (%)
36
36
- Number of taxa
37
-
- Entropy (i.e., Shannon index, but without subsampling)
38
37
39
38
Six metrics that rely on comparing each result against the ground truth (higher is better):
40
39
@@ -59,4 +58,4 @@ The results revealed:
59
58
#. PE outperforms SE in all metrics. Most importantly, it reduces false positive rate (higher precision) while retaining mapping rate. Meanwhile, the sensitivity (recall) of identifying true taxa is not obviously compromised (note the y-axis scale).
60
59
#. PE.NU the two additional parameters had minimum effect on the result and make the alignment step faster. This may suggest that the additional parameters are safe to use.
61
60
62
-
Therefore, I would recommend adopting paired alignment in preference to unpaired alignment. I may suggest no mixing as it has improved accuracy, but the potential adverse effect of lower mapping rate may be further explored before making a compelling recommendation. Although not having a visible effect, no discordance may be added for logical coherency.
61
+
Therefore, I would recommend adopting paired alignment in preference to unpaired alignment.
0 commit comments