2024.09 #3433

antgonza · 2024-09-12T14:05:30Z

No description provided.

antgonza · 2024-09-12T14:06:02Z

qiita_pet/support_files/doc/source/processingdata/woltka_pairedend.rst

@@ -0,0 +1,72 @@
+Wolka and Bowtie2 using Read Pairing Schemes


@qiyunzhu; could you take a look? Thank you.

coveralls · 2024-09-12T14:24:50Z

Coverage: 92.753% (remained the same) when pulling b7a98aa on antgonza:2024.09 into 913a31f on qiita-spots:dev.

charles-cowart

Approved! A few text suggestions to take or leave at your discretion.

charles-cowart · 2024-09-12T19:37:40Z

qiita_pet/support_files/doc/source/processingdata/processing-recommendations.rst

-   The bowtie2 settings are maximum and minimum mismatch penalties (mp=[1,1]), a
-   penalty for ambiguities (np=1; default), read and reference gap open- and
+   The bowtie2 settings are set for interleaved processing with a maximum and minimum mismatch
+   penalties (mp=[1,1]), a penalty for ambiguities (np=1; default), read and reference gap open- and


"reference gap open- and" seems odd. Perhaps it should be "open-and"? or maybe the hyphen should be removed?

charles-cowart · 2024-09-12T19:38:47Z

qiita_pet/support_files/doc/source/processingdata/woltka_pairedend.rst

+
+Here I tested alternative read pairing schemes in the analysis of shotgun metagenomic sequencing data. Sequencing reads were aligned against a reference microbial genome database as unpaired or paired, with or without singleton and/or discordant alignments suppressed. A series of synthetic datasets were used in the analysis.
+
+The results reveal that treating reads as paired is always advantageous over unpaired. Suppressing singleton alignments further increases the accuracy of results, despite at the cost of lower mapping rate. Suppressing discordant alignments has no obvious impact on the result. Regardless of accuracy, the downstream community ecology analyses are not obviously impacted by the choice of parameters.


despite at the cost -> despite the cost

charles-cowart · 2024-09-12T19:38:57Z

qiita_pet/support_files/doc/source/processingdata/woltka_pairedend.rst

+Summary
+-------
+
+Here I tested alternative read pairing schemes in the analysis of shotgun metagenomic sequencing data. Sequencing reads were aligned against a reference microbial genome database as unpaired or paired, with or without singleton and/or discordant alignments suppressed. A series of synthetic datasets were used in the analysis.


Here I tested -> I tested

qiyunzhu · 2024-09-12T23:20:41Z

qiita_pet/support_files/doc/source/processingdata/woltka_pairedend.rst

+
+- Mapping rate (%)
+- Number of taxa
+- Entropy (i.e., Shannon index, but without subsampling)


This line (37) can be removed. Also replace "three" with "two" in line 33.

qiyunzhu · 2024-09-12T23:21:48Z

qiita_pet/support_files/doc/source/processingdata/woltka_pairedend.rst

+Summary
+-------
+
+I tested alternative read pairing schemes in the analysis of shotgun metagenomic sequencing data. Sequencing reads were aligned against a reference microbial genome database as unpaired or paired, with or without singleton and/or discordant alignments suppressed. A series of synthetic datasets were used in the analysis.


"singleton and/or discordant alignments" is now irrelevant. It needs to be removed from this paragraph and the next two.

qiyunzhu · 2024-09-12T23:22:25Z

qiita_pet/support_files/doc/source/processingdata/woltka_pairedend.rst

+Alignment parameters
+--------------------
+
+Sequencing data were aligned using Bowtie2 v2.5.1 in the “very sensitive” mode against the WoL2 database. They were treated as either unpaired or paired-end:


Could the curly double quotes (“ ”) be replaced with straight quotes (")?

qiyunzhu · 2024-09-12T23:22:58Z

qiita_pet/support_files/doc/source/processingdata/woltka_pairedend.rst

+#. PE outperforms SE in all metrics. Most importantly, it reduces false positive rate (higher precision) while retaining mapping rate. Meanwhile, the sensitivity (recall) of identifying true taxa is not obviously compromised (note the y-axis scale).
+#. PE.NU the two additional parameters had minimum effect on the result and make the alignment step faster. This may suggest that the additional parameters are safe to use.
+
+Therefore, I would recommend adopting paired alignment in preference to unpaired alignment. I may suggest no mixing as it has improved accuracy, but the potential adverse effect of lower mapping rate may be further explored before making a compelling recommendation. Although not having a visible effect, no discordance may be added for logical coherency.


Can remove "no mixing... no discordance".

qiyunzhu · 2024-09-12T23:23:35Z

CHANGELOG.md

+* Initial changes in `qiita_client` to have more accurate variable names: `QIITA_SERVER_CERT` -> `QIITA_ROOTCA_CERT`. Thank you @charles-cowart!
+* Added `get_artifact_html_summary` to `qiita_client` to retrieve the summary file of an artifact.
+* Re-added github actions to `https://github.com/qiita-spots/qiita_client`.
+* `Woltka v0.1.4, paired-end` superseded `Woltka v0.1.4` in `qp-woltka`; [more information](https://qiita.ucsd.edu/static/doc/html/processingdata/woltka_pairedend.html). Thank you to @qiyunzhu for the benchmarks!


Should upgrade Woltka to v0.1.6. (!important!)

antgonza · 2024-09-13T13:09:50Z

Thank you @qiyunzhu, I made the suggested changes in a new PR: #3434

2024.09

2441099

antgonza requested a review from charles-cowart September 12, 2024 14:28

charles-cowart approved these changes Sep 12, 2024

antgonza added 2 commits September 12, 2024 13:52

addressing @charles-cowart comments

888f433

update based on @qiyunzhu recommendations

b7a98aa

charles-cowart approved these changes Sep 12, 2024

charles-cowart merged commit 634db46 into qiita-spots:dev Sep 12, 2024
4 checks passed

qiyunzhu reviewed Sep 12, 2024
