forked from nf-core/kmermaid
-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add option to trim low abundance reads #8
Comments
Makes sense to me! Though in my experience trim-low-abun was about 80% of
the runtime, so this would end up saving 20% of total.
…On Tue, Apr 16, 2019 at 12:15 PM Olga Botvinnik ***@***.***> wrote:
As mentioned [here(]
https://sourmash.readthedocs.io/en/latest/using-sourmash-a-guide.html#how-should-i-prepare-my-data)
and here <sourmash-bio/sourmash#601> both khmer and
sourmash support streaming, so to trim low abundance k-mers one could pipe
the output of trim-low-abund.py directly into sourmash compute, which
would save time as one would not need to wait from trim-low-abund.py to
finish (as it is quite slow).
the -M flag is for memory and one could increase it to e.g. 16GiB to make
it faster as well
trim-low-abund.py -C 3 -Z 18 -V -M 2e9 ${reads} | \
sourmash compute ${flags} -
Thoughts, @phoenixAja <https://github.com/phoenixAja> and @batson
<https://github.com/batson> ?
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
<#8>, or mute the
thread
<https://github.com/notifications/unsubscribe-auth/ABM-atGWhx6rmISVRCGS56IxkpgK7vGvks5vhiFtgaJpZM4czhSj>
.
|
Great! I'll take a stab at writing a PR. After I fix these tests first
though ...
---
Olga Botvinnik, PhD
olgabotvinnik.com <http://www.olgabotvinnik.com>
On Tue, Apr 16, 2019 at 6:21 PM Joshua Batson <[email protected]>
wrote:
… Makes sense to me! Though in my experience trim-low-abun was about 80% of
the runtime, so this would end up saving 20% of total.
On Tue, Apr 16, 2019 at 12:15 PM Olga Botvinnik ***@***.***>
wrote:
> As mentioned [here(]
>
https://sourmash.readthedocs.io/en/latest/using-sourmash-a-guide.html#how-should-i-prepare-my-data
)
> and here <sourmash-bio/sourmash#601> both khmer and
> sourmash support streaming, so to trim low abundance k-mers one could
pipe
> the output of trim-low-abund.py directly into sourmash compute, which
> would save time as one would not need to wait from trim-low-abund.py to
> finish (as it is quite slow).
>
> the -M flag is for memory and one could increase it to e.g. 16GiB to make
> it faster as well
>
> trim-low-abund.py -C 3 -Z 18 -V -M 2e9 ${reads} | \
> sourmash compute ${flags} -
>
> Thoughts, @phoenixAja <https://github.com/phoenixAja> and @batson
> <https://github.com/batson> ?
>
> —
> You are receiving this because you were mentioned.
> Reply to this email directly, view it on GitHub
> <#8>, or mute the
> thread
> <
https://github.com/notifications/unsubscribe-auth/ABM-atGWhx6rmISVRCGS56IxkpgK7vGvks5vhiFtgaJpZM4czhSj
>
> .
>
—
You are receiving this because you authored the thread.
Reply to this email directly, view it on GitHub
<#8 (comment)>,
or mute the thread
<https://github.com/notifications/unsubscribe-auth/AAxNcEAj3DGCBJpe7jAzRiUR0Eia6qwVks5vhk0BgaJpZM4czhSj>
.
|
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
As mentioned here and here both khmer and sourmash support streaming, so to trim low abundance k-mers one could pipe the output of
trim-low-abund.py
directly intosourmash compute
, which would save time as one would not need to wait fromtrim-low-abund.py
to finish (as it is quite slow).the
-M
flag is for memory and one could increase it to e.g. 16GiB to make it faster as wellThoughts, @phoenixAja and @batson ?
The text was updated successfully, but these errors were encountered: