-
Notifications
You must be signed in to change notification settings - Fork 25
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add ability to strip sequences/qualities from SAM/BAM files #102
Comments
Could also be a flag when saving (to the |
Hi, I would like to work on this. So I've written a |
Luis suggestion above would be to have an extra attribute on the Something like::
My initial idea was to have it as part of a
The second interface has a few more use-cases but we didn't reach a decision on which to implement. @luispedro thoughts? |
The @unode: what use-cases do you see with the second interface? I am not against it, but the |
The main case I envision is optimization. Assuming a long pipeline using SAM/BAM that doesn't require qualities, removing them early could speed up processing by reducing I/O. I had a couple of such cases in the past but wouldn't call it a frequent use-case. |
In principle, we could move the stripping to earlier in the pipeline as an optimization later without changing the user-visible interface. |
Ok, so @sureyeaah can you also add a line to Line 2 in 59576e0
|
When working with very large SAM files it is often convenient to remove sequence and quality information to reduce storage and improve I/O.
Following from this it would be convenient to have a
stripSeqQual
function that replaces the two fields with*
.The text was updated successfully, but these errors were encountered: