Python: index_summary : Cluster Count == # of seq * 2 of a PE run? #271

Open
sklages opened this issue May 19, 2021 · 3 comments

@sklages

sklages commented May 19, 2021

I hadn't noticed this before:

index_summary(run_metrics, level='Barcode') returns as Cluster Count the total number of reads of a PE run, not the actual number of clusters (= single reads, R1 seq count, whatever).
That's confusing.

Is that intended?

This is interop-1.1.23.

@ezralanglois
Member

I think you mean a dual index run, and yes, this is annoying. As far as I know, it has always been this way, and we would need to bump the major or minor version to change it, because this could be a breaking change for downstream applications.

@sklages
Author

sklages commented May 20, 2021

No, this is a simple paired-end run; it doesn't matter whether it is a single-index or dual-index run. Cluster Count always seems to be the sum of the sequence counts of R1 and R2...

The upper part is the actual R1 fastq file, the lower part is the output from index_summary(run_metrics, level='Barcode').
image
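
For anyone who wants to reproduce the comparison described above without the screenshot, here is a minimal sketch. It does not call the interop library at all: `count_fastq_records` and `reported_to_r1_ratio` are hypothetical helper names introduced only for illustration, and the reported Cluster Count is assumed to be a number you have already read off the index_summary output yourself. The only format assumption is the standard four-lines-per-record FASTQ layout.

```python
import gzip


def count_fastq_records(path):
    """Count records in a FASTQ file (plain or .gz), assuming 4 lines per record.

    Hypothetical helper for illustration; not part of interop.
    """
    opener = gzip.open if str(path).endswith(".gz") else open
    with opener(path, "rt") as handle:
        n_lines = sum(1 for _ in handle)
    if n_lines % 4 != 0:
        raise ValueError(f"{path}: {n_lines} lines is not a multiple of 4")
    return n_lines // 4


def reported_to_r1_ratio(reported_cluster_count, r1_records):
    """Ratio of the reported Cluster Count to the R1 record count.

    A ratio near 1 means the value matches the number of clusters;
    a ratio near 2 matches the behaviour described in this issue,
    where the value looks like R1 + R2 for a paired-end run.
    """
    if r1_records <= 0:
        raise ValueError("r1_records must be positive")
    return reported_cluster_count / r1_records
```

Usage would be along the lines of `reported_to_r1_ratio(reported, count_fastq_records("sample_R1.fastq.gz"))`, where `reported` is the Cluster Count for that barcode and the file name is a placeholder for your own R1 file.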

@ezralanglois
Member

Ya, you are right. I misinterpreted the code.
