Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

In nrow(A_norm) * ncol(A_norm) : NAs produced by integer overflow #30

Open
haoqing12 opened this issue Apr 30, 2024 · 0 comments
Open

Comments

@haoqing12
Copy link

I am working with a particularly large dataset consisting of 2,423,133 cells and 1,091 genes. Using the ALRA, I ran into repeated warnings about integer overflow. Here’s the warning:

r$> A_norm_completed <- alra(A_norm,k=k_choice$k)[[3]]
Read matrix with 2423133 cells and 1091 genes
Getting nonzeros
Randomized SVD
Find the 0.001000 quantile of each gene
Sweep
Scaling all except for 0 columns
NA% of the values became negative in the scaling process and were set to zero
The matrix went from NA% nonzero to NA% nonzero
Warning messages:
1: In nrow(A_norm) * ncol(A_norm) : NAs produced by integer overflow
2: In nrow(A_norm) * ncol(A_norm) : NAs produced by integer overflow
3: In nrow(A_norm) * ncol(A_norm) : NAs produced by integer overflow

截屏2024-04-30 16 38 17

There doesn't seem to be a good complement of zeros.

Could you please provide any suggestions on how to mitigate this issue? Is there a recommended approach to handling large datasets with your software, or perhaps a parameter adjustment that I might not be aware of?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant