Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Improve handling very large amounts of peptides: make merging step more efficient, implement batches #61

Open
MarcIsak opened this issue Apr 22, 2020 · 1 comment
Assignees
Labels

Comments

@MarcIsak
Copy link

Hi,

I tried to run MS2PIP v.3.6.1 from the command line in Ubuntu 18.04. I supplied the config text file and the .peprec file like this when I ran the command:

ms2pip -c config_ms2pip.txt -n 12 precursors.peprec

The computations does start and it runs to the very end, but is then killed for some unknown reason. The console prints the following:

merging results....
killed

I also tried to reduce the number of CPUs used to 6, but the process was killed similarly to above.

As I cannot find any output file (I assume it should be put in the working folder as one only specifies the output format and not the location), my layman guess is that the program has a bug somewhere.

Please find attached the input files I used with ms2pip.

Best,

Marc

ms2pip_files.zip

@RalfG RalfG self-assigned this Apr 22, 2020
@RalfG
Copy link
Member

RalfG commented Apr 24, 2020

Hi Marc,

Thanks for sharing your input files! There is indeed a problem in MS²PIP where, while predicting a large amount (>200 000) of spectra, it takes a disproportionate amount of time (and RAM) to merge the predictions after the parallelized steps. We are planning to fix this issue in the near future by predicting the spectra in multiple batches.

In the mean time, I ran the predictions for your input file by splitting it into multiple parts. You can find the split input files and predictions, and the merged predictions, here. If you have any further questions, I'd be happy to help.

Best,
Ralf

@RalfG RalfG changed the title Process is killed after merging of results Improve handling very large amounts of peptides: make merging step more efficient, implement batches Apr 24, 2020
@RalfG RalfG added the bug label Apr 24, 2020
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

2 participants