-
Notifications
You must be signed in to change notification settings - Fork 8
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Think I can't connect to Sirius because of the new 5.0 version #33
Comments
Hello, Yes the error is due to SIRIUS4 and the webservices linked to SIRIUS4 are not supported anymore. Could you let me know if you are using MAW to run your dataset and how big the size of your whole dataset (all .mzML files)? I can help with using the workflow interactively with SIRIUS5. Mahnoor |
Alright. Here are few points that you can consider and let me know:
Once you perform the MAW-R module, and have .ms files; I can also write quick script to run SIRIUS for all files instead of using the GUI, but that depends on your preference. Using the script would only be possible if you already have SIRIUS CLI installed in your system and you have already logged in using your credentials in terminal (also which OS you are using)? Also, the workflow for now takes only one file (we are working on a different route for parallelisation), so you can only run one file at a time for now. My next question would be: Would you like a docker container for this task? Let me know your thoughts here or write to me on [email protected] Hope this information helps. Kind regards, |
Dear Mahnoor, Best regards Li Zhihua |
Hi I meet another maybe similar reason problem as below,
Error in unserialize(node$con) : |
Hi I have not found the file spectral_results_for_xxxx.csv in the folder spectral_dereplication |
Hello, Please refer to the updated sections in the "provenance" branch README.md. README.md link from provenance branch I have mentioned SIRIUS5 as a separate section from MAW-R and following the steps to use SIRIUS5 should work now. It is important to make changes in the parameters to the function I hope this is clear enough to run SIRIUS5. Please let me know when you encounter any further issue. |
I would assume that the function wasn't finished because this file is generated after the function is finished. Did you encounter any error message or do you think the function was interrupted? |
If you re run the workflow, do you still get this error? Generally it should work, as it is an error from the parallelisation of the workflow. If you re run and the error persists, let me know. |
Hi, Loading required package: foreach Attaching package: ‘BiocGenerics’ The following objects are masked from ‘package:stats’:
The following objects are masked from ‘package:base’:
Attaching package: ‘S4Vectors’ The following object is masked from ‘package:future’:
The following objects are masked from ‘package:base’:
Loading required package: BiocParallel Attaching package: ‘ProtGenerics’ The following object is masked from ‘package:stats’:
Attaching package: ‘Spectra’ The following object is masked from ‘package:ProtGenerics’:
Attaching package: ‘MsCoreUtils’ The following objects are masked from ‘package:Spectra’:
The following objects are masked from ‘package:ProtGenerics’:
The following object is masked from ‘package:stats’:
Attaching package: ‘dplyr’ The following object is masked from ‘package:MsCoreUtils’:
The following objects are masked from ‘package:S4Vectors’:
The following objects are masked from ‘package:BiocGenerics’:
The following objects are masked from ‘package:stats’:
The following objects are masked from ‘package:base’:
Attaching package: ‘rvest’ The following object is masked from ‘package:readr’:
Loading required package: Rcpp Attaching package: ‘curl’ The following object is masked from ‘package:readr’:
Error in unserialize(node$con) : |
Dear Mahnoor, |
Sorry for the delayed response here, I am currently trying to fix all these issues with CWL, but it is still not ready yet. So to work around these issues here are some solutions:
You will get the spectral database results, and in insilico folder you will find metfrag parameter files (if you want to use metfrag CLI) and SIRIUS parameter files (that you can use to run SIRIUS or SIRIUS CLI). once you have these results, ideally you can use MAW-Py to perform candidate selection. I have already prepared scripts for candidate selection with Spectral DBs and Metfrag and but due to some changes in SIRIUS results, that is not currently possible. However, I will work on it after the MAW-CWL is ready. Let me know if this solves the problem. If the future library problem persists, can you send me an example file, and your systems RAM? I will try to recreate the error and solve it with Luiz. Kind regards, |
Dear Prof.Mahnoor,
Thank you very much for your explanation! I have tried to fix this problem.
1. Until now, I have changed the plan(list(
tweak(multisession, workers = ((n.cores + 5) %/% 3) %/% 2),
tweak(multisession, workers = ((n.cores + 5) %/% 3) %/% 2),
tweak(multisession, workers = 3)
)) in the Workflow_R_Script.r to plan(multisession, workers=8) to reduce (balance) the core and memory. I find it maybe works but needs a very long time such as several days (My data includes six files and each file is about 100MB), I started two days ago, but it is still not finished until now.
2. I will try your suggestions when I go back to the office and tell you the results
3. As you know, there are several very excellent software for processing MS data such as MS-DIAL, XCMS, and MZmine2(3), So I have a suggestion that would the MAW design to take the results of MS-DAIL,XCMS, and MZmine as inputs like GNPS?. This will facilitate and reduce the memory used for substream analysis.
I want to express thanks for your idea in MAW again and I really very like it.
Best regards
Li Zhihua
…________________________________
发件人: Mahnoor Zulfiqar ***@***.***>
发送时间: 2023年5月1日 20:41
收件人: zmahnoor14/MAW ***@***.***>
抄送: Li Zhihua ***@***.***>; Comment ***@***.***>
主题: Re: [zmahnoor14/MAW] Think I can't connect to Sirius because of the new 5.0 version (Issue #33)
Sorry for the delayed response here, I am currently trying to fix all these issues with CWL, but it is still not ready yet. So to work around these issues here are some solutions:
* Running spectral database dereplication: I understand the error is from future library and since my collaborator @lgadelha<https://github.com/lgadelha> added this function, I will ask him for this possible error. While I run the files that I have here I don't get this error so, I would ask you to send me one mzML example so that I also take a look at it and try to reproduce the error ***@***.******@***.***
Meanwhile you can try to run the latest R script. You will need the following:
1. Workflow_R_Script_all.r<https://github.com/zmahnoor14/MAW/blob/provenance/cwl/Workflow_R_Script_all.r>
2. All Spectral databases downloaded from Zenodo<https://zenodo.org/record/7519270>
3. your one input file .mzML
4. COCONUT database for MetFrag<https://zenodo.org/record/7704937>
Now, first you pull latest docker image and run the workflow script from the docker container. This docker container won;t have all these files, so I recommend keeping all the above files (1,2,3,4 points) in your working directory (or pwd).
docker pull zmahnoor/maw-r:1.0.8 docker run --name checkMAW_R -v $(pwd):/opt/workdir -i -t zmahnoor/maw-r:1.0.8 /bin/bash
This way, you will have all your files in your pwd(or working directory) mounted to /opt/workdir inside the container. Now from within the container you can write:
Rscript Workflow_R_Script_all.r your_input.mzML gnps.rda hmdb.rda mbankNIST.rda any-string-as-id 15 coconut COCONUT_Jan2022.csv &
This will give you a PID for this task within the container. You can disown id to be able to run it in the background. To leave the container you can CTRL+p and then CTRL+q to exit in daemon mode.
Now your container will run in the background and write all the outputs in your pwd.
You will get the spectral database results, and in insilico folder you will find metfrag parameter files (if you want to use metfrag CLI) and SIRIUS parameter files (that you can use to run SIRIUS or SIRIUS CLI).
once you have these results, ideally you can use MAW-Py to perform candidate selection. I have already prepared scripts for candidate selection with Spectral DBs and Metfrag and but due to some changes in SIRIUS results, that is not currently possible. However, I will work on it after the MAW-CWL is ready.
Let me know if this solves the problem. If the future library problem persists, can you send me an example file, and your systems RAM? I will try to recreate the error and solve it with Luiz.
Kind regards,
Mahnoor
―
Reply to this email directly, view it on GitHub<#33 (comment)>, or unsubscribe<https://github.com/notifications/unsubscribe-auth/AJI2BJOWVEWDOQADQEAFPPTXD6VQBANCNFSM6AAAAAAV2JNORA>.
You are receiving this because you commented.Message ID: ***@***.***>
|
Dear Prof.Mahnoor,
1. Using the example Example_Tyrosine.mzML, it will works. but it will report the error when using my data (6 mzML and each mzML is about 100MB) , I will send my mzML data to you when I go back to office (May be 2 days later)
2.https://gnps.ucsd.edu/ProteoSAFe/static/gnps-splash.jsp
[https://www.bing.com/th?id=OVP.gH_3vrOStbB5sgdNcWWMiwEsCo&pid=Api]<https://gnps.ucsd.edu/ProteoSAFe/static/gnps-splash.jsp>
GNPS - Analyze, Connect, and Network with your Mass Spectrometry Data<https://gnps.ucsd.edu/ProteoSAFe/static/gnps-splash.jsp>
The Global Natural Product Social Molecular Networking (GNPS) site creates a community for natural product researchers working with mass spectrometry data.
gnps.ucsd.edu
[cid:6a423f58-8d28-4c2e-9b76-206fb2a17ad1]
Thanks again!
Best regards
Li Zhihua
…________________________________
发件人: Mahnoor Zulfiqar ***@***.***>
发送时间: 2023年5月1日 20:41
收件人: zmahnoor14/MAW ***@***.***>
抄送: Li Zhihua ***@***.***>; Comment ***@***.***>
主题: Re: [zmahnoor14/MAW] Think I can't connect to Sirius because of the new 5.0 version (Issue #33)
Sorry for the delayed response here, I am currently trying to fix all these issues with CWL, but it is still not ready yet. So to work around these issues here are some solutions:
* Running spectral database dereplication: I understand the error is from future library and since my collaborator @lgadelha<https://github.com/lgadelha> added this function, I will ask him for this possible error. While I run the files that I have here I don't get this error so, I would ask you to send me one mzML example so that I also take a look at it and try to reproduce the error ***@***.******@***.***
Meanwhile you can try to run the latest R script. You will need the following:
1. Workflow_R_Script_all.r<https://github.com/zmahnoor14/MAW/blob/provenance/cwl/Workflow_R_Script_all.r>
2. All Spectral databases downloaded from Zenodo<https://zenodo.org/record/7519270>
3. your one input file .mzML
4. COCONUT database for MetFrag<https://zenodo.org/record/7704937>
Now, first you pull latest docker image and run the workflow script from the docker container. This docker container won;t have all these files, so I recommend keeping all the above files (1,2,3,4 points) in your working directory (or pwd).
docker pull zmahnoor/maw-r:1.0.8 docker run --name checkMAW_R -v $(pwd):/opt/workdir -i -t zmahnoor/maw-r:1.0.8 /bin/bash
This way, you will have all your files in your pwd(or working directory) mounted to /opt/workdir inside the container. Now from within the container you can write:
Rscript Workflow_R_Script_all.r your_input.mzML gnps.rda hmdb.rda mbankNIST.rda any-string-as-id 15 coconut COCONUT_Jan2022.csv &
This will give you a PID for this task within the container. You can disown id to be able to run it in the background. To leave the container you can CTRL+p and then CTRL+q to exit in daemon mode.
Now your container will run in the background and write all the outputs in your pwd.
You will get the spectral database results, and in insilico folder you will find metfrag parameter files (if you want to use metfrag CLI) and SIRIUS parameter files (that you can use to run SIRIUS or SIRIUS CLI).
once you have these results, ideally you can use MAW-Py to perform candidate selection. I have already prepared scripts for candidate selection with Spectral DBs and Metfrag and but due to some changes in SIRIUS results, that is not currently possible. However, I will work on it after the MAW-CWL is ready.
Let me know if this solves the problem. If the future library problem persists, can you send me an example file, and your systems RAM? I will try to recreate the error and solve it with Luiz.
Kind regards,
Mahnoor
―
Reply to this email directly, view it on GitHub<#33 (comment)>, or unsubscribe<https://github.com/notifications/unsubscribe-auth/AJI2BJOWVEWDOQADQEAFPPTXD6VQBANCNFSM6AAAAAAV2JNORA>.
You are receiving this because you commented.Message ID: ***@***.***>
|
Dear Prof.Mahnoor,
1.Just Now, the MAW is still works (May be 4-5days log), it works at the parse_data step as below. The computer is 64 RAM and Ubuntu 18.04.
2. The mzML is about 100MB and it is difficult to submit it through the hotmail Email.
3. Becuse it cannot access the zenodo in China, so I copy the GNPs.rda, hmdb.rda... from the maw-r:1.0.1, but I can not find the COCONUT database for MetFrag, So where I can find it in the maw-r:1.0.1[https://res-h3.public.cdn.office.net/assets/mail/file-icon/png/generic_16x16.png]QC21.mzML<https://1drv.ms/u/s!An_qG-a0TjfhgVO604WuokuiQU6g>
1. Workflow_R_Script_all.r<https://github.com/zmahnoor14/MAW/blob/provenance/cwl/Workflow_R_Script_all.r>
2. All Spectral databases downloaded from Zenodo<https://zenodo.org/record/7519270>
3. your one input file .mzML
4. COCONUT database for MetFrag<https://zenodo.org/record/7704937>
Best regards
Li Zhihua
[cid:aaafae04-3cce-410a-98e4-65bb641e561f]
…________________________________
发件人: Mahnoor Zulfiqar ***@***.***>
发送时间: 2023年5月1日 20:41
收件人: zmahnoor14/MAW ***@***.***>
抄送: Li Zhihua ***@***.***>; Comment ***@***.***>
主题: Re: [zmahnoor14/MAW] Think I can't connect to Sirius because of the new 5.0 version (Issue #33)
Sorry for the delayed response here, I am currently trying to fix all these issues with CWL, but it is still not ready yet. So to work around these issues here are some solutions:
* Running spectral database dereplication: I understand the error is from future library and since my collaborator @lgadelha<https://github.com/lgadelha> added this function, I will ask him for this possible error. While I run the files that I have here I don't get this error so, I would ask you to send me one mzML example so that I also take a look at it and try to reproduce the error ***@***.******@***.***
Meanwhile you can try to run the latest R script. You will need the following:
1. Workflow_R_Script_all.r<https://github.com/zmahnoor14/MAW/blob/provenance/cwl/Workflow_R_Script_all.r>
2. All Spectral databases downloaded from Zenodo<https://zenodo.org/record/7519270>
3. your one input file .mzML
4. COCONUT database for MetFrag<https://zenodo.org/record/7704937>
Now, first you pull latest docker image and run the workflow script from the docker container. This docker container won;t have all these files, so I recommend keeping all the above files (1,2,3,4 points) in your working directory (or pwd).
docker pull zmahnoor/maw-r:1.0.8 docker run --name checkMAW_R -v $(pwd):/opt/workdir -i -t zmahnoor/maw-r:1.0.8 /bin/bash
This way, you will have all your files in your pwd(or working directory) mounted to /opt/workdir inside the container. Now from within the container you can write:
Rscript Workflow_R_Script_all.r your_input.mzML gnps.rda hmdb.rda mbankNIST.rda any-string-as-id 15 coconut COCONUT_Jan2022.csv &
This will give you a PID for this task within the container. You can disown id to be able to run it in the background. To leave the container you can CTRL+p and then CTRL+q to exit in daemon mode.
Now your container will run in the background and write all the outputs in your pwd.
You will get the spectral database results, and in insilico folder you will find metfrag parameter files (if you want to use metfrag CLI) and SIRIUS parameter files (that you can use to run SIRIUS or SIRIUS CLI).
once you have these results, ideally you can use MAW-Py to perform candidate selection. I have already prepared scripts for candidate selection with Spectral DBs and Metfrag and but due to some changes in SIRIUS results, that is not currently possible. However, I will work on it after the MAW-CWL is ready.
Let me know if this solves the problem. If the future library problem persists, can you send me an example file, and your systems RAM? I will try to recreate the error and solve it with Luiz.
Kind regards,
Mahnoor
―
Reply to this email directly, view it on GitHub<#33 (comment)>, or unsubscribe<https://github.com/notifications/unsubscribe-auth/AJI2BJOWVEWDOQADQEAFPPTXD6VQBANCNFSM6AAAAAAV2JNORA>.
You are receiving this because you commented.Message ID: ***@***.***>
|
Hi Mahnoor,
After I pull the WAM:1.0.8, there is no Rscript Workflow_R_Script_all.r
Best regards
Li Zhihua
获取 Outlook for iOS<https://aka.ms/o0ukef>
…________________________________
发件人: li zhihua ***@***.***>
发送时间: Monday, May 1, 2023 9:30:17 PM
收件人: zmahnoor14/MAW ***@***.***>
主题: 回复: [zmahnoor14/MAW] Think I can't connect to Sirius because of the new 5.0 version (Issue #33)
Dear Prof.Mahnoor,
1. Using the example Example_Tyrosine.mzML, it will works. but it will report the error when using my data (6 mzML and each mzML is about 100MB) , I will send my mzML data to you when I go back to office (May be 2 days later)
2.https://gnps.ucsd.edu/ProteoSAFe/static/gnps-splash.jsp
[https://www.bing.com/th?id=OVP.gH_3vrOStbB5sgdNcWWMiwEsCo&pid=Api]<https://gnps.ucsd.edu/ProteoSAFe/static/gnps-splash.jsp>
GNPS - Analyze, Connect, and Network with your Mass Spectrometry Data<https://gnps.ucsd.edu/ProteoSAFe/static/gnps-splash.jsp>
The Global Natural Product Social Molecular Networking (GNPS) site creates a community for natural product researchers working with mass spectrometry data.
gnps.ucsd.edu
[cid:6a423f58-8d28-4c2e-9b76-206fb2a17ad1]
Thanks again!
Best regards
Li Zhihua
________________________________
发件人: Mahnoor Zulfiqar ***@***.***>
发送时间: 2023年5月1日 20:41
收件人: zmahnoor14/MAW ***@***.***>
抄送: Li Zhihua ***@***.***>; Comment ***@***.***>
主题: Re: [zmahnoor14/MAW] Think I can't connect to Sirius because of the new 5.0 version (Issue #33)
Sorry for the delayed response here, I am currently trying to fix all these issues with CWL, but it is still not ready yet. So to work around these issues here are some solutions:
* Running spectral database dereplication: I understand the error is from future library and since my collaborator @lgadelha<https://github.com/lgadelha> added this function, I will ask him for this possible error. While I run the files that I have here I don't get this error so, I would ask you to send me one mzML example so that I also take a look at it and try to reproduce the error ***@***.******@***.***
Meanwhile you can try to run the latest R script. You will need the following:
1. Workflow_R_Script_all.r<https://github.com/zmahnoor14/MAW/blob/provenance/cwl/Workflow_R_Script_all.r>
2. All Spectral databases downloaded from Zenodo<https://zenodo.org/record/7519270>
3. your one input file .mzML
4. COCONUT database for MetFrag<https://zenodo.org/record/7704937>
Now, first you pull latest docker image and run the workflow script from the docker container. This docker container won;t have all these files, so I recommend keeping all the above files (1,2,3,4 points) in your working directory (or pwd).
docker pull zmahnoor/maw-r:1.0.8 docker run --name checkMAW_R -v $(pwd):/opt/workdir -i -t zmahnoor/maw-r:1.0.8 /bin/bash
This way, you will have all your files in your pwd(or working directory) mounted to /opt/workdir inside the container. Now from within the container you can write:
Rscript Workflow_R_Script_all.r your_input.mzML gnps.rda hmdb.rda mbankNIST.rda any-string-as-id 15 coconut COCONUT_Jan2022.csv &
This will give you a PID for this task within the container. You can disown id to be able to run it in the background. To leave the container you can CTRL+p and then CTRL+q to exit in daemon mode.
Now your container will run in the background and write all the outputs in your pwd.
You will get the spectral database results, and in insilico folder you will find metfrag parameter files (if you want to use metfrag CLI) and SIRIUS parameter files (that you can use to run SIRIUS or SIRIUS CLI).
once you have these results, ideally you can use MAW-Py to perform candidate selection. I have already prepared scripts for candidate selection with Spectral DBs and Metfrag and but due to some changes in SIRIUS results, that is not currently possible. However, I will work on it after the MAW-CWL is ready.
Let me know if this solves the problem. If the future library problem persists, can you send me an example file, and your systems RAM? I will try to recreate the error and solve it with Luiz.
Kind regards,
Mahnoor
―
Reply to this email directly, view it on GitHub<#33 (comment)>, or unsubscribe<https://github.com/notifications/unsubscribe-auth/AJI2BJOWVEWDOQADQEAFPPTXD6VQBANCNFSM6AAAAAAV2JNORA>.
You are receiving this because you commented.Message ID: ***@***.***>
|
Dear Li Zhihua, sorry for my late response, I am still a PhD student and currently trying to finish my projects and have a lot of work on hands : ) Here is the link to the COCONUT installation: https://upload.uni-jena.de/data/6459b9bd932378.80487188/COCONUT_Jan2022.csv I would also suggest to use the updated HMDB5.0 version and here is the link for the download: https://upload.uni-jena.de/data/6459ba4f540612.50374105/hmdb.rda Regarding Workflow_R_Script_all.r, you can download the latest one on the GitHub repository with the link: https://github.com/zmahnoor14/MAW/blob/provenance/cwl/Workflow_R_Script_all.r |
Hi,
I would like to try your tool and workflow but when I launch your data test I obtain a lot of informations but the following also :
And after it restart trying to connect in loop and never finish.... Think that Sirius closed all connection with 4.9 version...
Can someone help me please ?
The text was updated successfully, but these errors were encountered: