Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

File Extension Mismatch in ThermoRawFileParser Conversion Using SearchGUI #364

Open
jacobfh1 opened this issue Oct 12, 2023 · 1 comment
Assignees

Comments

@jacobfh1
Copy link

Hi team!

When utilizing ThermoRawFileParser within SearchGUI to convert .raw files to either .mzML or .mgf formats, there appears to be a discrepancy in the output file extensions.

Conversion from .raw to .mzML results in an output file with the extension .mzML (uppercase 'M' and 'L'). This causes detection issues as the expected extension is .mzml (all lowercase) according to the PeptideShaker.log file.
Conversion from .raw to .mgf produces files with an unexpected extension .mzml.mgf, thereby preventing the detection of the intended [filename].mgf output.

Software Versions Tested:
SearchGUI: v.4.2.7 and v.4.3.1
PeptideShaker: v.2.22.2 and v.3.0.1
Platform: Ubuntu

This behavior suggests a file naming bug that might persist across multiple versions, thereby impacting the post-processing workflow in tools such as PeptideShaker. However, the converted files can be manually utilized with PeptideShaker.

Suggested Remediation:

  1. Ensure output file extensions match the expected formats to facilitate downstream processing.
  2. Review the file naming conventions within ThermoRawFileParser and correct any inconsistencies.

Best,
Jacob

@hbarsnes hbarsnes self-assigned this Oct 12, 2023
@hbarsnes
Copy link
Member

Hi Jacob,

Thanks for the suggestion. However, all we are doing is using ThermoRawFileParser command lines of the following types:

ThermoRawFileParser.exe -i=C:\[...]\qExactive01819.raw -b=C:\[...]\qExactive01819.mzml -f=2 -e -x
or
ThermoRawFileParser.exe -i=C:\[...]\qExactive01819.raw -b=C:\[...]\qExactive01819.mgf -f=2 -e -x

The file names we provide seem to be ignored by ThermoRawFileParser. Here is an excerpt from their GitHub pages: "Output file extension is determined by the used output format and (optional) gzip compression, for example, if format is MGF without gzip compression, the output file will receive .mgf extension, if format is mzML with gzip compression the output file will have .mzML.gz extension. All user input will be standardized to fulfill abovementioned requirements."

Note the upper case mzML extension. In other words, the changes you suggest would require changes to the ThermoRawFileParser code, as in SearchGUI we already (try) to use the conventions you are suggesting. I would recommend that you maks the same suggestion in the ThermoRawFileParser issue tracker?

With regards to the mgf file not being found in SearchGUI, you can simply stick to the default mzml output from ThermoRawFileParser as we will convert to mgf internally for the search engines that do not support mzml.

Best regards,
Harald

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants