-
Notifications
You must be signed in to change notification settings - Fork 13
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Cproject Structure Query #65
Comments
@danmaclean also see: ContentMine/cproject#10 .
|
Dan, CM data structure is intentionally somewhat fluid because we are reacting to the very wide range of structures and information that people use in scientific communication. The philosophy is perhaps similar to JSON and other lightly typed structures rather than the rigidity of XML schemas and DTDs.
This means that a parser will have fewer hard coded names and more that are determined at runtime. I think that JSON is a good analogy here (and indeed the output could be transformed into JSON). It makes parsing more challenging than hardcoded names and means that tools such as XPath and JSONPath are often useful. (The info is probably also out of date in places - sorry! but that is often the case with evolving projects.). |
Hi,
This document https://github.com/ContentMine/workshop-resources/tree/master/software-tutorials/cproject seems to claim to be definitive about Cproject structure, but seems to be at odds with this document about the output of
ami
https://github.com/ContentMine/workshop-resources/blob/master/software-tutorials/ami/README.md#ami2-species. In the CProject definition the extent of say asequence
results directory looks to be much simpler than the apparent results described in the tutorial.CProject folder structure:
ami output tutorial
Im trying to write a parser for CProjects, could you let me know whether the
ami
tools are going to produce lots of directories (e.gami2seq
will generatesequence/sequencetype
folders or, as the CProject document suggests, will it generate just thesequence/dnaprimer
folder? Or is the info in one of these docs out of date?Thanks for clarification.
The text was updated successfully, but these errors were encountered: