-
Notifications
You must be signed in to change notification settings - Fork 9
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Include doi as required field in meta.yaml #158
Comments
More broadly, we could think about how we want recipes to be cited. Here is example we came up with today for this dataset.
What's missing here is a good citation for the recipe author (in this case, @briannapagan). Brianna, I'm curious, what sort of acknowledgement of your role would make sense here? |
I would second showing the doi/list of doi's. Recipe authors have the responsibility of properly citing the original dataset. As for acknowledging the recipe auhors - is adding a doi for the recipe itself over-doing it? Do we need recipe author acknowledgement at all? Using NASA as an example, the archivers/folks who are working at the data centers do not get acknowledgement for maintaining the data collections themselves, just a nod potentially to the data center itself. |
Thanks for that @briannapagan! I personally think we have the opportunity to change the status quo for the better here. I would personally advocate for a doi per recipe, which I think will acknowledge the important work which will be the foundation of how climate data science might be done in the future? We cannot assume that recipe maintainers are financially compensated for their work (as is the case at NASA?), so I think providing an easy way to acknowledge their efforts would be fair, and might create a needed incentive for a diverse group of people to contribute recipes? A practical consideration for reproducibility: If we e.g. decide to implement a zenodo webhook for feedstocks, we could get a doi + a secondary archived location for the code. This would increase the chance of researchers in the future to actually reproce a given dataset with a particular version of the recipe (even if it has to be run on your local computer). |
Along the same lines, is the recipe maintainer which receives the acknowledgement also responsible for maintaining in perpetuity? I am going to sound like a broken record, but data archives are very much alive. If some reprocessing error is caught, and original data source republished, the zarr store must be updated. Is the onerous on the shoulders of the maintainer to always ensure the zarr store is accurate? How do we connect the upstream data providers to this? |
Also great! +2 for doi per recipe. |
Excellent point! Naively Id think we should aim to |
👋 all, I've moved this issue here to |
I think there are several questions mixed in this discussion:
Any suggestions where to separate the discussion on 2? Moving forward here: I am a strongly for enforcing dois in the meta.yaml as a default! Perhaps we can have some sort of an opt-out option for testing though? |
I am working through a documentation with @rabernat which outlines how a proper citation using pangeo-forge data would look like. We noticed that the catalog page does not display the doi, which is needed to cite the original data source in a paper.
I propose to add a required field in the
meta.yaml
that contains the doi (or possibly a list of dois) for a given dataset. This could be then be used to have a 'copy citation' button on each catalog entry.The text was updated successfully, but these errors were encountered: