Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

New meta data field to link to validation record #42

Open
jetschny opened this issue Nov 19, 2024 · 10 comments
Open

New meta data field to link to validation record #42

jetschny opened this issue Nov 19, 2024 · 10 comments
Assignees

Comments

@jetschny
Copy link

While we were discussion how to best integrate our FAIRiCUBE validate to ongoing processes: For data validation and processing validation it would make the most sense if we can create an [external] validation record (in the simplest way an online excel sheet from a MS / Google forms) and link it to the data meta record and a&p meta data record, respectively.

We would therefore need a meta-data field in both these meta data schemas which can hold a link to the validation record as text URL.

@cozzolinoac11
Copy link
Member

Hi @jetschny
regarding the analysis and processing resources metadata catalog, our idea is to create a new mandatory URI-type field called 'Validation' containing a link to a validation report.
Waiting for approval to implement.

@mari-s4e
Copy link

In the catalog editor there are already two fields, data quality and quality control. Could these be used for data validation? @baloola can either of the two fields store URLs?
image

@jetschny
Copy link
Author

Hi @jetschny regarding the analysis and processing resources metadata catalog, our idea is to create a new mandatory URI-type field called 'Validation' containing a link to a validation report. Waiting for approval to implement.

that sounds good but we should not make the field mandatory as we do not intend to enforce the validation. Not every individual processing step might have an individual validation record. I foresee that we rather have a validation record for each processing pipeline. One way forward is then to use the same link for all processing steps or leave some validation fields empty...

If there is a field called "Quality Control" in the data catalog that we can use, we might also call it QC in the a&p metadata?

@gmartirano
Copy link

@jetschny, ok to set the multiplicity as 0..*

@jetschny, @mari-s4e, I doubt about the appropriateness of Quality Control field to store one or more URIs pointing to a validation report.
Semantically speaking, from D4.2 I see that the "custom" field Quality control is meant to "Describe any quality measures (standardised calibration, repeated samples or measurements, data capture, data entry validation, peer review of data, or representation with controlled vocabularies)".
A validation report should instead contain information about the results of a validation process.
Very likely it could happen that the results of a validation process represent just a subset of information to be provided as Quality control.
Of course, it depends also on what the validation process consist of.
As conclusion, my proposal is to create a new Validation "custom" field, to be added as third field of Data Quality section in the dataset metadata template, as well as as new field in the a/p resource metadata template.
Alternatively (but it is my less preferred option), we could use the already existent Quality control field in the dataset metadata template and create a new Quality control "custom" field in the a/p resource metadata template, modifying the current definition as follows: "Describe any quality measures (standardised calibration, repeated samples or measurements, data capture, data entry validation, peer review of data, or representation with controlled vocabularies) or provide one or more URIs pointing to a validation report".
What do you think?

@jetschny
Copy link
Author

we dont want to bend / corrupt our meta data concept too much. if the clean solution is to create a new field "validation" in both meta data stacks, then we should do it....

@gmartirano
Copy link

After having spent additional time to cross-check in D1.2 the overall and specific FAIRiCUBE validation approach (including the validation process), I'm even more convinced that the option to create a new field "validation" is the best one.
So, let's go for it!
EPSIT will take care of it in the a/p resource metadata template.
@baloola, could you please take care of it in the datasets metadata template?
Thanks!

@baloola
Copy link
Member

baloola commented Nov 22, 2024

I will add a validation field, which is a url link to the validation sheet.

There are currently no stac field or an extension to use here ( in the final stac product), so it will be some new proposed validation field

@cozzolinoac11
Copy link
Member

Hi @baloola,
probably this new validation field should be inserted in the Data Quality section in Maria's screenshot. Maybe under Data Quality and Data Control?

What do you think?

@KathiSchleidt
Copy link
Member

Can confirm that this should be under Data Quality - please propose fitting fields.

@baloola
Copy link
Member

baloola commented Nov 26, 2024

added validatioin url under Data Quality

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

7 participants