Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Query Tool upgrade to query both a/p resources and datasets metadata #4

Open
cozzolinoac11 opened this issue May 28, 2024 · 4 comments
Assignees

Comments

@cozzolinoac11
Copy link
Member

As an outcome of the common topic seminar held on 21.05.2024 (see presentation), below the proposed requirements for Query Tool upgrade to query both a/p resources and datasets metadata.

Concluding remark of the seminar was to document objective/verifiable reasons to discard option b), e.g. inefficiency of querying the catalog via STAC API, especially if querying two or more different collections and using many parameters.
Should option a) be approved, the related requirements and implementation plan are listed in the table below.

Req. Id Requirement Plan Deadline
1 Create an ingestion procedure to ingest dataset metadata in the Knowledge Base database. 1. Read json dataset metadata from GitHub

2. Mapping from json dataset metadata to database table

3. Definition of mechanism to run the procedure
31.10.2024
2 Upgrade current Query Tool to allow complex queries. 1. Definition of additional queries

2. Definition of layout changes

3. Implementation
31.12.2024

Feedback welcome!

@KathiSchleidt
Copy link
Member

I see the decision between option a and option b as the first step, when can we expect this?

Assuming for steps 2.1 & 2.2 (updating the design for the Query Tool), you'll be getting feedback from partners?

@gmartirano
Copy link

I see the decision between option a and option b as the first step, when can we expect this?

I would say after leaving to the coordinators and to the interested partners a reasonable time to provide their feedback. 1 or 2 weeks from now (also depending on possible feedback)?

Assuming for steps 2.1 & 2.2 (updating the design for the Query Tool), you'll be getting feedback from partners?

It would be good to jointly finalise the requirement, e.g. agreeing on what type of "complex queries" may be useful to implement.

@KathiSchleidt
Copy link
Member

@gmartirano trying to weigh the strengths and weaknesses of options A vs. B.

Are you sure that option B is that much more complex to query? Is the query functionality on the STAC API so bad?

On option A, how to you foresee maintaining alignment between the data catalog and your internal DB, nightly updates?

On steps 2.1 & 2.2 (updating the design for the Query Tool), I'd appreciate a dedicated workshop on this functionality. As we also need a workshop with the UC partners on dataset metadata requirements, could make sense to do this jointly

@gmartirano
Copy link

@gmartirano trying to weigh the strengths and weaknesses of options A vs. B.

Are you sure that option B is that much more complex to query? Is the query functionality on the STAC API so bad?

Not entirely sure, but quite confident :)
However, because disregarding option B, should be this the final decision, will have to be based on some evidence, we will investigate a bit more.

On option A, how to you foresee maintaining alignment between the data catalog and your internal DB, nightly updates?

Let's say regular updates. Nightly could be an option.

On steps 2.1 & 2.2 (updating the design for the Query Tool), I'd appreciate a dedicated workshop on this functionality. As we also need a workshop with the UC partners on dataset metadata requirements, could make sense to do this jointly

Ok, good idea!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants