Synchronise efforts and ideas on Validator and Datastore #2526
siemvaessen
started this conversation in
Ideas
Replies: 2 comments
-
Guess what, I have plenty of thoughts on this ;-) On validation itself:
On ingesting datasets:
|
Beta Was this translation helpful? Give feedback.
0 replies
-
Thanks @rolfkleef ~ we now have a branch that will check for data changes on activity level, while keeping the current dataset level change sha1 concept which ties into the Validator intact. Further down the line the Validator should follow this concept, so data is processed in an optimal way. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
As both the IATI Validator and IATI Datastore have a dependency and have been built as separate services under different contracts by different suppliers we need to align efforts and ideas to ensure both services are in par with the objectives raised in the IATI context. One major enhancements come to mind that have require a similar approach and will benefit all end user.
Process data on activity level rather than dataset level
Both Datastore and Validator will be able to process data way faster as Datastore currently provides sha1 keys for change detection in datasets. The current process is based on reviewing a dataset and when datachange is detected, sha1 key is updated, validator picks up the new sha1 key, checks the dataset and the datastore will then start processing that dataset. In the new scenario Datastore keeps track of not just the sha1 keys for each dataset that it retrieves from the IATI (metadata) Registry but keeps track of sha1 keys for each and every activity. If this is in place Datastore will only process an IATI Activity when its sha1 keys changes and Validator will only pickup the changed IATI Activity with changes. This will increase data processing (in both services)
Pro's:
Con's:
Beta Was this translation helpful? Give feedback.
All reactions