-
Notifications
You must be signed in to change notification settings - Fork 2
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
DynamoDB: Use ON CONFLICT DO NOTHING
clause on INSERT
CDC operations
#77
Conversation
Discussion SummaryQ&A
Do nothing on DuplicateKeyExceptions@wierdvanderhaar already responded:
@hammerhead added:
Overwrite Option@amotl asked:
@hammerhead responded:
|
... to mitigate errors when events are relayed redundantly from retries after partially failed batches on the Lambda processor.
928a99a
to
671b47d
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Makes sense I believe, thanks!
@amotl Is it possible to log the conflicts? I could imagine that the user would like to know how often it happens and maybe see if there is a pattern. |
I don't know if ON CONFLICT DO ... has any means of logging, and how to retrieve it. An alternative would probably be to omit it, and catch the exception in Python code, in order to be able to converge it into a log event. Because CDC operations are not of the same high volume / throughput like full-load operations, it might be acceptable performance-wise. |
I couldn't find DEBUG/TRACE logging in the existing code that could be switched on to facilitate this. |
Thanks for checking, Marios.
@kneth: If you don't have any objections, and considering my other evaluations, let's merge and release this now, and follow up with a more elaborate external implementation later, which will be doing it how other vendors are also doing it. |
Hi again. We will integrate and release this update, and will provide a more elaborate implementation later, which is closer to what AWS DMS and others are offering. We will use that ticket to track progress: |
Reason
... to mitigate errors when events are relayed redundantly from retries after partially failed batches on the Lambda processor.
References
DuplicateKeyException
after Lambda is resuming CDC operations cratedb-toolkit#301/cc @kneth, @surister, @hlcianfagna