Commit
S3/GCS/Azure Source: Enhanced Data Reload Strategy for Specific Timeframes (#1187)

* First commit
* Adapted the S3 integration tests to account for the new object key name, which includes the earliest record timestamp. Refactored a few parameter/field names around the object key name.
* The object key format version only applies when the envelope storage is used. It is the only storage that guarantees the record timestamp is preserved. Sink tests have been updated to reflect the object key values.
* Fixes an endless loop in a test when the test fails. Avoids the Avro invalid sync error caused by concurrent tests writing to the same file.
* Expands the object key value to contain the minimum and maximum record timestamps within the file. This change reduces the complexity of the initial seek when a load from a specific point in time is requested.
* Removes an obsolete comment. Calls deleteOnExit for temp files/folders.

---------

Co-authored-by: stheppi <[email protected]>
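The benefit of carrying both timestamps in the object key can be illustrated with a minimal sketch. The key layout, the `ObjectKey` class, and the `parse_key` and `keys_to_read` helpers below are all hypothetical and are not taken from the connector's code; the commit message does not show the actual key format. Assuming each key encodes the minimum and maximum record timestamps of its file, a reload from a given point in time can skip files by name alone, without opening them:

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class ObjectKey:
    """Hypothetical object key carrying the record timestamp range of a file."""
    prefix: str
    offset: int
    min_ts: int  # earliest record timestamp in the file (epoch millis)
    max_ts: int  # latest record timestamp in the file (epoch millis)

    def render(self) -> str:
        # Assumed layout: <prefix>/<offset>_<min_ts>_<max_ts>.avro
        return f"{self.prefix}/{self.offset:012d}_{self.min_ts}_{self.max_ts}.avro"


def parse_key(key: str) -> Optional[ObjectKey]:
    """Parse a key in the assumed layout; return None for any other layout."""
    prefix, _, name = key.rpartition("/")
    stem, dot, _ext = name.rpartition(".")
    parts = stem.split("_")
    if not dot or len(parts) != 3 or not all(p.isdigit() for p in parts):
        return None
    offset, min_ts, max_ts = (int(p) for p in parts)
    return ObjectKey(prefix, offset, min_ts, max_ts)


def keys_to_read(keys: List[str], start_ts: int) -> List[str]:
    """Keep only files that may hold records at or after start_ts.

    A file whose max timestamp is below start_ts cannot contain a matching
    record, so it is skipped. Keys that do not follow the assumed layout are
    also skipped in this sketch. Results are ordered by offset.
    """
    selected = []
    for key in keys:
        parsed = parse_key(key)
        if parsed is not None and parsed.max_ts >= start_ts:
            selected.append((parsed.offset, key))
    return [key for _, key in sorted(selected)]
```

With only the earliest timestamp in the key, deciding whether a file ends before the requested start time would require looking at the next file or reading the file itself; having the maximum in the key makes the check local to each key.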
Showing 39 changed files with 1,056 additions and 526 deletions.