Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Stability improvements #24

Merged
merged 15 commits into from
Jan 5, 2024
Merged

Stability improvements #24

merged 15 commits into from
Jan 5, 2024

Conversation

sachin-j-joshi
Copy link
Contributor

@sachin-j-joshi sachin-j-joshi commented Nov 27, 2023

Cumulative Merge of stability improvements

All tests should pass

ShwethaSNayak and others added 13 commits October 17, 2023 22:08
Improved error handling.
Separate parallel tasks for File watch and processing files
Checks for File reads likes valid directory path, file format etc
Pre-condition checks before commit/abort the transactions.
Added more logs for debugging.

Signed-off-by: Shwetha N <[email protected]>
Support ingestion from multiple directories
Refactor DB interaction. 
Signed-off-by: Shwetha N <[email protected]>
File Ingestion and Processing refactor
Signed-off-by: Shwetha N <[email protected]>
Fix logging in case of invalid files
In case of invalid files, the failed files will be moved to a separate Failed_Files folder next to the database.
This ensures the warning message for invalid files is shown only once.
Fix failing unit tests by fixing parameters.
… latest to reduce vulnerability issues (#22)

Add spotbug and checkstyle support, updated gradle and some jars to latest to reduce vulnerability issues.
Updates the code to extract the parent directory for Failed_Files folder. Path.getParent() method is used to make it platform independent.
…before processing (#28)

Delete filename from pending table if actual file doesn't exist just before processing.
Signed-off-by: Kuldeep Kumar <[email protected]>
Updated configuration properties for all file types(CSV, Parquet and Raw)
Updated wiki for configuration parameters, description and default values.

Signed-off-by: Shwetha N <[email protected]>
Optimize CsvFileIngestService and RawFileIngestService.

CSV Events are now batched in a list and entire batch is converted to json. Each record is parsed into a hashmap before being added to the batch.
For Raw file events, BufferedInputStream is used now
Also the service class names in test scripts are fixed and new script is added for testing csv files.
PR updates the APP_VERSION for pravega-sensor-collector builds to 0.2.18.
The build-installer.sh script is also modified to publish an executable jar.
Move completed files to a different directory
@sachin-j-joshi sachin-j-joshi marked this pull request as ready for review December 12, 2023 15:22
Introduces new unit tests for the Pravega sensor collector to ensure robust and reliable data collection from sensors.
The tests cover various scenarios, including normal operation, edge cases, and potential error conditions.

Signed-off-by: Shwetha N <[email protected]>
Copy link
Contributor

@ShwethaSNayak ShwethaSNayak left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Overall LGTM

…indows (#34)

Add garbage collection of completed files by moving it into completed_files directory

2. Modify [PravegaSensorCollectorApp.xml](windows-service/PravegaSensorCollectorApp.xml). Check PRAVEGA_SENSOR_COLLECTOR_RAW1_PRAVEGA_CONTROLLER_URI.
4. Add the path to the jar file `pravega-sensor-collector/build/libs/pravega-sensor-collector-${APP_VERSION}.jar` generated by build-installer.sh in PravegaSensorCollectorApp.xml.

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

3 seems to be missing

@abhinb abhinb self-requested a review January 5, 2024 10:32
@abhinb abhinb merged commit b0b7aa3 into master Jan 5, 2024
2 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

7 participants