Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Enhance logging by catching expected ERROR's #70

Open
davidverweij opened this issue Apr 8, 2021 · 2 comments
Open

Enhance logging by catching expected ERROR's #70

davidverweij opened this issue Apr 8, 2021 · 2 comments
Labels
data-transfer Data Transfer Protocol feature new feature or request

Comments

@davidverweij
Copy link
Member

In running the BTF and DRM pipelines, a few 'patient_ids' re-occur each run as they have been used for testing. It would be beneficial to have the pipeline skip these instances to free some resources, time as well as remove these from any aggregations, statistics and process overviews.

I'd suggest to add another .csv file to the local folder: just one column containing all the known test accounts. Then, before the patient_id is being resolved, we can add a check that skips when this is the case.

This allows us to manually add values to the .csv when we detect (primarily through logs) new test accounts.

NOTE: this is a low priority enhancement.

@davidverweij davidverweij added feature new feature or request data-transfer Data Transfer Protocol labels Apr 8, 2021
@davidverweij davidverweij changed the title Add a record (.csv) of known test accounts to be ignored by the pipeline Catching known test accounts to be ignored by the pipeline Apr 15, 2021
@davidverweij
Copy link
Member Author

In addition to known test accounts, device are occasionally generating data outside of known wear periods. For example, a device might be tested for 5 minutes a few days before handing to a patients.

We could think about adjusting the inventory history endpoint to return a certain status when no history could be found and the time period is way before it was ever deployed. However, this does mean we make assumptions based on inventory records. Ideally, these checks are not needed if the UCAM api is accurate and up-to-date.

I suggest we do not take action on this particular case. Instead, we can instruct future clinicians in the COS to always check out a device, even for testing, and using test accounts (with "TEST" in the name) to do so. Then, as @jawrainey suggested on Teams, we can catch any test usage from our inventory and exclude these metrics (or add a 'test' amount) to our pipeline stats.

@davidverweij davidverweij changed the title Catching known test accounts to be ignored by the pipeline Enhance logging by catching expected ERROR's Apr 29, 2021
@davidverweij
Copy link
Member Author

With running the pipeline on DEV and LIVE and resolving the unknowns / errors, there are a few expected errors which will improve logging and statistics.

This includes the above mentioned accounts that include TEST, but also device specific records. In particular, there can be a number of recordings for ByteFlies that are less than a minute long - and often in between deployments. In addition, recordings of this length are arguably not useful. I'd therefore suggest we consider recordings with known deviceIDs, but unknown users with a length shorter than ~5 minutes to be expected unknowns.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
data-transfer Data Transfer Protocol feature new feature or request
Projects
None yet
Development

No branches or pull requests

1 participant