Improve masking #122

rhysgt · 2024-04-15T14:00:27Z

Masking is now performed on accessing data in the datastore.

Removed the preview function in set_mask since it is now extraneous as the original data is not being overwritten.

We should consider moving cropping and masking from hrdic into base

Masking is now performed on accessing data in the datastore

pytest_cases is not compatible with pytest 8 :/

Tests now pass

…hould be masked.

mikesmic · 2024-04-17T09:53:41Z

I've made some changes, can you check it works as expected? You can set the mask with:
dic_map.data.generate('mask', mask=bool_array).
How are these masks generated? Is it quite standard or do you change things about to fit the data?
We need to change the mask function so the stored data is not mutated. This can maybe be done by casting as a masked array and then getting the filled array.

Replace mutation of stored data with numpy masked array Better way to set null mask?

rhysgt · 2024-04-17T15:00:19Z

Have fixed the problems you stated I think - using a masked array instead of mutation and now generates a null mask in a better way (?)

Moat of the time, the masks I used are quite straightforward, for example (from docs):

To remove data points in dic_map where max_shear is above 0.8, use:

mask = dic_map.data.max_shear > 0.8

To remove data points in dic_map where e11 is above 1 or less than -1, use:

mask = (dic_map.data.e[0, 0] > 1) | (dic_map.data.e[0, 0] < -1)

To remove data points in dic_map where corrVal is less than 0.4, use:

mask = dic_map.corr_val < 0.4

rhysgt · 2024-04-18T09:43:11Z

Also - there is an inconsistency in function naming - calc_mask and set_crop?

mikesmic · 2024-04-18T11:01:53Z

I did call it set_mask but it would be confusing because it doesn't set anything, it just creates a mask image that the generate function uses. set_crop actually sets crop boundary values. I don't know about passing out masked arrays, will they work with everything else in the library? Although I looked at masking yesterday and I couldn't find a way to create an array with nans set for masked values without making a copy of the data. I need to look through the logic for the making again, my goal was to only run the masking function if a mask is set.

rhysgt · 2024-04-18T11:49:19Z

As far as I'm aware, everything still seems to works as expected with a masked array.

It does incur an overhead (but much smaller by a factor 1000 than the previous method).

Do we need to change the logic to not use masked arrays for data that isn't masked? Currently a masked array is always generated.

rhysgt · 2024-04-24T15:37:17Z

A masked array is only returned when a mask is provided. If unset, the normal map data is passed through as before.

rhysgt added 4 commits April 15, 2024 14:44

Improve masking

3cb1161

Masking is now performed on accessing data in the datastore

Fix pytest bug

b6bef39

pytest_cases is not compatible with pytest 8 :/

Update base.py

1c5459d

Tests now pass

Update hrdic.py

0ac1cde

rhysgt requested a review from mikesmic as a code owner April 15, 2024 14:00

Store mask array in datastore and use metadata key to check if data s…

3c639ca

…hould be masked.

rhysgt added 3 commits April 17, 2024 15:53

Update hrdic.py

7a4f5ae

Replace mutation of stored data with numpy masked array Better way to set null mask?

Update CHANGELOG.md

b44d330

Add paper

d2fc3b9

Pass a normal array is masking turned off

4ba91e5

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve masking #122

Improve masking #122

rhysgt commented Apr 15, 2024

mikesmic commented Apr 17, 2024

rhysgt commented Apr 17, 2024

rhysgt commented Apr 18, 2024

mikesmic commented Apr 18, 2024

rhysgt commented Apr 18, 2024 •

edited

Loading

rhysgt commented Apr 24, 2024

Improve masking #122

Are you sure you want to change the base?

Improve masking #122

Conversation

rhysgt commented Apr 15, 2024

mikesmic commented Apr 17, 2024

rhysgt commented Apr 17, 2024

rhysgt commented Apr 18, 2024

mikesmic commented Apr 18, 2024

rhysgt commented Apr 18, 2024 • edited Loading

rhysgt commented Apr 24, 2024

rhysgt commented Apr 18, 2024 •

edited

Loading