nssp patching code #2000

minhkhul · 2024-07-23T23:32:41Z

Description

Add patching feature to nssp.

One detail that set this implementation apart from most of our other indicators is that nssp data comes in our db as weekly form instead of daily.

Basically, all weekly data coming in to our db has the issue_date column always marked as the first date of the epiweek that the reporting date falls into, rather than the reporting date itself. For example, I query nssp data from our api that was just put into db on July 5th 2024 and got something like this:

geo_value                  signal source geo_type time_type time_value  \
0      54079  pct_ed_visits_combined   nssp   county      week 2022-09-25
**issue**  lag  missing_value  missing_stderr  missing_sample_size  value  \
0  **2024-06-30**   93              0               1                    1   3.27

Note the issue is 2024-06-30 despite this data coming in on July 5th. It's because 2024-06-30 is the start date of epiweek 202427, so any data coming in that week will be marked under that issue date.

Furthermore, the reason why we use this directory format is because of this batch_issue_format implementation in acquisition, which demands folder format to be issue_yyyymmdd including for patches of weekly data.

Fixes

Fixes NSSP patching script and NSSP patch april 2024 to july 2024 #1998

nssp/delphi_nssp/patch.py

nssp/tests/test_patch.py

nssp/delphi_nssp/patch.py

nssp/delphi_nssp/pull.py

nssp/delphi_nssp/run.py

nssp/tests/test_patch.py

aysim319 · 2024-08-08T15:47:27Z

nssp/delphi_nssp/patch.py

+from .run import run_module
+
+
+def good_patch_config(params, logger):


I wonder if we can bake this in the read_params() in delphi_utils.... @nmdefries thoughts?
pydantic doesn't a full pledged support for json schema validation, but seeing how the read_params has no validation what so ever, we could look into it sometime in the future.

I agree. I wrote this method and half way through was like hmm this should probably be generalized since it may be applicable to other indicators too.

yeahh I'm like oh I also would like that for mine 👀 and also saw that the current read params isn't doing any validations at all

Yeah, could make sense to add this or a similar fn to delphi_utils. Probably we'd want it and other params validation to be separate fns from read_params, since we don't always want params validation.

I mentioned it the #2002 and I think for right now it's better to remove this and deal with this in another issue

@minhkhul it's up to you to keep or remove the good_patch_config function. While we do intend to move it to utils at some later point, we don't know when that will be. Pros of leaving it here are that it is performing the check, and we can use it as a starting point for the future fn. Cons are that we have to remember it's here and use it, and then also replace it with the utils version once that's available.

nssp/tests/test_patch.py

nssp/delphi_nssp/run.py

nssp/delphi_nssp/pull.py

Co-authored-by: nmdefries <[email protected]>

minhkhul · 2024-08-29T00:46:00Z

@nmdefries
I realized how confusing I wrote the readme to be. So I just made some major changes to that, then added code to auto-download the source backup files here. With this, patcher will only need to adjust params then run env/bin/python -m delphi_nssp.patch to create patch data.

None of any other extra steps.

The code chunk in the readme was not generating issue-specific dataset all at once, but just for one issue. I use that code and daily scheduling to made a daily copy of what the source API returns and store that copy as in a csv on bigchunk-dev-02.delphi.cmu.edu. So we have a history since march for this indicator. This means on the server there are these daily csv where each csv is source data for one issue:

[user@bigchunk-dev-02 nssp]$ pwd
/common/source_backup/nssp
[user@bigchunk-dev-02 nssp]$ ls
2024-04-17.csv  2024-05-21.csv  2024-06-25.csv  2024-08-03.csv
2024-04-18.csv  2024-05-22.csv  2024-06-26.csv  2024-08-04.csv
2024-04-19.csv  2024-05-23.csv  2024-06-27.csv  2024-08-05.csv
...

I originally put the code chunk in readme to show how if there'a a patching need, one can set up their own daily/weekly backup starting from the start of an outage to the end. But it's really not needed now that I made the relevant csv files on bigchunk-dev-02.delphi.cmu.edu all readable to anyone with access to that server.

Then I added the auto-download source backup csv files.

So now we avoid having patching be a multi-step process.

nssp/delphi_nssp/patch.py

nssp/delphi_nssp/pull.py

nssp/delphi_nssp/patch.py

…e_dir not exist/empty + adjust tests accordingly

minhkhul · 2024-09-04T19:53:22Z

Note: About custom_run flag, non-patch custom run doesn't really exist in nssp like it does in google-symptoms due to the differences in how the two sources handle revision data and how we grab those data. We'll keep the flag in nssp still to maintain consistency and to disambiguate params.json.

nssp/delphi_nssp/pull.py

nssp/delphi_nssp/patch.py

minhkhul added 5 commits July 23, 2024 19:26

nssp patching code

c78ae21

lint

7694c0a

add test

a3ed4c2

add test

1628d34

Add patching how-to to readme

2536b94

aysim319 reviewed Jul 25, 2024

View reviewed changes

nssp/delphi_nssp/patch.py Show resolved Hide resolved

aysim319 reviewed Jul 25, 2024

View reviewed changes

nssp/tests/test_patch.py Outdated Show resolved Hide resolved

minhkhul added 2 commits July 25, 2024 13:19

adjust current_issue_dir name for weekly data instead of daily.

e4d45e5

lint

db906fc

nmdefries mentioned this pull request Jul 25, 2024

Add discussion of params.json and defaults to indicator manual #2002

Open

adjust test for more cases

8f0bb32

minhkhul requested a review from aysim319 July 25, 2024 19:35

aysim319 reviewed Jul 25, 2024

View reviewed changes

nssp/delphi_nssp/patch.py Outdated Show resolved Hide resolved

aysim319 reviewed Jul 25, 2024

View reviewed changes

nssp/delphi_nssp/pull.py Show resolved Hide resolved

aysim319 reviewed Jul 25, 2024

View reviewed changes

nssp/delphi_nssp/run.py Outdated Show resolved Hide resolved

aysim319 reviewed Jul 25, 2024

View reviewed changes

nssp/tests/test_patch.py Outdated Show resolved Hide resolved

minhkhul added 7 commits August 5, 2024 23:01

add custom_run flag

7f151f5

handle custom flag on but bad config

c093349

make patch config check readable

a967416

make good_patch_config check comprehensive

c020da6

rewrite good_patch_config for clarity

9a6130b

add unit tests for good_patch_config check

b8a2177

add test_pull unit test for patching case + cleanup format

a7d9443

aysim319 reviewed Aug 8, 2024

View reviewed changes

nssp/tests/test_patch.py Outdated Show resolved Hide resolved

aysim319 reviewed Aug 8, 2024

View reviewed changes

nssp/tests/test_patch.py Outdated Show resolved Hide resolved

minhkhul added 2 commits August 8, 2024 14:54

split test cases + move to pytest

e29e07e

add test for multi-week patching

0a4bfb6

minhkhul requested review from aysim319 and nmdefries August 9, 2024 15:11

nmdefries reviewed Aug 21, 2024

View reviewed changes

nssp/delphi_nssp/run.py Show resolved Hide resolved

nssp/delphi_nssp/pull.py Show resolved Hide resolved

minhkhul and others added 3 commits August 28, 2024 16:43

Update nssp/README.md

8734daa

Co-authored-by: nmdefries <[email protected]>

Update nssp/README.md

5a6f8b6

Co-authored-by: nmdefries <[email protected]>

Add auto-download source backup data + update docs + test

4356494

minhkhul added 4 commits August 30, 2024 14:53

adjust custom_run flag to leave room for non-patch custom runs

ca427a4

move pull logic from run.py into pull.py

f58b068

logger to static

5e93175

adjust unit tests

2d8670d

minhkhul commented Aug 30, 2024

View reviewed changes

nssp/delphi_nssp/patch.py Outdated Show resolved Hide resolved

minhkhul requested a review from aysim319 August 30, 2024 21:05

more unit test adjustment

f0335f6

aysim319 reviewed Sep 3, 2024

View reviewed changes

nssp/delphi_nssp/pull.py Show resolved Hide resolved

aysim319 reviewed Sep 3, 2024

View reviewed changes

nssp/delphi_nssp/patch.py Outdated Show resolved Hide resolved

aysim319 reviewed Sep 3, 2024

View reviewed changes

nssp/delphi_nssp/patch.py Outdated Show resolved Hide resolved

move get_source_data to pull.py + make get_source_data run when sourc…

7e06f94

…e_dir not exist/empty + adjust tests accordingly

minhkhul requested a review from aysim319 September 3, 2024 19:22

minhkhul added 2 commits September 3, 2024 16:51

auto-remove source_dir content after finish patch run

e678ce6

lint happy

bc1d7a7

aysim319 reviewed Sep 5, 2024

View reviewed changes

nssp/delphi_nssp/pull.py Outdated Show resolved Hide resolved

aysim319 reviewed Sep 5, 2024

View reviewed changes

nssp/delphi_nssp/pull.py Outdated Show resolved Hide resolved

aysim319 reviewed Sep 5, 2024

View reviewed changes

nssp/delphi_nssp/pull.py Outdated Show resolved Hide resolved

aysim319 reviewed Sep 5, 2024

View reviewed changes

nssp/delphi_nssp/patch.py Show resolved Hide resolved

minhkhul requested a review from nmdefries September 10, 2024 18:11

minhkhul and others added 5 commits September 10, 2024 14:57

Update pull.py

84cba84

Update pull.py - remove stat debug

742737b

add progress log for source file download

e13d3db

lint

9cec6ff

lint

5450d8b

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

nssp patching code #2000

nssp patching code #2000

minhkhul commented Jul 23, 2024 •

edited

Loading

aysim319 Aug 8, 2024

minhkhul Aug 8, 2024

aysim319 Aug 8, 2024

nmdefries Aug 9, 2024

aysim319 Aug 15, 2024

nmdefries Aug 19, 2024

minhkhul commented Aug 29, 2024

minhkhul commented Sep 4, 2024

		from .run import run_module


		def good_patch_config(params, logger):

nssp patching code #2000

Are you sure you want to change the base?

nssp patching code #2000

Conversation

minhkhul commented Jul 23, 2024 • edited Loading

Description

Fixes

aysim319 Aug 8, 2024

Choose a reason for hiding this comment

minhkhul Aug 8, 2024

Choose a reason for hiding this comment

aysim319 Aug 8, 2024

Choose a reason for hiding this comment

nmdefries Aug 9, 2024

Choose a reason for hiding this comment

aysim319 Aug 15, 2024

Choose a reason for hiding this comment

nmdefries Aug 19, 2024

Choose a reason for hiding this comment

minhkhul commented Aug 29, 2024

minhkhul commented Sep 4, 2024

minhkhul commented Jul 23, 2024 •

edited

Loading