Creating base get_meta() function to retrieve data set meta data #19

rmbielby · 2024-09-05T12:07:26Z

Brief overview of changes

Analysts will need a function to retrieve the basic meta data for a given data set. This adds that in the form of get_meta(). To support this, I've also created a first step in creating an error handling script that helps translate html connection codes.

Why are these changes being made?

We need a function to connect to the meta data held on data sets on the EES API. The meta data holds the column and indicator info on a given data set, including the filter item and indicator codes required to query the dataset via the API.

Detailed description of changes

I've add the following functions:

get_meta_response()
http_request_error()

get_meta_response() will take a dataset_id, dataset_version and api_version and deliver the meta data associated with that dataset. This can be returned as the basic query result provided by the API (parse = FALSE) or an initial R friendly structured list contianing the results (parse = TRUE).

http_request_error() will translate any http return codes (e.g. 200, 404, 504 etc) and translate these into a broad-brush error message. This could be expanded in the future to be more fine grained and informative, but I've kept it fairly top level for now (i.e. it only picks up whether it's 2XX, 4XX or 5XX).

And following comments, I've created an extra bunch of functions to do the additional parsing I'd been saving for later PRs:

get_meta()
parse_meta_filter_columns()
parse_meta_filter_item_ids()
parse_meta_indicator_columns()
parse_meta_location_ids()

parse_meta_filter_columns(), parse_meta_filter_item_ids(), parse_meta_indicator_columns() and parse_meta_location_ids() tidy up the individual outputs in the structured list returned by get_meta_response() into individual data frames. Finally, get_meta() is the function I'm intending most end users to actually use and it just a wrapper that runs get_meta_response() and then applies the 4 parse functions to it to create a single structured list of data frames.

Issue ticket number/s and link

#1

And now #9, #10, #11, #12 and #13 as well.

R/get_meta.R

cjrace

Could we change the format of the parsed ones a little so they're a bit less nested? Happy with the general list format, though might be nice to do some more clean up so there's only ever one layer of nesting and all data frames in the list are easier to use (might make our own lives ees-ier for other functions too)

$locations which currently gives level.code and level.label as well as locations, we already have $geographicLevels, so could drop the level.code / level.label, just have a single data frame with code, name and id cols for locations? Will make it print nicer in the console and be easier to reuse as a lookup?

Could we split $filters into a filter_columns table and a filter_options table? Again flattening this out a bit will make it easier to reuse as a lookup and print in a more friendly way. For filter options, I'd imagine one flat table with all options, and a column for what filters they apply to, and I guess if you do that, you could leave it called filters, and not need a separate columns table if it's one big dataframe with cols like filter_column_id filter_column_label filter_option_id filter_option_label

DESCRIPTION

R/get_meta.R

tests/testthat/test-get_meta.R

tests/testthat/test-helper_functions.R

R/get_meta.R

…se into data frames

R/get_meta.R

…r::select and rename

cjrace

Nice solution / code generally - a neat way to structure the functions so that the main user query doesn't repeat 5 calls just to get the metadata out. Few comments in the code specifically, plus:

Should we separate out the functions we expect users to use in the _pkgdown_yml file for the reference list? Currently it's one flat list, and I expect most users will only need a couple of the functions to start with.
Should we have before / after test data saved in the test folder to check the parse_meta... functions against?

DESCRIPTION

R/get_meta.R

tests/testthat/test-helper_functions.R

R/helper_functions.R

…tyling workarounds

…ta data

R/get_meta.R

rmbielby · 2024-09-10T15:45:25Z

Should we separate out the functions we expect users to use in the _pkgdown_yml file for the reference list? Currently it's one flat list, and I expect most users will only need a couple of the functions to start with.

I've tried some sort of structuring that makes rough sense to me as something that could be extended sensibly as we add more functionality.

rmbielby · 2024-09-10T16:23:01Z

Should we have before / after test data saved in the test folder to check the parse_meta... functions against?

I guess so...

I've added meta test data into a testdata/ folder and written a test for each of the parsing functions. For the data format, I've picked RDS as:

I want to be able to write lists of data frames and not just individual data frames
The save and read functions for rds are included in base R, so don't need any extra packages

…rally. Added extra tests for it too.

rmbielby added 4 commits September 2, 2024 14:01

Added initial get_meta() function

55e140c

Expanded get_meta function to extract to json parsed structured list

c903c56

Merge branch 'main' into get-meta

1dc3ca4

Added tests for get_meta and helper_functions

4ed41cc

rmbielby requested a review from cjrace September 5, 2024 12:07

rmbielby self-assigned this Sep 5, 2024

Code tidying

c106aad

github-advanced-security bot found potential problems Sep 5, 2024

View reviewed changes

R/get_meta.R Fixed Show fixed Hide fixed

R/get_meta.R Fixed Show fixed Hide fixed

rmbielby added 3 commits September 5, 2024 13:13

Explicitly calling eesyapi functions where used

1c6aec4

Added validation to get_meta

69be728

Added title to get_meta function

3ee8a57

cjrace requested changes Sep 5, 2024

View reviewed changes

DESCRIPTION Show resolved Hide resolved

R/get_meta.R Show resolved Hide resolved

R/get_meta.R Outdated Show resolved Hide resolved

tests/testthat/test-get_meta.R Outdated Show resolved Hide resolved

tests/testthat/test-helper_functions.R Outdated Show resolved Hide resolved

cjrace reviewed Sep 5, 2024

View reviewed changes

R/get_meta.R Outdated Show resolved Hide resolved

rmbielby added 2 commits September 6, 2024 09:54

Updating description for get_meta() and a few other minor tweaks

1b41720

Produced parsing functions to process filter info from the API repson…

8f146ad

…se into data frames

github-advanced-security bot found potential problems Sep 6, 2024

View reviewed changes

Updated tests for errors on get_meta()

bbc1d88

github-advanced-security bot found potential problems Sep 6, 2024

View reviewed changes

R/get_meta.R Fixed Show fixed Hide fixed

R/get_meta.R Fixed Show fixed Hide fixed

Added location parsing to the meta data retrieval

a28987a

github-advanced-security bot found potential problems Sep 6, 2024

View reviewed changes

rmbielby added 3 commits September 6, 2024 12:48

Updating version number

7543739

Add NEWS.md

4b27e59

Updated news.md with changes in latest version update

8dc5ea4

Clearing out some lintr issues around how columns are defined in dply…

39da35f

…r::select and rename

cjrace requested changes Sep 10, 2024

View reviewed changes

rmbielby added 2 commits September 10, 2024 15:55

Updated with a few changes from another branch including some lintr s…

dea3484

…tyling workarounds

Responding to some PR comments - added ordering to time periods in me…

5bdb7bd

…ta data

github-advanced-security bot found potential problems Sep 10, 2024

View reviewed changes

R/get_meta.R Fixed Show fixed Hide fixed

R/get_meta.R Fixed Show fixed Hide fixed

R/get_meta.R Fixed Show fixed Hide fixed

rmbielby added 3 commits September 10, 2024 16:27

Fixing a bug in time periods parsing

7d6d410

Restructured reference list in documentation

b58572b

Bit of minor rephrasing in the reference list

a4642e7

rmbielby added 2 commits September 10, 2024 17:12

Added test meta data and added test to check time period parsing

89976d2

Added testdata for meta data parsing and created associated new tests

4b3eeeb

rmbielby and others added 3 commits September 10, 2024 17:37

Fixed a bug in http_request_error and improved its functionality gene…

da74ace

…rally. Added extra tests for it too.

Adapted a few function titles to be more informative

effd920

fix inconsistent full stops in reference list

9fe47cd

cjrace approved these changes Sep 10, 2024

View reviewed changes

rmbielby merged commit c3ef4b6 into main Sep 10, 2024
9 checks passed

rmbielby deleted the get-meta branch September 18, 2024 09:06

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Creating base get_meta() function to retrieve data set meta data #19

Creating base get_meta() function to retrieve data set meta data #19

rmbielby commented Sep 5, 2024 •

edited

Loading

cjrace left a comment

cjrace left a comment

rmbielby commented Sep 10, 2024

rmbielby commented Sep 10, 2024

Creating base get_meta() function to retrieve data set meta data #19

Creating base get_meta() function to retrieve data set meta data #19

Conversation

rmbielby commented Sep 5, 2024 • edited Loading

Brief overview of changes

Why are these changes being made?

Detailed description of changes

Issue ticket number/s and link

cjrace left a comment

Choose a reason for hiding this comment

cjrace left a comment

Choose a reason for hiding this comment

rmbielby commented Sep 10, 2024

rmbielby commented Sep 10, 2024

rmbielby commented Sep 5, 2024 •

edited

Loading