Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Proposed Recipes for PyQG Subgrid Forcing (Ross et al. 2022) #231

Open
cmdupuis3 opened this issue Dec 14, 2022 · 1 comment
Open

Proposed Recipes for PyQG Subgrid Forcing (Ross et al. 2022) #231

cmdupuis3 opened this issue Dec 14, 2022 · 1 comment

Comments

@cmdupuis3
Copy link

cmdupuis3 commented Dec 14, 2022

Dataset Name

PyQG Subgrid Forcing

Dataset URL

The base URL is https://g-402b74.00888.8540.data.globus.org/, but see below for examples.

Description

For full details, see the official publication here: https://agupubs.onlinelibrary.wiley.com/doi/abs/10.1029/2022MS003258

The basic premise of the research behind this dataset is to use a quasi-geostrophic model (PyQG) for developing and testing ML-based parametrizations. The goal was to filter a high resolution (expensive) simulation to estimate sub-grid scale effects, which could then be incorporated in to a low resolution (cheap) simulation. The method of augmenting a low-resolution QG model with accurate subgrid-scale parameterization is important because it is a much faster model than full, high-resolution simulations, which are cost-prohibitive in a number of contexts.

Here, I am interested in creating a Pangeo Forge recipe for the PyQG data that was generated.

The scientific reasoning behind why these files are different are as follows:

  • Eddy vs. Jet configuration: These configurations refer to two different fluid mechanical regimes with differences in:
    • beta plane slope
    • bottom drag coefficient
    • bottom layer depth
      The different values for these result in one regime with evenly-distributed eddies (the eddy configuration), and one regime with obvious, latitude-dependent jets (the jet configuration)
  • HiRes vs. LoRes: As one could guess, HiRes is higher-resolution than LoRes. The exact resolutions were chosen specifically so that HiRes would be able to resolve eddies, and LoRes would not (thereby losing the underlying turbulence).
  • Forcing runs: These are low-resolution runs with three different subgrid-scale forcings to compensate for the lack of eddy resolution. These forcings are:
    • online total tendency
    • nonlinear advection
    • subgrid flux divergence

License

Unknown

Data Format

Zarr

Data Format (other)

No response

Access protocol

Globus

Source File Organization

The files are arranged in the following heirarchical structure:

eddy/
    low_res.zarr
    high_res.zarr
    forcing1.zarr
    forcing2.zarr
    forcing3.zarr
jet/
    low_res.zarr
    high_res.zarr
    forcing1.zarr
    forcing2.zarr
    forcing3.zarr

Example URLs

Each of these files can be obtained by appending these paths to the Globus-based root path, which is https://g-402b74.00888.8540.data.globus.org/.

As an example:

https://g-402b74.00888.8540.data.globus.org/eddy/low_res.zarr

Authorization

None

Transformation / Processing

We may need a structure like that of DataTree to deal with the heirarchical file structure if we want a single, unified dataset. Since the top level is relatively simple though, it could also make sense to break it into two separate datasets with a simpler structure, obviating the need for a tree-like structure.

Target Format

Zarr

Comments

No response

@cisaacstern
Copy link
Member

Thanks for opening this, @cmdupuis3!

it could also make sense to break it into two separate datasets with a simpler structure, obviating the need for a tree-like structure

This is the easiest way to pursue this using the latests pangeo-forge-recipes release (which does not yet support DataTree). Multiple recipes can be specified in a single recipe.py module. Please let me know how I can help!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants