intake-esm or intake #1

koldunovn · 2023-10-12T15:12:41Z

The catalog you created for jasmin is the intake-ESM one. Do you have some strings attached to this format, or you might consider to change to the format we use in nextGEMS, that I find more intuitive: https://github.com/nextGEMS/catalog

wachsylon · 2023-10-18T10:23:52Z

I think both have their features and complement each other. ESM is rather for finding and browsing with metadata while the access via the normal intake is faster. Afaik, searching via intake (no esm) is like a google search but some users may not know how to search efficiently through the catalog. For these users, intake-esm is better.

I think providing both is also a good idea.

jonseddon · 2023-10-24T11:48:07Z

@koldunovn I have no experience with either format, but in my initial tests then intake-ESM seemed to be much easier to use for me to catalogue the data as most of our data had been CMORised. If I used the other format then it appeared as if I was going to have to generate appropriate YAML files. Intake-ESM scans my directory structure for me.

However, it would be good to be consistent with others and so I don't have strings attached to this format.

What is the best way to create the YAML files for CMORised data?

koldunovn · 2023-10-24T21:21:43Z

Hi @jonseddon We discussed it a bit on the meetings and I think so far we converging to having both at the same time and see how much mess it will create :) Anyway things like CMIP6 data on DKRZ are stored in intake-esm, so people should know how to use both - we will try to assist with that (maybe by providing converters). In my view simple intake is easy when there are only few experiments (as in our case), while intake-esm make sense when there is a lot of different experiments and models.

Regarding example YAML, it can be as simple as:

plugins:
  source:
  - module: intake_xarray
sources:
  2D_1h_0.25deg:
    args:
      urlpath:
      - /work/bm1344/AWI/Cycle3/FESOM/IFS_4.4-FESOM_5-cycle3/025/2D_1h_native/*/*.nc
    description: 2D_1h_0.25deg data
    driver: netcdf

In this case all netCDF files will be combined in one happy xarray. The best practice is not to mix different time frequencies.

jonseddon · 2023-10-25T10:34:53Z

@koldunovn , great! This is all very new to me and so it will be really useful at the Hackathon to see how everyone uses Intake and to work on improved solutions during the event.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

intake-esm or intake #1

intake-esm or intake #1

koldunovn commented Oct 12, 2023

wachsylon commented Oct 18, 2023

jonseddon commented Oct 24, 2023

koldunovn commented Oct 24, 2023

jonseddon commented Oct 25, 2023

intake-esm or intake #1

intake-esm or intake #1

Comments

koldunovn commented Oct 12, 2023

wachsylon commented Oct 18, 2023

jonseddon commented Oct 24, 2023

koldunovn commented Oct 24, 2023

jonseddon commented Oct 25, 2023