You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Trying to intake many members/models that satisfy the following criteria:
cat = col.search(
variable_id=['tos', 'sos', 'chl', 'mlotst', 'spco2'],
table_id=['Omon'], # monthly ocean output only
experiment_id=['ssp245','historical'],
require_all_on=['source_id', 'member_id', 'grid_label'] # this ensures that results will have all variables and experiments available
)
However, many of the ssp245 and historical time periods do not match with each other. For instance,
will show you an ingested historical member with a time period that ends in 1969, whereas a few weeks ago every historical member ingested had data through 2014. Some ssp245 time periods are ending in 2022, some in 2024, some in 2100, it appears. Is there a way to only ingest members based on whether they satisfy certain year ranges? I know that, in the case of the historical member listed above, I would want that filtered out, as its associated ssp245 data starts in 2015, meaning 1970-2014 are missing.
The text was updated successfully, but these errors were encountered:
AbbySh
changed the title
[Potential Bug]: Members being ingested with inconsistent time lengths
[Potential Bug]: Members being ingested with inconsistent time lengths/missing time chunks
Jun 11, 2024
We need to confirm if these 'short' runs are short as they are submitted (then there is nothing we can do from our end), or if there is a bug in the ingestion (this we can and should fix, and rerun these particular stores).
I will try to get to this in the next days, but for your purposes I recommend adding an additional check along the lines of
filtered_datasets= []
fordsinall_your_datasets:
ds=ds.sel(time=slice('2014', '2100')) # Some members are run longer than 2100, and those should just be trimmed.ifds.time[0].data.year==2014andds.time[-1].data.year==2100:
filtered_datasets.append(ds)
else:
print(f"{ds=} did not pass the time test")
Hi @jbusecke, any update on this? We're trying to pull CMIP6 data again to include an extra year, but we can't concatenate/merge our variables because a lot of them seem to not match in time values/time ranges all of a sudden
Description
Trying to intake many members/models that satisfy the following criteria:
However, many of the ssp245 and historical time periods do not match with each other. For instance,
ddict['CMIP.MPI-M.MPI-ESM1-2-LR.historical.r41i1p1f1.Omon.spco2.gn.none.r41i1p1f1.v20190815.gs://cmip6/cmip6-pgf-ingestion-test/zarr_stores/9103515710_1/CMIP6.CMIP.MPI-M.MPI-ESM1-2-LR.historical.r41i1p1f1.Omon.spco2.gn.v20190815.zarr']
will show you an ingested historical member with a time period that ends in 1969, whereas a few weeks ago every historical member ingested had data through 2014. Some ssp245 time periods are ending in 2022, some in 2024, some in 2100, it appears. Is there a way to only ingest members based on whether they satisfy certain year ranges? I know that, in the case of the historical member listed above, I would want that filtered out, as its associated ssp245 data starts in 2015, meaning 1970-2014 are missing.
The text was updated successfully, but these errors were encountered: