-
Notifications
You must be signed in to change notification settings - Fork 33
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
HIGHLY DESIRED: python cmor
example for variables on native ocean tripolar grid
#766
HIGHLY DESIRED: python cmor
example for variables on native ocean tripolar grid
#766
Comments
I am absolutely thrilled to continue to be a good user about things. I'm pondering a minimal case showing my current problems with the dummy-index approach. I'm bumping into monotonicity concerns, or min/max val checks. when i try to avoid one, i run smack-dab into the other. |
@ilaflott this is MOM6 output right? I wonder whether we could liberate an example from the CSIRO folks, who have been using (MOM5, I think) in their ACCESS configs.. It might be a good idea to start with a working example, and then tweak.. @chloemackallah any tips you have for us, or examples you can point to in https://github.com/ACCESS-Community-Hub/APP4? |
I'm not sure if this is exactly what you are looking for but there is a Python test in CMOR that demonstrates the use of lat-lon grids with vertices. https://github.com/PCMDI/cmor/blob/main/Test/test_python_grid_and_ocn_sigma.py |
Thanks for that- studying your test now too! I hope that upon doing so, it "clicks" on my end, and I can make my case work. But also, consider it: the only case on my end that i could not figure out, is the ones NOT covered by what's super-easily-searchable in https://cmor.llnl.gov/mydoc_cmor3_python/. What I truly desire as an outcome, is for the aforementioned link to be updated with an example fully blessed by the primary |
Hey @durack1 , just stepping in here... so @chloemackallah handed APP4 over to ACCESS-NRI but it was basically used only for CMIP6. @paolap (formerly) at the Centre of Excellence for Climate Extremes recently released ACCESS-MOPPeR (https://access-mopper.readthedocs.io/en/latest/) which builds on it and can extend beyond CMOR use to other experiments via custom tables, and is probably what you want to base anything on if you want to go poking in ACCESS post-processing, as it's what we're hoping will form the basis for the CMIP7 postprocessor :) (it cleans up some of the residual 'mess' in APP4 too which was not Chloe's fault but you can always do better with hindsight!) |
I would note that we have had a similar request from an obs4MIPs contributor who would like to store data on a hemispheric EASE2 grid. I've copied below some email exchanges about this: 2/1/24 We're planning on preparing one or more of our sea-ice concentration The dataset will be of a smilar format to so a on a 12.5 km resolution EASE2 grid split into NH and SH files, one I've started experimenting a bit with CMOR, and it looks to me like all Also, would the NH/SH files need to be combined for a "global" files, or Anyway, if you have some advice I'd greatly appreciate it! A. 2/1/24 There may be a file naming problem if data from the two hemispheres is separated into 2 files. Under the currently proposed obs4MIPs file name template, the file names would be the same. I have been advocating for CMIP7 that we include in the file name a "region" identifier. If we had that in CMIP6, we would have needed the following identifiers: glb: global domain Similarly, CORDEX also needs a "region" identifier in its filenames. We are currently finalizing the global attribute list and file naming convention for obs4MIPs. Perhaps Peter can suggest how to best proceed in the mean time. CMOR's is able to write data on grids other than those described by a rectangular grid of longitudes and latitudes, but its capabilities are quite limited and it doesn't make use of recent augmentation of the CF conventions to include more grid-defining information. If you must avoid regridding, we would have to think about how to best proceed. You could avoid the both of the above issues by regridding the data in each hemisphere to a regular latitude x longitude grid and combining the two hemispheres into a single file. You could fill the tropical region with either 0 (presuming no sea ice there) or with the missing value indicator. This would make it easier for most users to ingest the data, although it would result in a larger file size and imply more resolution in the polar regions than you really have. Happy to iterate in finding a way forward, P.S. To be clear, I think it would be idea if for obs4MIPs we also included a "region" identifier in the filenames, but that is inconsistent with the current template. 2/2/24 Thank you for your query. We certainly want to help you advance your planned contribution to obs4MIPs. As you have noticed, for obs4MIPs to date we have only been using CMOR with regular lon x lat grids, and additionally there are several important issues raised by Karl that we need to consider for this special case. The good news is the timing is good for us to be thinking through this. We will discuss some options and get back to you next week. All the best, Hi A, Just a quick update to let you know that efforts are underway to construct an example use of CMOR for your application. Members of the WIP are helping us expedite this. We have identified solutions to the other questions Karl Taylor raised below. We will keep you posted on our progress with this. Many thanks for your patience. 11/25/24 Now that I should start working on this again for the CCI+ project I was wondering if there was any update about the best way to go forward with hemispheric data on equal-area (ie non-regular lat/lon) projection? Best regards, 12/19/24 Can we please conclude on this? The AR7 FastTrack process is rolling, and there is an incentive to get the reference observation datasets into Obs4MIPS. As far as I can see, all we need is a decision / clarification from Obs4MIPS about the handling of polar gridded data: Can they be added to Obs4MIPS in their polar-focused grids (better for the representativity of polar regions), or must they be re-projected to a regular lat/lon grid first (losing detail in the polar regions). Please refer to the threads of emails below: there has been some progress towards handling polar-gridded data, but no firm decision as far as we can see. Understandably, we have a preference for the polar grids. But will also accept going for the lat/lon version if it can speed ingestion in Obs4MIPS up. What do you think? 12/19/24 Thanks for bringing this important issue back to our attention. It is not a matter of decision making but rather being able to point you to an example of how to use CMOR with polar grids. We have consulted with the WIP on this and were hopeful we were on a path to have an example soon, so we are now revisiting that possibility. We know it is not difficult – many modeling groups provide their ocean model output on native grid – we just need to get the attention of somebody with the right experience and availability to help. There is of course the possibility of providing a regridded product, but this would limit the value of your data for model evaluation, so like you I hope that can be avoided. There is also the possibility that you attempt to follow the ODS2.5 data specification without CMOR, but that too is not ideal as would likely require a lot more work to ensure the standards are met which is important to how the data is managed, delivered and used. So, apologies for this situation but maybe you can give us another chance to provide an example soon? At least you have our attention! Sincerely, |
Given certain responses, I feel I should clarify- I am not a climate scientist, and I am not an oceanographer. I'm an ex-physicist working with NOAA-GFDL in a non-scientific capacity as a software engineer. My Ph.D. was in particle/nuclear physics, and overlap with this field was minimal at best. The point I'm trying to drive home is this: as a non-scientific contributor, and a non-expert, there is no replacement for well-thought out documentation, as blessed by the experts of the field. Again, this is the only For my other 8 cases, four went down without a problem, the next three took understanding PS: I will also bump what @taylor13 is highlighting (i think): ice-realm models are coming, and many will use a tripolar grid (in my current best non-scientific-expert understanding). PPS: follow up code coming, last 24 hours have been weird. I'm wearing more hats all of a sudden than I am used to. patience appreciated. PPPS: thank you for your time and energy, really! thank you!!! please help with doc :-) |
Yeah, the ESGF meta grid i've been exploring. I do NOT need to make a tripolar grid. the grid is "given" to me. I do NOT expect For the sake of our discussion in here, consider So, assume in my file, if i
Now, when i try to define OK- so now we try a dummy-index: add in the following to the code to create the dummy indices (only
then I do:
and this will error out depending on the commented out line. If I use Additionally, there's an ambiguity in how I should use I don't think I've even gotten to the part where I can start feeling out |
My recall of this part of CMOR is vague, but I've reviewed the documentation for cmor.grid here, and I would try the following: I think you do need to invoke cmor.grid to define your grid, but I don't think you need to define grid mapping parameters, which apply to "grid mappings" which I think do not apply to the tripolar grid. My guess is that your "sos" data array has 3 logical dimensions, 1 for time and 2 (say, yh and xh) to spatially locate each data value. You need to first define simple "index" dimensions by calling cmor.axis and referencing table CMIP6_grids.json, where the axes, i_index, j_index, k_index, ... are defined, each representing one of the dimensions of a multi-dimensional array. If xh is considered the 1st dimension, then invoke cmor.axis with table_entry="i_index" and length set to the length of the xh dimension. Similarly, for yh, but with the entry set to "j_index". Then, you're ready to execute cmor.grid following the documentation here. And then you can define your "sos" variable with cmor.variable as described here . Perhaps others can check whether this guidance is rubbish. |
hi @ilaflott |
thank you all for your help, i need to rescope my PR on my end, and spin off the effort to work on this case into a separate pull/issue. So it's gonna be a tad longer on my end. But mark my words I'm coming back to this. My processing this file flawlessly has become a personal vendetta, and it will vastly improve things in our workflow on our end, WHEN i make it work. I'll pay it forward when I get it working. @mauzey1 I'm looking at that code again, and I have some things to try. @taylor13 thank you very much for the detailed reply. that also gives me some things to try. @hot007 sorry I didn't mean to seem like i was ignoring, I just didn't want to reply until I gave that package/doc an honest peek and digested it a little (slow reader). TBH I am still poking through the doc, and the CMOR notes there are notably detailed, thank you! I'm not sure |
Hi @ilaflott, I missed that Chris had already pointed you to a example test code that I think could easily be modified to write your tripolar field. My discussion above of the "index" approach to defining axes applies to the most general unstructured grid when the native coordinates are of no interest. For the tripolar grid, the native coordinate values and bounds are probably of interest, in addition to the longitudes and latitudes and their vertices, so I would follow the example Chris provided, rather than my "index" approach. |
I will make a Python CMOR demo that will write a file similar to what is happening here. We'll added it to our doc site examples to help others in the future. |
I've made a new version of the code that I posted earlier but more geared towards the variable you are trying to write. |
Haha no offense taken @ilaflott , ACCESS-MOPPeR is written specifically for the ACCESS model which is broadly speaking UM+MOM, it is likely of little utility to other GCMs, however just wanted to throw it in there to demonstrate there are MOM (tripolar) CMORisers out there :) You can indeed make this work :) |
Does this new test for setting up the grid values help with CMORizing tripolar data? If so, we can start the process of making the test another Python example for CMOR here. |
@gleckler1 pinging you on this thread! |
This looks very encouraging! I will try it out soon. |
i was beginning to parse the other unit test yesterday- this is much cleaner. i'm sure this will help! |
After collecting and appropriately modifying the required jsons for this test, I was able to successfully execute resulting in the following |
@gleckler1 just cleaned that up into a file attachment. It looks like a too simple a case to me, as the x and y coords are just vectors, and not arrays, with spatially varying sizes - exactly what the tripolar data looks like - the x and y are indices into a lon and lat variable that are an array |
These are the results from the code ilaflott pointed to 3 weeks ago (above) that generates sample data. But I agree with you it is too simple an example. I'm attempting to follow the logic with OSI-SAF. |
Thank you so much!!! i think it works! If i can comment on what in your (cleaner) example was so helpful:
IMO, i feel my concerns are addressed. The example pointed to in this comment could essentially be copy/pasted into the examples section. Though perhaps for consistency, you'll want to do some style-matching, but that seems to me a nice-to-have. |
@ilaflott if you have a nice NOAA-GFDL example (native model data to CMOR) that we could reproduce in our demos, it would be great to grab it - a notebook would be ideal. There are numerous groups using MOMx, so having that on hand could help many of them, assuming they're also using a similar tripolar grid. Once we have that in place, we can close out this issue with a PR/fix |
I guess one more question to tack on while everyone is here, if you don't mind me checking my understanding. but lets say instead of Would that basically change the lines here in example 6 from:
to:
|
So I need to check with a few others on my side before releasing data, even if just as a sample/example. But, from what i understand, I am working with post-processed MOMx data, and not "fresh-off-the-stove" MOMx data. I.e. if i give you what I have, might not generalize to non-postprocessed MOMx data. But, really, the randomly generated data in @mauzey1's irregular grid generation example has the right mathematical properties- a lot of the extra stuff I am doing has to do with internal-convention-wrangling. edit: on that note, i've followed up via email regarding the sample data |
@ilaflott I have modified example6.py to use cmor.grid. example 6 using cmor.gridimport cmor
import numpy
import os
data = 10. * numpy.random.random_sample((2, 3, 4)) + 250.
data = numpy.array([
72.8, 73.2, 73.6, 74,
71.6, 72, 72.4, 72.4,
70.4, 70.8, 70.8, 71.2,
67.6, 69.2, 69.6, 70,
66, 66.4, 66.8, 67.2,
64.8, 65.2, 65.6, 66,
63.6, 64, 64.4, 64.4,
60.8, 61.2, 62.8, 63.2,
59.6, 59.6, 60, 60.4,
58, 58.4, 58.8, 59.2,
56.8, 57.2, 57.6, 58,
54, 54.4, 54.8, 56.4,
52.8, 53.2, 53.2, 53.6,
51.6, 51.6, 52, 52.4,
50, 50.4, 50.8, 51.2,
72.9, 73.3, 73.7, 74.1,
71.7, 72.1, 72.5, 72.5,
70.5, 70.9, 70.9, 71.3,
67.7, 69.3, 69.7, 70.1,
66.1, 66.5, 66.9, 67.3,
64.9, 65.3, 65.7, 66.1,
63.7, 64.1, 64.5, 64.5,
60.9, 61.3, 62.9, 63.3,
59.7, 59.7, 60.1, 60.5,
58.1, 58.5, 58.9, 59.3,
56.9, 57.3, 57.7, 58.1,
54.1, 54.5, 54.9, 56.5,
52.9, 53.3, 53.3, 53.7,
51.7, 51.7, 52.1, 52.5,
50.1, 50.5, 50.9, 51.3])
data.shape = (2, 5, 3, 4)
x = numpy.arange(4, dtype=float)
x_bnds = numpy.array([[x_ - 0.5, x_ + 0.5] for x_ in x])
y = numpy.arange(3, dtype=float)
y_bnds = numpy.array([[y_ - 0.5, y_ + 0.5] for y_ in y])
lat = numpy.array([10, 20, 30]).reshape((3, 1))
lat = numpy.tile(lat, (1, 4))
lon = numpy.array([0, 90, 180, 270])
lon = numpy.tile(lon, (3, 1))
lon_bnds = numpy.zeros((3, 4, 4))
lat_bnds = numpy.zeros((3, 4, 4))
for j in range(3):
for i in range(4):
lon_bnds[j, i, 0] = (lon[j, i] - 45) % 360
lon_bnds[j, i, 1] = lon[j, i]
lon_bnds[j, i, 2] = (lon[j, i] + 45) % 360
lon_bnds[j, i, 3] = lon[j, i]
lat_bnds[j, i, 0] = lat[j, i]
lat_bnds[j, i, 1] = lat[j, i] - 5
lat_bnds[j, i, 2] = lat[j, i]
lat_bnds[j, i, 3] = lat[j, i] + 5
time = numpy.array([15.5, 45])
time_bnds = numpy.array([0, 31, 60])
lev = [0.92, 0.72, 0.5, 0.3, 0.1]
lev_bnds = [1, 0.83,
0.61,
0.4,
0.2,
0
]
p0 = 100000
a = [0.12, 0.22, 0.3, 0.2, 0.1]
b = [0.8, 0.5, 0.2, 0.1, 0]
ps = numpy.array([
97000, 97400, 97800, 98200,
98600, 99000, 99400, 99800,
100200, 100600, 101000, 101400,
97100, 97500, 97900, 98300,
98700, 99100, 99500, 99900,
100300, 100700, 101100, 101500])
ps.shape = (2, 3, 4)
a_bnds = [
0.06, 0.18,
0.26,
0.25,
0.15,
0]
b_bnds = [
0.94, 0.65,
0.35,
0.15,
0.05,
0]
cmor.setup(inpath='Tables',
set_verbosity=cmor.CMOR_NORMAL,
netcdf_file_action=cmor.CMOR_REPLACE)
cmor.dataset_json("Test/CMOR_input_example.json")
cmor.load_table("CMIP6_grids.json")
cmorY = cmor.axis(table_entry='y_deg',
units='degrees',
coord_vals=y,
cell_bounds=y_bnds)
cmorX = cmor.axis(table_entry='x_deg',
units='degrees',
coord_vals=x,
cell_bounds=x_bnds)
cmorGrid = cmor.grid(axis_ids=[cmorY, cmorX],
latitude=lat,
longitude=lon,
latitude_vertices=lat_bnds,
longitude_vertices=lon_bnds)
cmor.load_table("CMIP6_Amon.json")
cmorTime = cmor.axis("time",
coord_vals=time,
cell_bounds=time_bnds,
units="days since 2018")
cmorLev = cmor.axis("standard_hybrid_sigma",
units='1',
coord_vals=lev,
cell_bounds=lev_bnds)
axes = [cmorTime, cmorLev, cmorGrid]
ierr = cmor.zfactor(zaxis_id=cmorLev,
zfactor_name='a',
axis_ids=[cmorLev, ],
zfactor_values=a,
zfactor_bounds=a_bnds)
ierr = cmor.zfactor(zaxis_id=cmorLev,
zfactor_name='b',
axis_ids=[cmorLev, ],
zfactor_values=b,
zfactor_bounds=b_bnds)
ierr = cmor.zfactor(zaxis_id=cmorLev,
zfactor_name='p0',
units='Pa',
zfactor_values=p0)
ips = cmor.zfactor(zaxis_id=cmorLev,
zfactor_name='ps',
axis_ids=[cmorTime, cmorGrid],
units='Pa')
cmorVar = cmor.variable("cl", "%", axes)
cmor.write(cmorVar, data)
cmor.write(ips, ps, store_with=cmorVar)
filename = cmor.close(cmorVar, file_name=True)
print("Stored in:", filename)
cmor.close()
os.system("ncdump {}".format(filename)) NetCDF output
|
I've been doing some work for NOAA-GFDL trying to "CMORize" certain types of data produced by our models and post-processing software. Data on the native tripolar ocean grid is one of the last test cases i need to facilitate.
I've been attempting to be a good user about figuring this out- i.e., doing my homework. But at this point I need a human to talk to, even if just to confirm i'm on the right track.
I've studied the CF conventions... I've found similar auxiliary-coordinate examples elsewhere (e.g. Project Pythia)... I found the aux-coordinate example on a normal lat/lon grid and all the other ones in the CMOR API doc... I found all the coordinate information I need from my people and how to get
_bnds
from it properly. I found the two small super-important sentences in the CMOR API doc.I also studied a closed/stale issue (#472 from 2019) quite extensively, finding a new idea to try. I haven't gone through with it yet (about to), but this was far too difficult a scenario to find information on.
@taylor13 suggested it would be nice to add an example handling exactly this case to the CMOR API doc, and I completely concur. A concrete example would be of high utility to me, and others that are attempting to CMORize their tripolar ocean data for MIP-publishing workflows.
The text was updated successfully, but these errors were encountered: