Skip to content

Adaptation

Jack McNelis edited this page Jul 24, 2021 · 7 revisions

Adaptation

Help for future data engineers that may stumble across this repo.

Disclaimer

This repo was hastily developed/tested/pushed to GH in late 2019 to meet an urgent need. Its code is difficult to follow and poorly-documented, and contains hard-coded elements that won't be familiar outside the context of a specific project. For instance, the logic to select/join trajectories from "multi-leg" flights (a requirement that only applies to ACT-America) is crammed inside the main function of the main.py script. Adapting the code might be more trouble than it's worth, depending on your requirements.

This page points to and describes the useful logic+data from the code and ancillary files so future users can avoid digging for it themselves.

Contents

At first glance, the repo is fairly well-organized:

  • ornldaac_icartt_to_netcdf/ contains all the code, Python 3
    • _utils.py: a script containing helper data/routines which are exposed globally during import by __init__.py
    • __init__.py: the standard init script allowing the folder to be recognized/imported/run like a Python module
    • __main__.py: the module-level routine that executes when the whole dir is run like a module (i.e. python -m ornldaac_icartt_to_netcdf <args>) which implements the ICARTT to netCDF4 conversion for ACT-America
  • references/ contains CSV/JSON metadata which are used for a) look-ups that drive logic in the conversion routines, b) attributes in the output netCDFs, c) ???
    • Please refer to the README.md for now. It gives a decent description of the key files in this directory and how to use them.
Clone this wiki locally