Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

run_summary #86

Open
kkappler opened this issue Mar 19, 2022 · 2 comments
Open

run_summary #86

kkappler opened this issue Mar 19, 2022 · 2 comments
Labels
enhancement New feature or request

Comments

@kkappler
Copy link
Collaborator

In the same way that mth5 has a channel_summary method, that returns a dataframe with info about each channel, it would be nice to have a lighter-weight version that only returned one row per run.

This can be achieved by running a group_by on the channel_summary. grouper = df.groupby(["station", "run"])

A method that does this already is in aurora/aurora/tf_kernel/dataset.py, on the issue31 branch, which will soon be dev branch. The method is called channel_summary_to_dataset_definition

If run_summary is not appreciably faster than channel_summary, then it would probably be best to make run_summary depend explicitly on channel_summary as in my example.

grouper = df.groupby(["station", "run"])

@kkappler kkappler added the enhancement New feature or request label Mar 19, 2022
@kkappler
Copy link
Collaborator Author

@kujaku11 Let's put this one on the backburner until after we have merged our branches into dev

kujaku11 added a commit that referenced this issue Mar 25, 2022
Update Channel and TF Summary tables.  Addresses issue #86, fixes issue #50
@kujaku11
Copy link
Owner

@kkappler I think once you get the format for the Dataset Definition we can pretty easily create that from the channel_summary using pandas groupby.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

2 participants