run_summary #86

kkappler · 2022-03-19T00:11:25Z

In the same way that mth5 has a channel_summary method, that returns a dataframe with info about each channel, it would be nice to have a lighter-weight version that only returned one row per run.

This can be achieved by running a group_by on the channel_summary. grouper = df.groupby(["station", "run"])

A method that does this already is in aurora/aurora/tf_kernel/dataset.py, on the issue31 branch, which will soon be dev branch. The method is called channel_summary_to_dataset_definition

If run_summary is not appreciably faster than channel_summary, then it would probably be best to make run_summary depend explicitly on channel_summary as in my example.

grouper = df.groupby(["station", "run"])

The text was updated successfully, but these errors were encountered:

kkappler · 2022-03-19T01:05:13Z

@kujaku11 Let's put this one on the backburner until after we have merged our branches into dev

Update Channel and TF Summary tables. Addresses issue #86, fixes issue #50

kujaku11 · 2022-03-29T22:19:12Z

@kkappler I think once you get the format for the Dataset Definition we can pretty easily create that from the channel_summary using pandas groupby.

kkappler added the enhancement New feature or request label Mar 19, 2022

kujaku11 added a commit that referenced this issue Mar 25, 2022

Merge pull request #88 from kujaku11/add_tf

c688060

Update Channel and TF Summary tables. Addresses issue #86, fixes issue #50

kujaku11 mentioned this issue Mar 29, 2023

Add a run_summary feature to mth5 #96

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

run_summary #86

run_summary #86

kkappler commented Mar 19, 2022

kkappler commented Mar 19, 2022

kujaku11 commented Mar 29, 2022

run_summary #86

run_summary #86

Comments

kkappler commented Mar 19, 2022

kkappler commented Mar 19, 2022

kujaku11 commented Mar 29, 2022