
Jean/refacto benchmark #150

Merged
merged 8 commits into from
Jun 15, 2022
Conversation

jeandut
Collaborator

@jeandut jeandut commented Jun 13, 2022

This PR is aimed at making the benchmark logic both simpler and more robust.

@jeandut jeandut marked this pull request as ready for review June 14, 2022 11:55
@ghost ghost left a comment


Hi,

Thanks for starting the refactoring. Here are a few comments to take it even further. I will not mark anything as compulsory given the time constraints, but it would be great to record whatever is not done here as an issue for future work.

  1. Optimization: in fed_benchmark.py, as far as possible, initialize the dataloaders only after checking whether the experiment actually needs to be run.
  2. Cut main in fed_benchmark.py into sub-functions to make it more readable (it is > 370 lines long at the moment...). More actionable suggestions:
  • Split main into sub-functions such as pooled_training, local_training, and strategy_training. That will make main much simpler (and +1 for single_centric_training, which is already much better than before).
  • The logic that tests whether the experiments are finished should be hidden in sub-functions for readability:
    • Replace all the len(index_of_interest) < (NUM_CLIENTS + 1) tests that check whether an experiment is finished with a more explicit function, e.g. if check_exp_finished(df, hyperparameters, sname, num_updates).
    • Likewise, move all the statements dealing with edge cases that should not happen into sub-functions.
  • Many variables have very short names, like training_dls for training_dataloaders, s for strategy, m for model... Better to use longer, more descriptive names.
  • The logic for filling the hyperparameters' param could also be delegated to sub-functions.
  3. conf.py should be renamed to something more explicit, like configuration_utils.py.
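To make the check_exp_finished suggestion concrete: the sketch below is a hypothetical illustration of the pattern the review proposes, not the actual fed_benchmark.py code. The DataFrame columns ("strategy", "num_updates", "lr"), the NUM_CLIENTS value, and the hyperparameters key are all assumptions for the example.

```python
import pandas as pd

NUM_CLIENTS = 2  # illustrative value, not the benchmark's real client count


def check_exp_finished(df, hyperparameters, sname, num_updates):
    """Return True if the results DataFrame already contains the pooled run
    plus one run per client for this strategy/hyperparameter combination.

    Column names and the hyperparameters schema here are illustrative.
    """
    index_of_interest = df.loc[
        (df["strategy"] == sname)
        & (df["num_updates"] == num_updates)
        & (df["lr"] == hyperparameters.get("learning_rate"))
    ].index
    # One row per client plus one pooled row means the experiment is done.
    return len(index_of_interest) >= (NUM_CLIENTS + 1)


# Usage: skip experiments whose results are already recorded.
results = pd.DataFrame(
    {
        "strategy": ["FedAvg", "FedAvg", "FedAvg"],
        "num_updates": [100, 100, 100],
        "lr": [0.01, 0.01, 0.01],
    }
)
print(check_exp_finished(results, {"learning_rate": 0.01}, "FedAvg", 100))  # → True
```

The point of the wrapper is purely readability: the call site in main reads as an English sentence, and the `NUM_CLIENTS + 1` convention lives in exactly one place.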

@jeandut
Collaborator Author

jeandut commented Jun 15, 2022

Created issue #154 so that this review is tracked, but merging in the meantime.

@jeandut jeandut closed this Jun 15, 2022
@jeandut jeandut reopened this Jun 15, 2022
@jeandut jeandut merged commit fce7ad2 into main Jun 15, 2022
@jeandut jeandut deleted the jean/refacto-benchmark branch June 20, 2022 09:10