Finetune algorithms log only train regret #76

Open
DT6A opened this issue Aug 1, 2023 · 0 comments
Labels
bug (Something isn't working), wontfix (This will not be worked on)

Comments

@DT6A
Collaborator

DT6A commented Aug 1, 2023

All of the algorithms with offline-to-online finetuning log training regret (the regret obtained from the online interactions that are used for training) under both train/regret and eval/regret. As a result, we report only train regret, which differs from the Cal-QL work, where the authors report eval regret. Reporting eval regret is strange because the quantity we really want to minimize in practice is train regret, so this bug is not critical but should be kept in mind. I will fix it, but without rerunning all of the algorithms due to compute limitations (maybe we will rerun them later).
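
For illustration only, here is a minimal sketch of what logging the two quantities separately could look like. It is not the repository's code: the helper names `regret` and `build_regret_logs` are hypothetical, and regret is assumed to be the mean of (1 - success) over episodes, which may differ from the exact definition used in the experiments.

```python
import numpy as np


def regret(successes):
    """Assumed definition: mean of (1 - success) over episodes."""
    return float(np.mean(1.0 - np.asarray(successes, dtype=float)))


def build_regret_logs(train_successes, eval_successes):
    """Hypothetical helper: compute each key from its own episodes.

    The bug described above corresponds to both keys being computed
    from the training episodes.
    """
    return {
        "train/regret": regret(train_successes),  # online training episodes
        "eval/regret": regret(eval_successes),    # separate evaluation episodes
    }


# Toy success flags, made up for the example.
logs = build_regret_logs(
    train_successes=[0, 0, 1, 1],
    eval_successes=[0, 1, 1, 1],
)
print(logs)
```

With separate inputs the two keys can diverge, which is what makes a comparison against eval regret reported elsewhere meaningful.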

@DT6A added the bug (Something isn't working) and wontfix (This will not be worked on) labels on Aug 1, 2023