Repositories list
29 repositories
AMPLIFY (Public)
EfficientLLMs (Public)
SubGoal_Distillation_LLM (Public): Code for paper Sub-goal Distillation: A Method to Improve Small Language Agents, accepted at CoLLAs 2024.
(repository name not shown): We explain why fairness metrics don't correlate and propose CAIRO to make them correlate.
CMOptimizer (Public)
FASP (Public)
(repository name not shown): Recall to Imagine, a model-based RL algorithm with superhuman memory. Oral (1.2%) @ ICLR 2024
Lookbehind-SAM (Public)
RISC (Public)
EpiK-Eval (Public)
r2i.github.io (Public)
crystal-design (Public)
RLHive (Public)
adaptive-hanabi (Public)
COE (Public)
CGOptimizer (Public)
healthy-data-diet (Public)
LoCA (Public)
IIRC (Public)
PatchUp (Public)
RL-Tuner-CP (Public)
LoCA2 (Public)
Lifelong-Hanabi (Public)