
Unusual Model Utility Gap: Gradient Difference vs Ascent (Llama2) #45

Open

jeongjin0 opened this issue Dec 2, 2024 · 1 comment

jeongjin0 commented Dec 2, 2024

Hi, I noticed that for Llama2 (forget10), gradient difference shows much lower model utility (~0.27) than gradient ascent (~0.63) on the leaderboard. This seems unusual, since gradient difference is designed to maintain performance on the retain set while unlearning.
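For reference, here is my understanding of the two objectives as a minimal sketch. The function names and `retain_weight` are illustrative, not the repo's actual implementation, and it assumes a Hugging Face-style model whose forward pass returns a `.loss` when labels are included in the batch:

```python
# Hypothetical sketch of the two unlearning objectives (not this repo's code).

def gradient_ascent_loss(model, forget_batch):
    # Plain gradient ascent: maximize the loss on the forget set by
    # minimizing its negation. There is no retain-set term, so utility
    # on retained data is free to degrade.
    return -model(**forget_batch).loss

def gradient_difference_loss(model, forget_batch, retain_batch, retain_weight=1.0):
    # Gradient difference: ascend on the forget set while descending on
    # the retain set. The retain term is what should preserve model
    # utility relative to plain ascent.
    forget_loss = model(**forget_batch).loss
    retain_loss = model(**retain_batch).loss
    return -forget_loss + retain_weight * retain_loss
```

Under this formulation, the retain-set term is exactly what should keep gradient difference's model utility at or above plain ascent's, which is why the reported gap looks anomalous.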

Interestingly, in the Phi model results, gradient difference shows higher utility than ascent, as expected. Could you help explain this significant performance gap in the Llama2 results?

Thanks!

zhilif (Collaborator) commented Dec 19, 2024

Apologies for the confusion. There was a bug in the leaderboard CSV generation; we have uploaded a new version. Meanwhile, I will rerun the evals to double-check. Thanks for your comment!
