-
Notifications
You must be signed in to change notification settings - Fork 250
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add NullModel to AlpacaEval #414
Conversation
Hi, @YannDubs, is there any issue with this pr? We would appreciate your review and feedback! |
Hi @xszheng2020, Can you give a TLDR of how you find the cheating prompt? It's trained on the AlpacaEval prompts, right? By the typical reasoning I wouldn't accept such a PR. But it clearly has some value to tell people about potential issues. Also please add add the constant output you use in the folder! |
Hi, @YannDubs
After random searching on UltraFeedback instructions which does not overlap with AlpacaEval's, the LC win rate is boosted from 76.8% to 86.5%.
Thanks. |
Hi, @YannDubs, just as @xszheng2020 mentioned, we did not use AlpacaEval prompts, as that would make it quite trivial to cheat. Although our paper title is about cheating, we strictly follow AlpacaEval's rules, ensuring no data contamination. In this sense, our submission may even be more justified than some trained models that do involve data contamination. AlpacaEval is one of the most popular benchmarks, and we have no intention of disrupting its normal function. We suggest adding a special marker to clarify that our work is focused on red teaming, which won’t impact the evaluation of other submissions. Of course, the final decision is yours, and we’re open to any suggestions you might have. Thank you for your great efforts! Best, |
That all makes sense to me can you make the following changes and I’ll merge:
|
Hi, @YannDubs Thanks for your timely reply!
Please check! |
done @xszheng2020 @P2333 ! |
Dear AlpacaEval Team,
We are researchers from Sea AI Lab, Singapore and we are writing to share our recent work: Cheating Automatic LLM Benchmarks: Null Models Achieve High Win Rates (https://arxiv.org/abs/2410.07137).
We found that a "Null Model" that always outputs a constant response for any instructions can reach a 86.5% LC win rate on AlpacaEval 2.0.
We want to contribute our results to the leaderboard, thanks!
Best,
Xiaosen