Dataset and Lessons Learned from the 2024 SaTML LLM Capture-the-Flag Competition

Official repository of the 2024 SaTML LLM Capture-the-Flag Competition led by Edoardo Debenedetti, Javier Rando and Daniel Paleka.

Competition report: https://arxiv.org/abs/2406.07954

Dataset: https://huggingface.co/datasets/ethz-spylab/ctf-satml24

Blogpost: https://spylab.ai/blog/results-competition/

Loading the dataset from HuggingFace

from datasets import load_dataset

# The second argument selects a dataset configuration; the key in brackets selects a split.
defenses = load_dataset("ethz-spylab/ctf-satml24", "defense")["valid"]
teams = load_dataset("ethz-spylab/ctf-satml24", "teams")["defense_teams"]
chats = load_dataset("ethz-spylab/ctf-satml24", "interaction_chats")["attack"]
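Before analyzing a split, it can help to check how many rows it has and which fields each row carries. The following is a minimal sketch and not part of this repository: `summarize_rows` is a hypothetical helper, and the toy rows use made-up field names that are not the real dataset schema. It only assumes that iterating over a split yields one dictionary per row.

```python
from collections import Counter
from typing import Any, Iterable, Mapping


def summarize_rows(rows: Iterable[Mapping[str, Any]]) -> dict:
    """Count rows and how often each field name appears, without assuming a schema."""
    num_rows = 0
    column_counts: Counter = Counter()
    for row in rows:
        num_rows += 1
        column_counts.update(row.keys())
    return {"num_rows": num_rows, "columns": dict(column_counts)}


# Stand-in rows with invented field names (an assumption for illustration only).
toy_rows = [
    {"id": 1, "text": "hello"},
    {"id": 2, "text": "world"},
]
summary = summarize_rows(toy_rows)
print(summary)
```

The same helper should work on a loaded split, for example `summarize_rows(defenses)`. For a quick look, the split objects returned by `load_dataset` also expose `num_rows` and `column_names` directly.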

Analyzing the data

We provide a script, chat_diversity.py, to reproduce the basic analysis included in our official report. The folder raw_data_manipulation contains the transformations we applied to the raw data collected during the competition. Please reach out to us if you need the original raw data.
