In this research, several aspects of recipes and ingredient pairing are studied using a dataset that was derived from the Allrecipes platform. This dataset was enriched with data from FooDB, a dataset that includes information on the flavor components of ingredients. The resulting dataset is explored from various perspectives, involving ingredient lists and user ratings, in order to both validate the data and get a better understanding. After that, collaborative filtering techniques are investigated that are used to get a broader knowledge on user preferences in relation to ingredient combinations.
A PDF of the paper can be downloaded here.
The public_data
folder contains
.
├── recipes.single: a list of ingredients
├── recipes.basket: each line contains the ingredients in a recipe
└── reviews.csv: star ratings given to recipes by users
The reviews.csv
contains 3,281,560 ratings on a scale from 1 to 5. Each line represents a review and is a tuple of (recipe_id, reviewer_id, rating, date)
.
recipe_id
: id of the recipe on Allrecipesauthor_id
: user id of the reviewer on Allrecipesrating
: rating on a scale from 1 to 5date
: date of the review in YYYY-MM-DD format