initial commit of project

2021-04-11 19:51:12 +02:00
commit a21a8186d9
110 changed files with 16326178 additions and 0 deletions

crawl_recipes/README.md

@@ -0,0 +1,7 @@
# Crawl Recipes from Chefkoch
Get all recipes from Chefkoch by crawling its category pages. Results are written to file after every 1000 recipes.
This code is adapted from Leon Zucchini's version (https://github.com/leonzucchini/recipes).
## Run order
1. run crawl_category_subpages
2. run parse_category_subpages
3. run crawl_recipe_pages (this also runs parse_recipe_pages)
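
The periodic saving mentioned above (one write per 1000 recipes) could look roughly like the sketch below. It is not the project's actual code: `save_in_batches`, `_flush`, the `recipes_NNNN.json` file naming and the JSON format are all assumptions made for illustration.

```python
import json
from pathlib import Path

BATCH_SIZE = 1000  # assumed to match the "every 1000 recipes" interval


def _flush(batch, out_dir, index):
    """Write one batch to a numbered JSON file (hypothetical naming scheme)."""
    path = out_dir / f"recipes_{index:04d}.json"
    with path.open("w", encoding="utf-8") as fh:
        json.dump(batch, fh, ensure_ascii=False)
    return path


def save_in_batches(recipes, out_dir, batch_size=BATCH_SIZE):
    """Consume an iterable of recipe dicts and write them in fixed-size batches.

    Hypothetical helper, not part of the original crawler.
    Returns the list of file paths that were written.
    """
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    written = []
    batch = []
    for recipe in recipes:
        batch.append(recipe)
        if len(batch) >= batch_size:
            written.append(_flush(batch, out_dir, len(written)))
            batch = []
    if batch:
        # Write the final, partially filled batch so nothing is lost.
        written.append(_flush(batch, out_dir, len(written)))
    return written
```

Writing in batches keeps memory use bounded and means an interrupted crawl loses at most one unsaved batch.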