added to README files, added full dataset versions to data

This commit is contained in:
2021-04-15 20:19:09 +02:00
parent cf40ad15fb
commit 1ea0677029
9 changed files with 61 additions and 543 deletions

View File

@@ -1,3 +1,19 @@
## German FoodBERT Models
Unzip German FoodBERT models here!
They can be found under https://cloud.marquis.site/s/ZUVIIIQv6yznBj6
## Datasets
Each model has a folder "dataset" with the following files:
**full_dataset.json**: Entire dataset with cleaned ingredients and instructions. This is the same file as found for each version in the main data directory.
**complete_dataset.json**: dataset containing only URLs and instructions, separated depending on the version
**model_datapoints.txt**: list of only the instruction datapoints from **complete_dataset.json**
**training_data.txt**: instruction datapoints from recipes set aside for training
**testing_data.txt**: instruction datapoints from recipes set aside for testing