• Stars
    star
    133
  • Rank 266,814 (Top 6 %)
  • Language
    Python
  • License
    MIT License
  • Created almost 7 years ago
  • Updated 24 days ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Easily dump cookidoo recipes

cookidump

Easily dump cookidoo recipes from the official website

Description

This program allows you to dump all recipes on Cookidoo websites (available for different countries) for offline and posticipate reading. Those recipes are valid in particular for Thermomix/Bimby devices. In order to dump the recipes, a valid subscription is needed.

The initial concept of this program was based on jakubszalaty/cookidoo-parser.

Mentioning

If you intend to scientifically investigate or extend cookidump, please consider citing the following paper.

@article{cambiaso2022cookidump,
title = {Web security and data dumping: The Cookidump case},
journal = {Software Impacts},
volume = {14},
pages = {100426},
year = {2022},
issn = {2665-9638},
doi = {https://doi.org/10.1016/j.simpa.2022.100426},
url = {https://www.sciencedirect.com/science/article/pii/S2665963822001105},
author = {Enrico Cambiaso and Maurizio Aiello},
keywords = {Cyber-security, Data dump, Database security, Browser automation},
abstract = {In the web security field, data dumping activities are often related to a malicious exploitation. In this paper, we focus on data dumping activities executed legitimately by scraping/storing data shown on the browser. We evaluate such operation by proposing Cookidump, a tool able to dump all recipes available on the Cookidoo© website portal. While such scenario is not relevant, in terms of security and privacy, we discuss the impact of such kind of activity for other scenarios including web applications hosting sensitive information.}
}

Further information can be found at https://www.sciencedirect.com/science/article/pii/S2665963822001105.

Features

  • Easy to run
  • Easy to open HTML output
  • Output including a surfable list of dumped recipes
  • Customizable searches

Installation

nix

nix run github:auino/cookidump -- <outputdir> [--separate-json]

Nix provisions google-chrome together with chromedriver. Only <outputdir> and [--separate-json] arguments are expected.

manual

  1. Clone the repository:
git clone https://github.com/auino/cookidump.git
  1. cd into the download folder

  2. Install Python requirements:

pip install -r requirements.txt
  1. Install the Google Chrome browser, if not already installed

  2. Download the Chrome WebDriver and save it on the cookidump folder

  3. You are ready to dump your recipes

Usage

Simply run the following command to start the program. The program is interactive to simplify it's usage.

python cookidump.py [--separate-json] <webdriverfile> <outputdir>

where:

  • webdriverfile identifies the path to the downloaded Chrome WebDriver (for instance, chromedriver.exe for Windows hosts, ./chromedriver for Linux and macOS hosts)
  • outputdir identifies the path of the output directory (will be created, if not already existent)
  • --separate-json allows to generate a separate JSON file for each recipe, instead of one aggregate file including all recipes

The program will open a Google Chrome window and wait until you are logged in into your Cookidoo account (different countries are supported).

After that, follow intructions provided by the script itself to proceed with the dump.

Considerations

By following script instructions, it is also possible to apply custom filters to export selected recipes (for instance, in base of the dish, title and ingredients, Thermomix/Bimby version, etc.).

Output is represented by an index.html file, included in outputdir, plus a set of recipes inside of structured folders. By opening the generated index.html file on your browser, it is possible to have a list of recipes downloaded and surf to the desired recipe.

The number of exported recipes is limited to around 1000 for each execution. Hence, use of filters may help in this case to reduce the number of recipes exported.

Other approaches

A different approach, previously adopted, is based on the retrieval of structured data on recipes. More information can be found on the datastructure branch. Output is represented in this case in a different (structured) format, hence, it has to be interpreted. Such interpretation is not implemented in the linked previous commit.

TODO

  • Bypass the limited number of exported recipes
  • Parse downloaded recipes to store them on a database, or to generate a unique linked PDF
  • Make Chrome run headless for better speeds
  • Set up a dedicated container for the program

Supporters

  • @vikramsoni2, regarding JSON saves plus minor enhancements
  • @mrwogu, regarding additional information to be extracted on the generated JSON file, plus suggestions on the possibility to save recipes on dedicated JSON files
  • @nilskrause, regarding argument parsing and updates on the link to download the Chrome WebDriver
  • @NightProgramming, regarding the use of selenium version 3
  • @morela, regarding the update of the tool to support a newer version of Selenium
  • @ndjc, fixing some deprecation warnings

Disclaimer

The authors of this program are not responsible of the usage of it. This program is released only for research and dissemination purposes. Also, the program provides users the ability to locally and temporarily store recipes accessible through a legit subscription. Before using this program, check Cookidoo subscription terms of service, according to the country related to the exploited subscription. Sharing of the obtained recipes is not a legit activity and the authors of this program are not responsible of any illecit and sharing activity accomplished by the users.

Contacts

You can find me on Twitter as @auino.

More Repositories

1

500px-osx-background

Programmatically set your Mac background to a random image on 500px
Shell
74
star
2

php-telegram-bot-library

A PHP library to easily write Telegram Bots
PHP
37
star
3

mail-attachments-archiver

Store mail attachments to file-system
Python
13
star
4

virtualphone

virtualphone, aka the self-hosted alternative to Google Voice: make your phone become a virtual phone
Python
12
star
5

your-own-ssl-certificate-authority

A simple script to set up your own certificate authority and generate new (sub)keys
Shell
11
star
6

cryptocurrencies2excel

Generate Excel spreadsheets from live data from crypto-currency markets
Python
9
star
7

chromecast-web-viewer

An easy to use video web viewer supporting Google Chromecast
PHP
7
star
8

reddit-macos-background

Easily set Reddit pictures from a specific subreddit as desktop background for your machine
Shell
1
star
9

kickass-api-wrapper

A KAT wrapper for API, to be used with services like Sonarr
Python
1
star
10

italian-television

An Italian television streaming services collector
JavaScript
1
star
11

wsbpf

WebSocket Based Presentation Framework
JavaScript
1
star
12

smartintercom

Use your Android device as a Smart Intercom
Java
1
star
13

directorytree

Graphical view for your directory tree to enhance productivity
CSS
1
star
14

osx-screensaver-profile

Dynamically set different OS X screen savers for different networks
Shell
1
star
15

seleniumprocessor

A simple library to set up Selenium processes
Python
1
star
16

cool-py-anagrams

A project created as a response to sgabber/cool-scala anagrams software
Python
1
star
17

personalwebsite

An Heroku deployable personal website
JavaScript
1
star