Compare Proposal

Nothing to compare.

Scraping Web and PDF recipes

  • Posted at : 1 month ago
  • Post Similar Project
30

Budget
37
Proposals
506
Views
Closed
Status
Skills Required

Posted By -

KL

4.9
Projects Posted : 128
Projects Paid : 47
Services Purchased : 2
Total Spent :
1040
Feedbacks : 76 %

Project Details show (+) hide (-)

Starting at https://therawadvantage.com/category/rawsome-recipes/ , want to capture the recipes, starting with title, ingredients, instructions, notes (the human understanding of "recipe") as well as video link and image(s) close to the recipes.  So this is a pure HTML scrape.  For the PDFs, add the book title or a shorthand for it as a keyword, so that I can import that as a category.

Target is the JSON import format for WPRecipemaker, https://help.bootstrapped.ventures/article/197-import-recipes-from-json

Following this, the same code/logic can be used to parse about a dozen recipe books.