Starting at https://therawadvantage.com/category/rawsome-recipes/ , want to capture the recipes, starting with title, ingredients, instructions, notes (the human understanding of "recipe") as well as video link and image(s) close to the recipes. So this is a pure HTML scrape. For the PDFs, add the book title or a shorthand for it as a keyword, so that I can import that as a category.
Target is the JSON import format for WPRecipemaker, https://help.bootstrapped.ventures/article/197-import-recipes-from-json
Following this, the same code/logic can be used to parse about a dozen recipe books.
Copyright © 2020 | Truelancer.com