Compare Proposal

Nothing to compare.

Develop a Scraper for a Job Portal with our Scraping System One Groovy Class with CSS Selectors

  • Posted at : 1 month ago
  • Post Similar Project
30

Budget
4
Proposals
240
Views
Open
Status
Skills Required

Posted By -

JR

0.0
Projects Posted : 1
Projects Paid : 0
Services Purchased : 0
Total Spent :
0
Feedbacks : 0 %

Project Details show (+) hide (-)

## Task: Develop a Scraper for a Job Portal with our Scraping System (One Groovy / Java Class with CSS Selectors)

Your task is to develop a scraper for the website [Jobrino](https://www.jobrino.com/)
based on our system on [Github](https://github.com/TechMap-io/dev.scraper.techmap.io) - just
copy the scraper for Stepstone and adapt the new Scraper to Jobrino.

#### Task Notes
* Required Skills: Groovy or Java, CSS (Selectors), Developer Tools (Chrome, etc.), Jsoup
* Effort per scraper: approx. 4-8 hours
* Potential for 30+ other scrapers / projects - if we like your code & productivity.

#### General Steps:
1. Checkout the Github project
* `git clone https://github.com/TechMap-io/dev.scraper.techmap.io`
2. Test if the system runs on your machine
* see README.md
3. Create a (feature) branch for your scraper (just use the website's name, e.g., "Jobrino")
* `git checkout -b Jobrino master`
4. Copy & rename the class StepstoneScraper.groovy to JobrinoScraper.groovy
5. Adapt the CSS selectors and Groovy sourcecode to work with the Jobrino website
6. Test if it scrapes all relevant data (check the Job.groovy, Location.groovy and Company.groovy classes)
7. Final check with 50+ pages from different categories if problems arise.
8. Final cleanup your code
9. Final commit your code / branch and create a pull request
* `git status` (check your changes)
* `git add Jobrino.groovy` (add your Scraper class - there should be no other changes (create an issue or Email if you must change something))
* `git commit -m "Jobrino Job Scraper"` (commit your Scraper class)
* `git push origin Jobrino` (push your branch to GitHub)
* Create the [pull request](https://docs.github.com/en/github/collaborating-with-issues-and-pull-requests/creating-a-pull-request) on GitHub