Provide a self-contained Python script to scrape all products from dischem.co.za using Scrapy and/or Selenium.
The starting point could be here - https://www.dischem.co.za/shop-by-department
Output all product information into a JSON file that is saved into a "/data" subfolder in the root directory with file format "dischem_[YYYY-MM-DD].json".
All output fields are to be formatted, cleaned and normalised for further processing.
The scraper must be throttled to avoid being blocked by the site.
Required product information must include the items highlighted in the attached images.
Provide script in a github repo, with necessary modules in the requirements.txt file
Provide full Readme with instructions on run the scraper
Budget: $50
Posted On: April 22, 2023 14:28 UTC
Category: Data Extraction
Skills:Python, Data Scraping, Scrapy, Selenium, API
Country: South Africa
click to apply
The starting point could be here - https://www.dischem.co.za/shop-by-department
Output all product information into a JSON file that is saved into a "/data" subfolder in the root directory with file format "dischem_[YYYY-MM-DD].json".
All output fields are to be formatted, cleaned and normalised for further processing.
The scraper must be throttled to avoid being blocked by the site.
Required product information must include the items highlighted in the attached images.
Provide script in a github repo, with necessary modules in the requirements.txt file
Provide full Readme with instructions on run the scraper
Budget: $50
Posted On: April 22, 2023 14:28 UTC
Category: Data Extraction
Skills:Python, Data Scraping, Scrapy, Selenium, API
Country: South Africa
click to apply