Skip to content

jan-schaeffer/amazon_pd_scrape

Repository files navigation

Amazon Parent Dashboard Scrape

A Python script to scrape the Amazon Parent Dashboard. Uses Selenium to navigate pages.

Features

  • Reads login data from a .csv file
  • Uses this data to login to the Amazon Parent Dashboard
  • Handles various types of error messages
  • Deals with Captchas using TwoCaptcha API
  • Deals with OTP authentification by logging into a mail-service, retrieving the OTP and entering it
  • Deals with having to verify login by clicking on a link in an Email
  • Upon succesfull login it scrapes the Parent Dashboard for the books read on a date and the reading minutes and writes this to .csv
  • Writes a log and backup file in case the script execution fails
  • Plays a sound when execution fails

Requires

  • chromedriver for Selenium
  • /csv_in path with login data
  • Access to TwoCaptcha API

Amazon Parent Dashboard

Amazon Parent Dashboard

Captcha

Captcha

Captcha Retry

Captcha Retry

Verify Email

Verify Email

Email Link

Email Link

Allow Access

Allow Access

Final Note

This is my first larger web-scraping script. The Amazon Login procedure is quite complex and riddled with error messages, therefore the script may not work in all cases.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages