A Python script to scrape the Amazon Parent Dashboard. Uses Selenium to navigate pages.
- Reads login data from a .csv file
- Uses this data to login to the Amazon Parent Dashboard
- Handles various types of error messages
- Deals with Captchas using TwoCaptcha API
- Deals with OTP authentification by logging into a mail-service, retrieving the OTP and entering it
- Deals with having to verify login by clicking on a link in an Email
- Upon succesfull login it scrapes the Parent Dashboard for the books read on a date and the reading minutes and writes this to .csv
- Writes a log and backup file in case the script execution fails
- Plays a sound when execution fails
- chromedriver for Selenium
- /csv_in path with login data
- Access to TwoCaptcha API
This is my first larger web-scraping script. The Amazon Login procedure is quite complex and riddled with error messages, therefore the script may not work in all cases.