Book Scraper is a python application that scrapes the book catalogue website, http://books.toscrape.com. It then allows you to look at the data related to books on the different pages of the website.
The purpose of this application was to become more familiar with web scraping.
- Scrapes the web site: http://books.toscrape.com
- Asynchronous requests
- Console menu with the options:
- Look at highest rated books
- Look at cheapest books
- Get next available book in catalogue
- Exit menu
- Console menu also has a progress bar
To run, in the console enter:
python app.py
- BeautifulSoup4
- Allows a web page to be scraped (parses its HTML or XML)
- requests
- Allows sending HTTP requests
- logging
- Allows for logging information.
- asyncio
- For using event loops.
- aiohttp
- For creating asynchronous HTTP Client/Server connections with asyncio.
- async_timeout
- Asyncio-compatible timeout context manager.
- Timeouts requests that take too long.
- Asyncio-compatible timeout context manager.
- time
- For timing code execution time.