The main objective of this project is to create a crawler who could extract the title, name and url of all the products in this website: http://www.epocacosmeticos.com.br/.
Mozilla Firefox
webdriver geckodriver
Python 3
Selenium
BeautifulSoup4
Requests
You can use git to clone
git clone https://github.com/Gabrielly-Andrade/webCrawler.git
or you can download the zip package
You can install the items in this steps using pip
-
pip install selenium
pip install beautifulsoup4
pip install requests
After installing everything, you need to open the terminal, navigate to the right path (use cd to open the src file) and run
python crawler.py