This repository runs a crawler in Docker containers, from crawling the target URLs to storing the collected data in a database.
- Install and run the Docker machine.
- Clone this repository.
- Inside the repository:
  - Check the existing target_url_crawler and data_cralwer, or create new ones as per your requirements.

  NOTE: After running Docker, you can access MySQL at:
  http://<docker-public ip>:8080?server=172.30.0.5
  To find the Docker public IP, run: docker-machine ip default
  On Mac: http://localhost:8080?server=172.30.0.5
- Change the command in go_spider.py as per your crawler.
- Build and run the crawler:
  docker-compose up --build
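
The MySQL address mentioned in the NOTE above can also be assembled in a few lines of Python. This is only a minimal sketch: the helper names `docker_machine_ip` and `mysql_url` are hypothetical and not part of this repository, while the port 8080 and the server address 172.30.0.5 are taken from the NOTE.

```python
import subprocess
from urllib.parse import urlencode

MYSQL_SERVER = "172.30.0.5"  # server address given in the NOTE
WEB_PORT = 8080              # port given in the NOTE


def docker_machine_ip(machine: str = "default") -> str:
    """Return the IP printed by `docker-machine ip <machine>` (hypothetical helper)."""
    result = subprocess.run(
        ["docker-machine", "ip", machine],
        capture_output=True,
        text=True,
        check=True,
    )
    return result.stdout.strip()


def mysql_url(host: str, port: int = WEB_PORT, server: str = MYSQL_SERVER) -> str:
    """Build the URL from the NOTE for a given Docker host (hypothetical helper)."""
    return f"http://{host}:{port}?{urlencode({'server': server})}"
```

On Mac, `mysql_url("localhost")` gives `http://localhost:8080?server=172.30.0.5`. Elsewhere, `mysql_url(docker_machine_ip())` should give the address for the default Docker machine, assuming `docker-machine` is on the PATH.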
Note:
- If you change a service, first delete that service's container before building and running again.
- You can access the volume from inside a running container:
  docker exec -it <container-id/name> bash
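
The first point above (delete the service container, then rebuild) could be scripted roughly as follows. This is a sketch under assumptions: the function names and the service name `crawler` are hypothetical, and it assumes the container is removed with `docker-compose rm -s -f`, which stops the service's container if needed and removes it without prompting. The rebuild step is the same `docker-compose up --build` command used earlier.

```python
import subprocess
from typing import List


def rebuild_commands(service: str) -> List[List[str]]:
    """Return the commands to remove one service's container and rebuild.

    `service` is the service name from docker-compose.yml (an assumption here).
    """
    return [
        # -s stops the container if it is running, -f skips the confirmation prompt
        ["docker-compose", "rm", "-s", "-f", service],
        # same build-and-run command as in the steps above
        ["docker-compose", "up", "--build"],
    ]


def rebuild_service(service: str) -> None:
    """Run the commands in order, stopping at the first failure."""
    for cmd in rebuild_commands(service):
        subprocess.run(cmd, check=True)
```

Calling `rebuild_service("crawler")` from the repository root would then remove the hypothetical `crawler` service's container and bring everything up again with a fresh build.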