- Design a web scraper using an existing Java api
- Configure it to what we want or add more functions
- Do some data processing (i.e. which news sites contain articles about pop artists)
###Development:
- Make an issue with the bug you're fixing or enchancement your doing and assign yourself or assign yourself an existing issue.
- Go to Slack and msg bubbot with
pr <issue #> from WebScraper
, a new branch would be created a long with a pull request based off the issue. - Run
git fetch
. - Run
git checkout <branch name>
using the branch you have just created. It'll be appended with the issue #. For examplepr 1 from WebScraper
would create a branch callediss1
.
###Execution:
- Clone this repository and download Jaunt to the root of this repository.
- Go into the main directory:
cd src/main
- To Compile:
javac -cp jaunt0.9.9.9/jaunt0.9.9.9.jar;jaf-1.1.1/activation.jar;javamail-1.4.5/mail.jar Email.java MangaFox.java MangaStream.java MangaHere.java Scrape.java
- To Run:
java -cp jaunt0.9.9.9/jaunt0.9.9.9.jar;.;jaf-1.1.1/activation.jar;javamail-1.4.5/mail.jar Scrape