This is still work in progress and most likely contains one or two bugs. If you find one, please report it through the issue tracker. Note that this project is better understood as a framework for crawling, processing, and analysing web data, with the ability to apply it to the Tor network. Depending on the proxy used, it can also be run on other networks.
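As a rough illustration of how the proxy determines the target network, here is a minimal sketch of routing a request through a local Tor SOCKS proxy. The proxy address (`127.0.0.1:9050`, Tor's default) and the function name are assumptions for illustration, not part of this project's code; pointing the proxy setting elsewhere would target a different network.

```python
# Minimal sketch: fetching a page through a local Tor SOCKS proxy.
# Assumes Tor is listening on its default port 9050 and that
# requests is installed with SOCKS support (requests[socks]).
import requests

TOR_PROXY = "socks5h://127.0.0.1:9050"  # socks5h resolves .onion names inside Tor

def fetch(url: str) -> str:
    """Fetch a page through the configured proxy and return its body."""
    response = requests.get(
        url,
        proxies={"http": TOR_PROXY, "https": TOR_PROXY},
        timeout=60,
    )
    response.raise_for_status()
    return response.text
```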
The Darknet Spider consists of several modules, each represented by a subfolder in the project. The three most important submodules are described below.
The crawler is a program that traverses the Tor network by following links recursively. In its current state it collects each link exactly once and supports different prioritisation modes that determine the order of the crawl.
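To make the crawl loop concrete, below is a minimal sketch combining a visited set (each link collected once) with a priority queue (pluggable prioritisation). The function names and the `priority` callback are hypothetical illustrations, not the project's actual API.

```python
# Illustrative crawl loop: a visited set guarantees each link is
# collected only once; a heap orders URLs by a pluggable priority.
import heapq
import re

LINK_RE = re.compile(r'href="(http[^"]+)"')  # crude link extraction for the sketch

def crawl(seed_urls, fetch, priority, max_pages=1000):
    """Crawl recursively; `priority(url) -> float` decides order (lower = sooner)."""
    seen = set(seed_urls)
    queue = [(priority(url), url) for url in seed_urls]
    heapq.heapify(queue)
    fetched = 0
    while queue and fetched < max_pages:
        _, url = heapq.heappop(queue)
        try:
            body = fetch(url)      # e.g. the Tor-proxied fetch sketched above
        except Exception:
            continue               # unreachable hidden services are common
        fetched += 1
        for link in LINK_RE.findall(body):
            if link not in seen:   # collect each link only once
                seen.add(link)
                heapq.heappush(queue, (priority(link), link))
    return seen
```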
The software requires a configured Postgres database, in which the collected data is stored for further analysis.
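A hedged sketch of what persisting crawl results to Postgres could look like, using psycopg2. The connection parameters and the `pages` table schema are assumptions made for illustration; the project's actual schema may differ.

```python
# Sketch: storing fetched pages in Postgres via psycopg2.
# Connection parameters and table layout are illustrative assumptions.
import psycopg2

conn = psycopg2.connect(
    host="localhost", dbname="darknet_spider",
    user="spider", password="secret",
)

with conn, conn.cursor() as cur:
    cur.execute(
        """
        CREATE TABLE IF NOT EXISTS pages (
            url        TEXT PRIMARY KEY,
            body       TEXT,
            fetched_at TIMESTAMPTZ DEFAULT now()
        )
        """
    )
    cur.execute(
        "INSERT INTO pages (url, body) VALUES (%s, %s) ON CONFLICT DO NOTHING",
        ("http://example.onion", "<html>...</html>"),
    )
```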
The Darknet Spider contains two additional modules: one for preprocessing the collected data and another for applying machine learning techniques to the collected and preprocessed material. Within the /classifier folder, one can include one's own algorithms to be applied to the data.
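As a hypothetical example of such a drop-in algorithm, the sketch below builds a simple text classifier with scikit-learn. The function names and the interface are assumptions for illustration, not the /classifier module's actual contract.

```python
# Hypothetical /classifier algorithm: TF-IDF features plus a
# Naive Bayes classifier over the preprocessed page texts.
# The build/classify interface is an illustrative assumption.
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.naive_bayes import MultinomialNB
from sklearn.pipeline import make_pipeline

def build_classifier():
    """Return a text-classification pipeline for the preprocessed pages."""
    return make_pipeline(TfidfVectorizer(), MultinomialNB())

def classify(texts, labels, unseen):
    """Train on labelled page texts and predict labels for unseen pages."""
    model = build_classifier()
    model.fit(texts, labels)
    return model.predict(unseen)
```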