openSUSE - Datamining scripts

Use create-virtualenv.sh to create a Python virtual environment (requires pip).

To generate the bots.txt file, use the user-agents.sh script.
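As a rough illustration of what a bot list could look like, the sketch below filters a list of user agent strings down to those that resemble crawlers. It is not taken from user-agents.sh: the keyword pattern, the function name `extract_bot_agents`, and the sample agents are all assumptions made for this example.

```python
import re

# Hypothetical heuristic: substrings that often appear in crawler user agents.
# The real user-agents.sh script may use a different rule entirely.
BOT_PATTERN = re.compile(r"bot|crawler|spider|slurp", re.IGNORECASE)


def extract_bot_agents(user_agents):
    """Return the sorted, de-duplicated user agents that look like bots."""
    return sorted({ua for ua in user_agents if BOT_PATTERN.search(ua)})


# Illustrative input only; not data from the openSUSE logs.
sample_agents = [
    "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)",
    "Mozilla/5.0 (X11; Linux x86_64; rv:109.0) Gecko/20100101 Firefox/115.0",
    "Mozilla/5.0 (compatible; bingbot/2.0; +http://www.bing.com/bingbot.htm)",
    "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)",
]

bot_agents = extract_bot_agents(sample_agents)
```

Writing `bot_agents` one entry per line would give a file in the spirit of bots.txt, although the actual file format is not specified here.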

To convert the Apache log files to a BerkeleyDB representation, use log2db.sh. After that, use the analyze.sh script to convert the result into a BerkeleyDB queue for fast data analysis.
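The conversion step can be pictured with a minimal sketch that parses a line in Apache's combined log format and groups the records by client host. This is not the code in log2db.sh: the regular expression, the function names `parse_line` and `index_by_host`, and the sample line are assumptions, and a plain in-memory dict stands in for the BerkeleyDB store.

```python
import re

# Apache "combined" log format; assumed here, the real logs may differ.
LOG_PATTERN = re.compile(
    r'(?P<host>\S+) (?P<ident>\S+) (?P<user>\S+) '
    r'\[(?P<time>[^\]]+)\] '
    r'"(?P<request>[^"]*)" '
    r'(?P<status>\d{3}) (?P<size>\S+) '
    r'"(?P<referer>[^"]*)" "(?P<agent>[^"]*)"'
)


def parse_line(line):
    """Parse one log line into a dict, or return None if it does not match."""
    match = LOG_PATTERN.match(line)
    if match is None:
        return None
    record = match.groupdict()
    record["status"] = int(record["status"])
    # Apache writes "-" when no bytes were sent.
    record["size"] = 0 if record["size"] == "-" else int(record["size"])
    return record


def index_by_host(lines):
    """Group parsed records by client host (a dict stands in for the database)."""
    index = {}
    for line in lines:
        record = parse_line(line)
        if record is not None:
            index.setdefault(record["host"], []).append(record)
    return index


# Illustrative line only; not taken from the real log files.
sample_line = (
    '192.0.2.1 - - [10/Oct/2014:13:55:36 +0000] '
    '"GET /index.html HTTP/1.1" 200 2326 '
    '"-" "Mozilla/5.0 (X11; Linux x86_64)"'
)

index = index_by_host([sample_line, "not a log line"])
```

Lines that do not match the pattern are skipped rather than raising an error, which keeps a single malformed entry from aborting a long conversion run.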

The IPython notebook "Group and tables.ipynb" contains all the cells used to analyze the data.