URL = http://erms.gujarat.gov.in/ceo-gujarat/master/frmEPDFRoll.aspx
Year = Draft Roll for 2017
conda env create -f tools/environment.yml
to install working environment andsource activate erolls
- Or,
pip install -r requirements.txt
if not using a conda environment tools/utils.py
is a helper function for downloading files, and sanity checkspython gujarat.py
to downloads all the pdfs to directory../data/Gujarat/
and creates 'Gujarat.txt' for files that were not downloaded successfullypython gujarat_retry.py
for retrying downloads for files in 'Gujarat.txt'python gujarat_SanityCheck.py
for doing a sanity check on the files downloaded
- Total Number of files = 43142
- The downloaded files are of form NORMAL_AC{assembly constituency number}N{assembly constituency number}{Part Number}.pdf
- Files not available can be found in 'Gujarat3.txt''