electoral_rolls/dadra at master · vahini01/electoral_rolls

readme.md

Requester pays for the charges associated with downloading the data. For more information about about that, see: https://cloud.google.com/storage/docs/requester-pays

Year = Final Electoral Roll for 2017

The Script does three things:

Produces dadra.csv that contains metadata about the pdfs. The CSV has the following fields: language, main_or_supplementary, part_no, file_name
Downloads all the pdfs to a directory called dadra_pdfs/
Renames files as follows:
- English language rolls have the prefix eng and Gujarati language rolls have the prefix guj.
- The main rolls have the word main in them and supplementary supp
- And the last segment is the 3 digit part_no.
So a sample name = eng_main_001.pdf

pip install -r requirements.txt
python dadra.py

There are missing supplementary files getting error 404 (File or directory not found).

Draft roll for 2018 is also available.