Daman and Diu Archives

URLs:

Script

The Script does 3 things:

  1. Produces daman_2015.csv and daman_2016.csv, which contain metadata about the PDFs. Each CSV has the following fields: year, language, poll_station_no, file_name

  2. Downloads all the PDFs to a directory called daman_201x/, where x is the last digit of the year

  3. Renames the pdfs:

  • English language rolls have the prefix eng and Gujarati language rolls have the prefix guj.
  • The polling station number is written as a 3-digit, zero-padded number.

So a sample file name is eng_001.pdf
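
The naming rule and the CSV layout described above can be sketched roughly as follows. This is a minimal illustration and not the code in daman_archives.py: the helper names (roll_file_name, metadata_row, write_metadata) are hypothetical, and it assumes the language column holds values such as "english" and "gujarati", which the README does not specify.

```python
import csv
import io

# Column order taken from the field list in this README.
FIELDS = ["year", "language", "poll_station_no", "file_name"]

# Assumption: languages are spelled out like this in the metadata.
PREFIXES = {"english": "eng", "gujarati": "guj"}


def roll_file_name(language, poll_station_no):
    """Build a name such as eng_001.pdf (hypothetical helper)."""
    prefix = PREFIXES[language.lower()]
    # :03d zero-pads the polling station number to 3 digits.
    return f"{prefix}_{int(poll_station_no):03d}.pdf"


def metadata_row(year, language, poll_station_no):
    """Assemble one CSV row for a single PDF (hypothetical helper)."""
    return {
        "year": year,
        "language": language,
        "poll_station_no": poll_station_no,
        "file_name": roll_file_name(language, poll_station_no),
    }


def write_metadata(rows, out):
    """Write a header plus the given rows to an open text file object."""
    writer = csv.DictWriter(out, fieldnames=FIELDS)
    writer.writeheader()
    writer.writerows(rows)


# Write to an in-memory buffer here; a real run would open daman_2015.csv.
buffer = io.StringIO()
write_metadata(
    [metadata_row(2015, "english", 1), metadata_row(2015, "gujarati", 12)],
    buffer,
)
print(buffer.getvalue())
```

Keeping the name construction in one small function means the CSV's file_name column and the renamed files on disk cannot drift apart.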

Running the script

pip install -r requirements.txt
python daman_archives.py

Misc. info.

There are no electoral rolls in Gujarati for 2016.