Year = 2018
Total number of files = 600
The Script iterates through the HTML Table and does two things:
-
Writes a sikkim.csv that tracks metadata about the pdf. The pdf has the following columns:
ac_number, ac_name, part_number, polling_station_name, relative_path_to_file
-
Downloads all the pdfs to sikkim_pdfs/
pip install -r requirements.txt
python sikkim.py
There are missing PDF files (return HTTP 404)