Football Players Information Scraper in Python

Description

This project scrapes player data from a list of URLs, saves the scraped data to a CSV file, and then updates a database with that data. It takes the path to the URLs file as a command-line argument and protects data integrity by checking for valid CSV files and non-null data.
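The integrity checks mentioned above could look roughly like the sketch below. It is an illustration only, not the project's actual code: the function names and the idea of a list of required fields are assumptions, and it uses the standard library csv module rather than Pandas to stay self-contained.

```python
import csv
from pathlib import Path


def is_valid_csv_path(path):
    """Return True if the path is an existing file with a .csv extension."""
    p = Path(path)
    return p.is_file() and p.suffix.lower() == ".csv"


def drop_incomplete_rows(rows, required_fields):
    """Keep only rows where every required field is present and non-empty."""
    kept = []
    for row in rows:
        # Treat a missing key, None, and a whitespace-only string as null.
        if all((row.get(field) or "").strip() for field in required_fields):
            kept.append(row)
    return kept


def load_valid_rows(path, required_fields):
    """Read a CSV file and return only its complete rows (hypothetical helper)."""
    if not is_valid_csv_path(path):
        raise ValueError(f"Not a valid CSV file: {path}")
    with open(path, newline="", encoding="utf-8") as handle:
        return drop_incomplete_rows(csv.DictReader(handle), required_fields)
```

Rejecting bad input before it reaches the database keeps later steps simple, since they can assume every row has the fields they need.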

Features

Scrapes player data from given URLs.
Saves valid scraped data into a CSV file.
Loads the initial database contents from the provided playerData.CSV file.
Updates a SQLite database with the scraped data.
Includes SQL queries for insight into the data.
Command-line interface for easy use.
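As a rough illustration of the database update step, the sketch below inserts new players and updates existing ones in SQLite. The table name, the columns (url, name, club), and the choice of the URL as the key are all assumptions made for the example; the project's real schema may differ. The ON CONFLICT clause needs SQLite 3.24 or newer.

```python
import sqlite3


def upsert_players(conn, players):
    """Insert new players or update existing ones, keyed on the URL.

    The schema here is hypothetical and only serves the example.
    """
    conn.execute(
        """
        CREATE TABLE IF NOT EXISTS players (
            url  TEXT PRIMARY KEY,
            name TEXT NOT NULL,
            club TEXT
        )
        """
    )
    conn.executemany(
        """
        INSERT INTO players (url, name, club)
        VALUES (:url, :name, :club)
        ON CONFLICT(url) DO UPDATE SET
            name = excluded.name,
            club = excluded.club
        """,
        players,
    )
    conn.commit()


# Usage with an in-memory database: the second call updates the same row.
conn = sqlite3.connect(":memory:")
upsert_players(conn, [{"url": "u1", "name": "A", "club": "X"}])
upsert_players(conn, [{"url": "u1", "name": "A", "club": "Y"}])
```

An upsert keeps the update step idempotent, so running the import twice on the same scraped file does not create duplicate rows.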

Requirements

Python 3.x
Pandas
Other dependencies as listed in requirements.txt

Installation

Clone the repository and navigate to the project directory.
Install the required Python packages:

pip install -r requirements.txt

Scraping Data

Prepare a CSV file containing the URLs to scrape, with one URL per line.
Run the scraper script with the path to your URLs file:

python run_scraper.py path/to/your/urls_file.csv
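For reference, reading the URLs file and handling the command-line argument might be done along these lines. This is a sketch under assumptions, not the contents of run_scraper.py: the function names and the argument name urls_file are hypothetical.

```python
import argparse
import csv


def parse_urls(lines):
    """Return the non-empty URLs found in the first column of each line."""
    return [row[0].strip() for row in csv.reader(lines) if row and row[0].strip()]


def read_urls(path):
    """Open the URLs file and return its URLs as a list."""
    with open(path, newline="", encoding="utf-8") as handle:
        return parse_urls(handle)


def parse_args(argv=None):
    """Parse the command line; the single positional argument is the URLs file."""
    parser = argparse.ArgumentParser(
        description="Scrape player data from a list of URLs."
    )
    parser.add_argument("urls_file", help="path to a CSV file with one URL per line")
    return parser.parse_args(argv)
```

A script would then call read_urls(parse_args().urls_file) and pass each URL to the scraper. Blank lines and surrounding whitespace are skipped so that a trailing newline in the file does not produce an empty URL.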

Then run the rest of the pipeline, which loads the data into the database:

python run_import_data.py

The scraped data is saved to scraped_player_data.csv in the data folder.

Running Tests

To run the tests that verify the scraping and data processing, run either of the following commands from the tests folder:

python -m unittest test_scraper_output.py
python -m unittest test_scraper.py
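For readers new to unittest, a test module in this style might look like the sketch below. The helper clean_player_record is hypothetical and defined inline so the example is self-contained; the real tests exercise the project's own scraper code.

```python
import unittest


def clean_player_record(record):
    """Hypothetical helper: strip whitespace and turn empty strings into None."""
    cleaned = {}
    for key, value in record.items():
        if isinstance(value, str):
            value = value.strip() or None
        cleaned[key] = value
    return cleaned


class CleanPlayerRecordTest(unittest.TestCase):
    def test_strips_whitespace(self):
        self.assertEqual(clean_player_record({"name": "  Ana  "}), {"name": "Ana"})

    def test_empty_string_becomes_none(self):
        self.assertIsNone(clean_player_record({"club": "   "})["club"])
```

Saved as a file in the tests folder, a module like this is picked up by python -m unittest in the same way as the commands above.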

SQL Queries

The sql_queries folder contains three SQL queries that correspond to the three queries on page 2 of the PDF. The results of each query are provided in CSV files.
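One way to produce a CSV file from a query result is sketched below. The query, the table, and the function name query_to_csv are made up for the example and are not one of the three queries in the repository.

```python
import csv
import io
import sqlite3


def query_to_csv(conn, sql, out):
    """Run a query and write a header row plus all result rows as CSV."""
    cursor = conn.execute(sql)
    writer = csv.writer(out, lineterminator="\n")
    # cursor.description holds one tuple per column; the first item is its name.
    writer.writerow([column[0] for column in cursor.description])
    writer.writerows(cursor)


# Usage with a small in-memory table (hypothetical schema and data).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE players (name TEXT, club TEXT)")
conn.executemany(
    "INSERT INTO players VALUES (?, ?)",
    [("A", "X"), ("B", "X"), ("C", "Y")],
)
buffer = io.StringIO()
query_to_csv(
    conn,
    "SELECT club, COUNT(*) AS players FROM players GROUP BY club ORDER BY club",
    buffer,
)
```

Passing an open file instead of the StringIO buffer writes the result to disk.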
