Web-Scraping

This application is in scraping.py. The data is fetched from "https://www.theverge.com/" using the python library known as Beautiful soup. The parameters that are used from the data scraped were url, author, date, headline. After scraping/fetching data, a .csv(comma seperated value) file is created with date as its name and in the format "DDMMYYYY_verge.csv".

Comments are written above the code scraping.py in a user understandable language.

The same data is also uploaded to the sqlite database with

Id as its primary key, which is unique for every article with no duplicates in it.
URL of the article
Headline of the article
Author of the article
Date on which the article was published

How to run :

python Scraping.py

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
sql_lite		sql_lite
sqlite-tools-linux-x86-3390200/sqlite-tools-linux-x86-3390200		sqlite-tools-linux-x86-3390200/sqlite-tools-linux-x86-3390200
01092022_verge.csv		01092022_verge.csv
02092022_verge.csv		02092022_verge.csv
03092022_verge.csv		03092022_verge.csv
README.md		README.md
scraping.py		scraping.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Web-Scraping

About

Releases

Packages

Languages

sankalp-25/Web-Scraping

Folders and files

Latest commit

History

Repository files navigation

Web-Scraping

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages