Blatzar/scraping-tutorial

Requests based scraping tutorial

Want to start scraping? This guide will teach you, and not some baby Selenium scraping. It uses only raw requests, with examples in both Python and Kotlin. Basic programming knowledge in one of those languages is all you need to follow along.
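To make "raw requests" concrete, here is a minimal sketch of the idea: download a page with one plain HTTP request and pull the links out of the HTML, with no browser involved. It is only an illustration, not taken from the guide's chapters. It uses Python's standard library so it runs without installing anything, and the names `fetch`, `extract_links`, and `LinkCollector`, the default `User-Agent` value, and the `https://example.com` URL are all assumptions made up for this example.

```python
from html.parser import HTMLParser
from urllib.request import Request, urlopen


class LinkCollector(HTMLParser):
    """Collects the href of every <a> tag it sees (hypothetical helper)."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        # attrs is a list of (name, value) pairs; tag names arrive lowercased.
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                self.links.append(href)


def extract_links(html):
    """Return every link target found in an HTML string, in document order."""
    parser = LinkCollector()
    parser.feed(html)
    parser.close()
    return parser.links


def fetch(url, user_agent="Mozilla/5.0"):
    """Download a page as text with a single plain HTTP request."""
    # Assumption: a browser-like User-Agent header; some sites may need more headers.
    request = Request(url, headers={"User-Agent": user_agent})
    with urlopen(request, timeout=10) as response:
        charset = response.headers.get_content_charset() or "utf-8"
        return response.read().decode(charset, errors="replace")


# Example usage (placeholder URL, needs network access):
# links = extract_links(fetch("https://example.com"))
```

The fetching and the parsing are kept in separate functions so the parsing half can be tried on a saved HTML string without touching the network.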

If you find any aspect of this guide confusing, please open an issue about it and I will try to improve things.

If you do not know programming at all, this guide will not help you. Learn programming first! Real scraping cannot be done by copy-pasting with a vague understanding.

  1. Starting scraping from zero
  2. Properly scraping JSON APIs often found on sites
  3. Evading developer tools detection when scraping
  4. Why your requests fail and how to fix them
  5. Finding links and scraping videos

Once you've read and understood the concepts behind scraping, take a look at a provider in CloudStream. I added tons of comments to make every aspect of writing CloudStream providers clear. Even if you're not planning on contributing to CloudStream, looking at the code may help.

Take a look at Thenos for an example of JSON-based scraping in Kotlin.

About

Tutorial for scraping streaming sites
