Skip to content

ScrapeOps/scrapeops-docs

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

17 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

ScrapeOps Docs

The following is documentation on how to setup and use ScrapeOps with your Scrapy spiders.

💻 Demo

🔗 ScrapeOps Dashboard Demo

⭐ Features

  • Scrapy Job Stats & Visualisation

    • 📈 Individual Job Progress Stats
    • 📊 Compare Jobs versus Historical Jobs
    • 💯 Job Stats Tracked
      • ✅ Pages Scraped & Missed
      • ✅ Items Parsed & Missed
      • ✅ Item Field Coverage
      • ✅ Runtimes
      • ✅ Response Status Codes
      • ✅ Success Rates & Average Latencies
      • ✅ Errors & Warnings
      • ✅ Bandwidth
  • Health Checks & Alerts

    • 🕵️‍♂️ Custom Spider & Job Health Checks
    • 📦 Out of the Box Alerts - Slack (More coming soon!)
    • 📑 Daily Scraping Reports
  • ScrapyD Cluster Management

    • 🔗 Integrate With ScrapyD Servers
    • ⏰ Schedule Periodic Jobs
    • 💯 All Scrapyd JSON API Supported
    • 🔐 Secure Your ScrapyD with BasicAuth, HTTPS or Whitelisted IPs
  • Proxy Monitoring (Coming Soon)

    • 📈 Monitor Your Proxy Account Usage
    • 📉 Track Your Proxy Providers Performance
    • 📊 Compare Proxy Performance Verus Other Providers

🚀 Getting Started

To use ScrapeOps you first need to create a free account and get your free API_KEY.

Create Free Account

There are 2 way you can use ScrapeOps:

  1. ScrapeOps Logger Mode
  2. ScrapyD Manager Mode

1) Spider Logger Mode

In this mode the ScrapeOps SDK will log all your scraping stats and generate statistics, graphs and trigger alerts on the ScrapeOps dashboard. Getting setup is very easy, you just need to add 3 lines to your Scrapy projects settings.py file and the ScrapeOps SDK will take care of the rest.

Detailed Read: ScrapeOps SDK Installation Guide

2) ScrapyD Manager Mode

In this mode, if you connect ScrapeOps with your ScrapyD server you will be able to schedule and manage your ScrapyD spiders via the ScrapeOps dashboard.

Note: To use the stats, graphs and alerts functionality of ScrapeOps, you need to install the ScrapeOps SDK in your Scrapy spiders.

Read: ScrapeOps ScrapyD Integration Guide

Releases

No releases published

Packages

No packages published

Contributors 3

  •  
  •  
  •