The following is documentation on how to set up and use ScrapeOps with your Scrapy spiders.
Scrapy Job Stats & Visualisation
- 📈 Individual Job Progress Stats
- 📊 Compare Jobs versus Historical Jobs
- 💯 Job Stats Tracked
- ✅ Pages Scraped & Missed
- ✅ Items Parsed & Missed
- ✅ Item Field Coverage
- ✅ Runtimes
- ✅ Response Status Codes
- ✅ Success Rates & Average Latencies
- ✅ Errors & Warnings
- ✅ Bandwidth
Health Checks & Alerts
- 🕵️‍♂️ Custom Spider & Job Health Checks
- 📦 Out of the Box Alerts - Slack (More coming soon!)
- 📑 Daily Scraping Reports
ScrapyD Cluster Management
- 🔗 Integrate With ScrapyD Servers
- ⏰ Schedule Periodic Jobs
- 💯 All Scrapyd JSON API Endpoints Supported (see the example after this list)
- 🔐 Secure Your ScrapyD with BasicAuth, HTTPS or Whitelisted IPs
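Scrapyd exposes its JSON API over plain HTTP, so every Scrapyd endpoint that ScrapeOps wraps can also be called directly. As a minimal sketch, scheduling a job via Scrapyd's schedule.json endpoint looks like this; the host, project, spider and credentials below are hypothetical placeholders:

```python
import requests

# Schedule a job via Scrapyd's JSON API (schedule.json endpoint).
# Host, project, spider and credentials are hypothetical placeholders.
response = requests.post(
    'http://your-scrapyd-server:6800/schedule.json',
    data={'project': 'myproject', 'spider': 'myspider'},
    auth=('user', 'pass'),  # only needed if your ScrapyD sits behind BasicAuth
)

# Scrapyd responds with JSON, e.g. {'status': 'ok', 'jobid': '...'}
print(response.json())
```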
Proxy Monitoring (Coming Soon)
- 📈 Monitor Your Proxy Account Usage
- 📉 Track Your Proxy Providers' Performance
- 📊 Compare Proxy Performance Versus Other Providers
To use ScrapeOps, you first need to create a free account and get your free API_KEY.
There are two ways you can use ScrapeOps:
1. ScrapeOps SDK: In this mode, the ScrapeOps SDK logs all your scraping stats, generates statistics and graphs, and triggers alerts on the ScrapeOps dashboard. Getting set up is easy: just add 3 lines to your Scrapy project's settings.py file and the ScrapeOps SDK will take care of the rest.
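As a minimal sketch of that setup, the settings.py additions look something like the snippet below. The API key is a placeholder, and the exact extension path and priority value should be checked against the installation guide linked below:

```python
# settings.py -- ScrapeOps SDK integration (sketch; see the installation
# guide below for the canonical extension path and priority value)

SCRAPEOPS_API_KEY = 'YOUR_API_KEY'  # placeholder: the free API key from your dashboard

EXTENSIONS = {
    # Registers the ScrapeOps monitor so job stats are sent to the dashboard
    'scrapeops_scrapy.extension.ScrapeOpsMonitor': 500,
}
```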
Detailed Read: ScrapeOps SDK Installation Guide
2. ScrapyD Integration: In this mode, you connect ScrapeOps with your ScrapyD servers so you can schedule and manage your ScrapyD spiders via the ScrapeOps dashboard.
❗ Note: To use the stats, graphs and alerts functionality of ScrapeOps, you need to install the ScrapeOps SDK in your Scrapy spiders.