You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Scrapy is a high-level web crawling and scraping framework for Python. If you use HttpAuthMiddleware (i.e. the http_user and http_pass spider attributes) for HTTP authentication, all requests will expose your credentials to the request target. This includes requests generated by Scrapy components, such as robots.txt requests sent by Scrapy when the ROBOTSTXT_OBEY setting is set to True, or as requests reached through redirects. Upgrade to Scrapy 2.5.1 and use the new http_auth_domain spider attribute to control which domains are allowed to receive the configured HTTP authentication credentials. If you are using Scrapy 1.8 or a lower version, and upgrading to Scrapy 2.5.1 is not an option, you may upgrade to Scrapy 1.8.1 instead. If you cannot upgrade, set your HTTP authentication credentials on a per-request basis, using for example the w3lib.http.basic_auth_header function to convert your credentials into a value that you can assign to the Authorization header of your request, instead of defining your credentials globally using HttpAuthMiddleware.
CVE-2021-41125 - Medium Severity Vulnerability
Vulnerable Library - Scrapy-1.5.1-py2.py3-none-any.whl
A high-level Web Crawling and Web Scraping framework
Library home page: https://files.pythonhosted.org/packages/5d/12/a6197eaf97385e96fd8ec56627749a6229a9b3178ad73866a0b1fb377379/Scrapy-1.5.1-py2.py3-none-any.whl
Path to dependency file: /scrapers/requirements.txt
Path to vulnerable library: /scrapers/requirements.txt
Dependency Hierarchy:
Found in HEAD commit: 35017e222f0982c0f71acdb8134994d36da4c50f
Vulnerability Details
Scrapy is a high-level web crawling and scraping framework for Python. If you use
HttpAuthMiddleware
(i.e. thehttp_user
andhttp_pass
spider attributes) for HTTP authentication, all requests will expose your credentials to the request target. This includes requests generated by Scrapy components, such asrobots.txt
requests sent by Scrapy when theROBOTSTXT_OBEY
setting is set toTrue
, or as requests reached through redirects. Upgrade to Scrapy 2.5.1 and use the newhttp_auth_domain
spider attribute to control which domains are allowed to receive the configured HTTP authentication credentials. If you are using Scrapy 1.8 or a lower version, and upgrading to Scrapy 2.5.1 is not an option, you may upgrade to Scrapy 1.8.1 instead. If you cannot upgrade, set your HTTP authentication credentials on a per-request basis, using for example thew3lib.http.basic_auth_header
function to convert your credentials into a value that you can assign to theAuthorization
header of your request, instead of defining your credentials globally usingHttpAuthMiddleware
.Publish Date: 2021-10-06
URL: CVE-2021-41125
CVSS 3 Score Details (6.5)
Base Score Metrics:
Suggested Fix
Type: Upgrade version
Origin: GHSA-jwqp-28gf-p498
Release Date: 2021-10-06
Fix Resolution: scrapy - 1.8.1, 2.5.1
Step up your Open Source Security Game with WhiteSource here
The text was updated successfully, but these errors were encountered: