ApifyHttpProxyMiddleware
Index
Methods
__init__
Create a new instance.
Parameters
proxy_settings: dict
Dictionary containing proxy settings, provided by the Actor input.
Returns None
from_crawler
Create an instance of ApifyHttpProxyMiddleware from a Scrapy Crawler.
Parameters
crawler: Crawler
Scrapy Crawler object.
Returns ApifyHttpProxyMiddleware
process_exception
Process an exception that occurs during request processing.
Parameters
request: Request
Scrapy Request object.
exception: Exception
Exception object.
spider: Spider
Scrapy Spider object.
Returns None
process_request
Process a Scrapy request by assigning a new proxy.
Parameters
request: Request
Scrapy Request object.
spider: Spider
Scrapy Spider object.
Returns None
Apify HTTP proxy middleware for Scrapy.
This middleware enhances request processing by adding a 'proxy' field to the request's meta and an authentication header. It draws inspiration from the
HttpProxyMiddlewareincluded by default in Scrapy projects. The proxy URL is sourced from the settings under theAPIFY_PROXY_SETTINGSkey. The value of this key, a dictionary, should be provided by the Actor input. An example of the proxy settings:proxy_settings = {'useApifyProxy': true, 'apifyProxyGroups': []}