mirror of https://github.com/lorien/awesome-web-scraping.git synced 2024-11-24 08:32:19 +02:00

Request to add the data collector

2022-03-16 11:39:57 +02:00

Web-scraping Web Services

Web-data Extracting Services

Dataflow kit - Turn websites data into structured data with a simple point-and-click toolkit
ProxyCrawl - Crawl and scrape any website without blocks, captchas or proxies
ScraperAPI - A service that manages proxies and headless browsers, exposing a single API endpoint to scrape any url.
import.io
ScraperWiki
Mozenda
PhantomJs.Cloud
CloudScrape
DiffBot
Apify - A serverless web scraping, data extraction and web automation platform
Portia; also on GitHub: scrapinghub/portia
Dexi
Morph.io free of charge, fully open-source service
Page.REST
ParseHub
WrapAPI
Agenty
ScrapingBee - A web scraping API that handles rotating proxies and headless browsers.
SerpApi - Real-time API to access structured search results of search engines.
ScrapingAnt - Web Scraping API with thousands of residential proxies and headless Chrome cluster.
Zyte (formerly Scrapinghub) - Web data extraction services and platform, also lead maintainers of Scrapy.
ProxiesAPI - Rotating proxies API with automatic retries, CAPTCHA handling and javacript rendering.
ZenRows - Web Scraping API & proxy server that bypasses any anti-bot solution while offering javascript rendering, rotating proxies, and geotargeting.
Data Collector - Pre-made collectors with web unlocking features and a JavaScript IDE to build your own scraper.