mirror of
https://github.com/lorien/awesome-web-scraping.git
synced 2024-11-24 08:32:19 +02:00
eab9ed7740
Request to add the data collector
2.2 KiB
2.2 KiB
Web-scraping Web Services
Web-data Extracting Services
- Dataflow kit - Turn websites data into structured data with a simple point-and-click toolkit
- ProxyCrawl - Crawl and scrape any website without blocks, captchas or proxies
- ScraperAPI - A service that manages proxies and headless browsers, exposing a single API endpoint to scrape any url.
- import.io
- ScraperWiki
- Mozenda
- PhantomJs.Cloud
- CloudScrape
- DiffBot
- Apify - A serverless web scraping, data extraction and web automation platform
- Portia; also on GitHub: scrapinghub/portia
- Dexi
- Morph.io free of charge, fully open-source service
- Page.REST
- ParseHub
- WrapAPI
- Agenty
- ScrapingBee - A web scraping API that handles rotating proxies and headless browsers.
- SerpApi - Real-time API to access structured search results of search engines.
- ScrapingAnt - Web Scraping API with thousands of residential proxies and headless Chrome cluster.
- Zyte (formerly Scrapinghub) - Web data extraction services and platform, also lead maintainers of Scrapy.
- ProxiesAPI - Rotating proxies API with automatic retries, CAPTCHA handling and javacript rendering.
- ZenRows - Web Scraping API & proxy server that bypasses any anti-bot solution while offering javascript rendering, rotating proxies, and geotargeting.
- Data Collector - Pre-made collectors with web unlocking features and a JavaScript IDE to build your own scraper.