mirror of https://github.com/lorien/awesome-web-scraping.git synced 2024-11-30 08:57:19 +02:00

Remove web services document

Some User 2022-12-31 16:59:15 +03:00
parent 3e9d7654b3
commit 0017663d49
2 changed files with 0 additions and 31 deletions

@@ -15,7 +15,6 @@ Feel free to give feedback or ask web scraping questions in Telegram groups: [@
## Other Things
* [Learning Web Scraping](https://github.com/lorien/learning-web-scraping) - list of articles and books teaching web scraping
* [Web Scraping Services](https://github.com/lorien/web-scraping/blob/master/web_services.md)
* [Console tools](https://github.com/lorien/web-scraping/blob/master/console_tools.md)
* [dhamaniasad / HeadlessBrowsers](https://github.com/dhamaniasad/HeadlessBrowsers) - a list of (almost) all headless web browsers in existence
* [DNS over HTTPS providers](https://github.com/curl/curl/wiki/DNS-over-HTTPS) - list of DNS over HTTPS providers

@@ -1,30 +0,0 @@
# Web-scraping Web Services
## Web-data Extracting Services
* [Dataflow kit](https://dataflowkit.com) - Turn website data into structured data with a simple point-and-click toolkit
* [ProxyCrawl](https://proxycrawl.com) - Crawl and scrape any website without blocks, captchas or proxies
* [ScraperAPI](https://www.scraperapi.com) - A service that manages proxies and headless browsers, exposing a single API endpoint to scrape any URL.
* [Scraping Fish](https://scrapingfish.com) - The simplest API for web scraping without getting blocked, powered by 4G/LTE proxy.
* [import.io](https://import.io/)
* [ScraperWiki](https://scraperwiki.com/about)
* [Mozenda](https://www.mozenda.com/)
* [PhantomJs.Cloud](https://phantomjscloud.com/)
* [CloudScrape](http://cloudscrape.com/)
* [DiffBot](http://www.diffbot.com/)
* [Apify](https://www.apify.com/) - A serverless web scraping, data extraction and web automation platform
* [Portia](http://scrapinghub.com/portia/); also on GitHub: [scrapinghub/portia](https://github.com/scrapinghub/portia)
* [Dexi](https://dexi.io)
* [Morph.io](https://morph.io) - free of charge, fully [open-source](https://github.com/openaustralia/morph) service
* [Page.REST](https://page.rest/)
* [ParseHub](https://www.parsehub.com/)
* [WrapAPI](https://wrapapi.com/)
* [Agenty](https://www.agenty.com/)
* [ScrapingBee](https://www.scrapingbee.com/) - A web scraping API that handles rotating proxies and headless browsers.
* [SerpApi](https://serpapi.com/) - Real-time API to access structured search results of search engines.
* [ScrapingAnt](https://scrapingant.com/) - Web Scraping API with thousands of residential proxies and headless Chrome cluster.
* [Zyte (formerly Scrapinghub)](https://www.zyte.com/) - Web data extraction services and platform, also lead maintainers of Scrapy.
* [ProxiesAPI](https://proxiesapi.com/) - Rotating proxies API with automatic retries, CAPTCHA handling and JavaScript rendering.
* [ZenRows](https://www.zenrows.com/) - Web Scraping API & proxy server that bypasses any anti-bot solution while offering JavaScript rendering, rotating proxies, and geotargeting.
* [Data Collector](https://brightdata.com/products/data-collector/) - Pre-made collectors with web unlocking features and a JavaScript IDE to build your own scraper.