From 0017663d493c6fdd069f3b9c68c3953d7dff0b76 Mon Sep 17 00:00:00 2001 From: Some User Date: Sat, 31 Dec 2022 16:59:15 +0300 Subject: [PATCH] Remove web services document --- README.md | 1 - web_services.md | 30 ------------------------------ 2 files changed, 31 deletions(-) delete mode 100644 web_services.md diff --git a/README.md b/README.md index 1d34582..ef1c98d 100644 --- a/README.md +++ b/README.md @@ -15,7 +15,6 @@ Feel free to give feedback or ask web scraping questions in Telegram groups: [@ ## Other Things * [Learning Web Scraping](https://github.com/lorien/learning-web-scraping) - list of articles and books teaching web scraping -* [Web Scraping Services](https://github.com/lorien/web-scraping/blob/master/web_services.md) * [Console tools](https://github.com/lorien/web-scraping/blob/master/console_tools.md) * [dhamaniasad / HeadlessBrowsers](https://github.com/dhamaniasad/HeadlessBrowsers) - a list of (almost) all headless web browsers in existence * [DNS over HTTPS providers](https://github.com/curl/curl/wiki/DNS-over-HTTPS) - list of DNS over HTTPs providers diff --git a/web_services.md b/web_services.md deleted file mode 100644 index 0410bef..0000000 --- a/web_services.md +++ /dev/null @@ -1,30 +0,0 @@ -# Web-scraping Web Services - -## Web-data Extracting Services - - * [Dataflow kit](https://dataflowkit.com) - Turn websites data into structured data with a simple point-and-click toolkit - * [ProxyCrawl](https://proxycrawl.com) - Crawl and scrape any website without blocks, captchas or proxies - * [ScraperAPI](https://www.scraperapi.com) - A service that manages proxies - and headless browsers, exposing a single API endpoint to scrape any url. - * [Scraping Fish](https://scrapingfish.com) - The simplest API for web scraping without getting blocked powered by 4G/LTE proxy. - * [import.io](https://import.io/) - * [ScraperWiki](https://scraperwiki.com/about) - * [Mozenda](https://www.mozenda.com/) - * [PhantomJs.Cloud](https://phantomjscloud.com/) - * [CloudScrape](http://cloudscrape.com/) - * [DiffBot](http://www.diffbot.com/) - * [Apify](https://www.apify.com/) - A serverless web scraping, data extraction and web automation platform - * [Portia](http://scrapinghub.com/portia/); also on GitHub: [scrapinghub/portia](https://github.com/scrapinghub/portia) - * [Dexi](https://dexi.io) - * [Morph.io](https://morph.io) free of charge, fully [open-source](https://github.com/openaustralia/morph) service - * [Page.REST](https://page.rest/) - * [ParseHub](https://www.parsehub.com/) - * [WrapAPI](https://wrapapi.com/) - * [Agenty](https://www.agenty.com/) - * [ScrapingBee](https://www.scrapingbee.com/) - A web scraping API that handles rotating proxies and headless browsers. - * [SerpApi](https://serpapi.com/) - Real-time API to access structured search results of search engines. - * [ScrapingAnt](https://scrapingant.com/) - Web Scraping API with thousands of residential proxies and headless Chrome cluster. - * [Zyte (formerly Scrapinghub)](https://www.zyte.com/) - Web data extraction services and platform, also lead maintainers of Scrapy. - * [ProxiesAPI](https://proxiesapi.com/) - Rotating proxies API with automatic retries, CAPTCHA handling and javacript rendering. - * [ZenRows](https://www.zenrows.com/) - Web Scraping API & proxy server that bypasses any anti-bot solution while offering javascript rendering, rotating proxies, and geotargeting. - * [Data Collector](https://brightdata.com/products/data-collector/) - Pre-made collectors with web unlocking features and a JavaScript IDE to build your own scraper.