mirror of
https://github.com/lorien/awesome-web-scraping.git
synced 2024-11-30 08:57:19 +02:00
Remove web services document
This commit is contained in:
parent
3e9d7654b3
commit
0017663d49
@ -15,7 +15,6 @@ Feel free to give feedback or ask web scraping questions in Telegram groups: [@
|
||||
## Other Things
|
||||
|
||||
* [Learning Web Scraping](https://github.com/lorien/learning-web-scraping) - list of articles and books teaching web scraping
|
||||
* [Web Scraping Services](https://github.com/lorien/web-scraping/blob/master/web_services.md)
|
||||
* [Console tools](https://github.com/lorien/web-scraping/blob/master/console_tools.md)
|
||||
* [dhamaniasad / HeadlessBrowsers](https://github.com/dhamaniasad/HeadlessBrowsers) - a list of (almost) all headless web browsers in existence
|
||||
* [DNS over HTTPS providers](https://github.com/curl/curl/wiki/DNS-over-HTTPS) - list of DNS over HTTPs providers
|
||||
|
@ -1,30 +0,0 @@
|
||||
# Web-scraping Web Services
|
||||
|
||||
## Web-data Extracting Services
|
||||
|
||||
* [Dataflow kit](https://dataflowkit.com) - Turn websites data into structured data with a simple point-and-click toolkit
|
||||
* [ProxyCrawl](https://proxycrawl.com) - Crawl and scrape any website without blocks, captchas or proxies
|
||||
* [ScraperAPI](https://www.scraperapi.com) - A service that manages proxies
|
||||
and headless browsers, exposing a single API endpoint to scrape any url.
|
||||
* [Scraping Fish](https://scrapingfish.com) - The simplest API for web scraping without getting blocked powered by 4G/LTE proxy.
|
||||
* [import.io](https://import.io/)
|
||||
* [ScraperWiki](https://scraperwiki.com/about)
|
||||
* [Mozenda](https://www.mozenda.com/)
|
||||
* [PhantomJs.Cloud](https://phantomjscloud.com/)
|
||||
* [CloudScrape](http://cloudscrape.com/)
|
||||
* [DiffBot](http://www.diffbot.com/)
|
||||
* [Apify](https://www.apify.com/) - A serverless web scraping, data extraction and web automation platform
|
||||
* [Portia](http://scrapinghub.com/portia/); also on GitHub: [scrapinghub/portia](https://github.com/scrapinghub/portia)
|
||||
* [Dexi](https://dexi.io)
|
||||
* [Morph.io](https://morph.io) free of charge, fully [open-source](https://github.com/openaustralia/morph) service
|
||||
* [Page.REST](https://page.rest/)
|
||||
* [ParseHub](https://www.parsehub.com/)
|
||||
* [WrapAPI](https://wrapapi.com/)
|
||||
* [Agenty](https://www.agenty.com/)
|
||||
* [ScrapingBee](https://www.scrapingbee.com/) - A web scraping API that handles rotating proxies and headless browsers.
|
||||
* [SerpApi](https://serpapi.com/) - Real-time API to access structured search results of search engines.
|
||||
* [ScrapingAnt](https://scrapingant.com/) - Web Scraping API with thousands of residential proxies and headless Chrome cluster.
|
||||
* [Zyte (formerly Scrapinghub)](https://www.zyte.com/) - Web data extraction services and platform, also lead maintainers of Scrapy.
|
||||
* [ProxiesAPI](https://proxiesapi.com/) - Rotating proxies API with automatic retries, CAPTCHA handling and javacript rendering.
|
||||
* [ZenRows](https://www.zenrows.com/) - Web Scraping API & proxy server that bypasses any anti-bot solution while offering javascript rendering, rotating proxies, and geotargeting.
|
||||
* [Data Collector](https://brightdata.com/products/data-collector/) - Pre-made collectors with web unlocking features and a JavaScript IDE to build your own scraper.
|
Loading…
Reference in New Issue
Block a user