mirror of
https://github.com/lorien/awesome-web-scraping.git
synced 2024-11-28 08:48:58 +02:00
List of libraries, tools and APIs for web scraping and data processing.
captcha-bypasscaptcha-recaptchacrawlercrawlingcrawling-frameworkcrawling-pythoncrawling-toolscrapingscraping-frameworkscraping-pythonscraping-toolspiderweb-scrapingwebscraping
.gitignore | ||
console_tools.md | ||
CONTRIBUTING.md | ||
golang.md | ||
java.md | ||
javascript.md | ||
LICENSE | ||
Makefile | ||
manuals.md | ||
perl.md | ||
php.md | ||
python.md | ||
README.md | ||
ruby.md |
Awesome Web Scraping
The list of tools, programming libraries and web services used for web scraping and data processing.
Feel free to give feedback or ask web scraping questions in Telegram groups: @grablab (English) and @grablab_ru (Russian).
Programming Libraries
Other Things
- Web Scraping Manuals - list of articles and books teaching web scraping
- Console tools
- dhamaniasad / HeadlessBrowsers - a list of (almost) all headless web browsers in existence
- DNS over HTTPS providers - list of DNS over HTTPs providers
Captcha Solving Services
These two links point to same captcha service, it is just a different language versions
- https://2captcha.com - English UI language
- https://rucaptcha.com - Russian UI language
Proxy Server Marketplaces
Largest marketplaces in the world which contain offers from hundreds sellers and services:
How to Contribute to this List
See Contributing guide.
Credits
The list is based initially on some data from these sources awesome-python, awesome-php, awesome-ruby, ruby-nlp, awesome-javascript