mirror of
https://github.com/lorien/awesome-web-scraping.git
synced 2024-11-28 08:48:58 +02:00
List of libraries, tools and APIs for web scraping and data processing.
captcha-bypasscaptcha-recaptchacrawlercrawlingcrawling-frameworkcrawling-pythoncrawling-toolscrapingscraping-frameworkscraping-pythonscraping-toolspiderweb-scrapingwebscraping
.gitignore | ||
console_tools.md | ||
CONTRIBUTING.md | ||
golang.md | ||
java.md | ||
javascript.md | ||
LICENSE | ||
Makefile | ||
perl.md | ||
php.md | ||
python.md | ||
README.md | ||
ruby.md | ||
web_services.md |
Awesome Web Scraping
The list of tools, programming libraries and web services used in web scraping and data processing.
Web scraping chats: @grablab (English) and @grablab_ru (Russian)
Programming Libraries
Other Things
- Web Scraping Services
- Console tools
- dhamaniasad / HeadlessBrowsers - a list of (almost) all headless web browsers in existence
- DNS over HTTPS providers - list of DNS over HTTPs providers
Captcha Solving Services
These two links points to same 2captcha/rucaptcha services, it is just a different language versions
- 2captcha.com - English UI language
- rucaptcha.com - Russian UI language
Contributing
See Contributing document.
Credits
The list is based initially on some data from these sources awesome-python, awesome-php, awesome-ruby, ruby-nlp, awesome-javascript