Mirror of https://github.com/lorien/awesome-web-scraping.git (synced 2024-11-28 08:48:58 +02:00)
List of libraries, tools and APIs for web scraping and data processing.
captcha-bypass, captcha-recaptcha, crawler, crawling, crawling-framework, crawling-python, crawling-tool, scraping, scraping-framework, scraping-python, scraping-tool, spider, web-scraping, webscraping
.gitignore
captcha_solving_services.md
console_tools.md
CONTRIBUTING.md
golang.md
java.md
javascript.md
LICENSE
Makefile
new_language_template.md
perl.md
php.md
proxy_services.md
python.md
README.md
ruby.md
web_services.md
Awesome Web Scraping
A list of tools, programming libraries, and web services used in web scraping.
Programming Libraries
Web Services
Other Things
Other Lists
- HeadlessBrowsers - a list of (almost) all headless web browsers in existence
- awesome-python-dev - a list of tools for debugging, profiling and analyzing Python programs
About awesome-web-scraping
Make this list better! Your contributions are always welcome! See the contributing how-to. To add a new language to the Programming Libraries section, use new_language_template as a starting point.
Feel free to share feedback in the Telegram chats about web scraping: @grablab (English) and @grablab_ru (Russian).
The list was initially based on data from these sources: awesome-python, awesome-php, awesome-ruby, ruby-nlp, and awesome-javascript.