1
0
mirror of https://github.com/lorien/awesome-web-scraping.git synced 2024-11-24 08:32:19 +02:00
List of libraries, tools and APIs for web scraping and data processing.
Go to file
2020-10-24 17:32:33 +07:00
.gitignore add html5-parser to python 2020-10-19 21:48:59 +03:00
console_tools.md Add item to Other Lists to Console Tools list 2019-11-12 00:55:55 +03:00
CONTRIBUTING.md New stuff 2015-08-13 02:49:22 +06:00
golang.md Removing some deleted golang libs 2020-09-02 22:00:47 +04:00
java.md Update java.md 2018-09-06 16:09:58 +03:00
javascript.md Added Playwright 2020-09-11 10:54:07 +05:45
LICENSE Initial commit 2015-08-13 02:27:49 +06:00
Makefile New stuff 2015-08-13 02:49:22 +06:00
perl.md Update perl.md 2018-09-06 16:01:37 +03:00
php.md diffbot-client has been deprecated and hasn't had any updates since 2018 2020-09-23 13:29:24 +04:00
python.md Add Gerapy 2020-10-24 17:32:33 +07:00
README.md Update README.md 2020-09-11 20:42:12 +03:00
ruby.md Add arachnid2 to Ruby scrapers 2020-06-25 16:52:15 +01:00
web_services.md Move Scrapingbee to the bottom of the list 2020-09-22 16:27:44 +02:00

Awesome Web Scraping

The list of tools, programming libraries and web services used in web scraping and data processing.

Web scraping chats: @grablab (English) and @grablab_ru (Russian)

Programming Libraries

Other Things

Captcha Solving Services

Contributing

See Contributing document.

Credits

The list is based initially on some data from these sources awesome-python, awesome-php, awesome-ruby, ruby-nlp, awesome-javascript