1
0
mirror of https://github.com/lorien/awesome-web-scraping.git synced 2025-01-05 22:53:47 +02:00
List of libraries, tools and APIs for web scraping and data processing.
Go to file
2019-12-25 08:16:09 +03:00
.gitignore Refactor markup 2019-07-09 03:04:16 +03:00
captcha_solving_services.md Create captcha_solving_services.md 2019-01-28 13:28:15 +03:00
console_tools.md Add item to Other Lists to Console Tools list 2019-11-12 00:55:55 +03:00
CONTRIBUTING.md New stuff 2015-08-13 02:49:22 +06:00
golang.md Add geziyor scraper&crawler 2019-09-11 18:54:18 +03:00
java.md Update java.md 2018-09-06 16:09:58 +03:00
javascript.md Merge pull request #93 from andriyor/add-javascript-node-bookmarks-parser 2019-10-25 16:29:47 +03:00
LICENSE Initial commit 2015-08-13 02:27:49 +06:00
Makefile New stuff 2015-08-13 02:49:22 +06:00
new_language_template.md Create new_language_template.md 2015-08-21 21:17:28 +05:00
perl.md Update perl.md 2018-09-06 16:01:37 +03:00
php.md Change category for oscarotero/Embed project 2018-10-18 15:24:04 +03:00
proxy_services.md Update proxy_services.md 2019-06-13 17:54:06 +03:00
python.md Add python:httptools 2019-12-25 08:16:09 +03:00
README.md Update README.md 2019-12-16 22:48:57 +03:00
ruby.md added kimuraframework 2018-12-21 00:48:14 +03:00
web_services.md Merge pull request #77 from jancurn/master 2019-01-28 13:58:48 +03:00

Awesome Web Scraping

The list of tools, programming libraries and web services used in web scraping.

Programming Libraries

Web Services

Other Things

Other Lists

  • HeadlessBrowsers - a list of (almost) all headless web browsers in existence
  • awesome-python-dev - a list of tools for debugging, profiling and analyzing python programs.

About awesome-web-scraping

Make this list better! Your contributions are always welcome! See contributing how-to. To add new language to Programming Libraries section use new_language_template as starting point.

Feel free to share feedback in telegram chats about web scraping: @grablab (English) and @grablab_ru (Russian)

The list is based initially on some data from these sources awesome-python, awesome-php, awesome-ruby, ruby-nlp, awesome-javascript