Mirror of https://github.com/lorien/awesome-web-scraping.git, synced 2024-11-19 17:12:01 +02:00.
List of libraries, tools and APIs for web scraping and data processing.
Awesome Web Scraping
Lists of packages, services and manuals related to web scraping.
Topics
- Python - Python packages
- PHP - PHP packages
- Ruby - Ruby packages
- JavaScript - JavaScript packages
- Go - Go packages
- Command Line Tools - tools with a command line interface
- Web Scraping Manuals - list of articles and books teaching web scraping
- dhamaniasad / HeadlessBrowsers - list of (almost) all headless web browsers in existence
- DNS over HTTPS providers - list of DNS over HTTPS providers
- Awesome Pastebins - list of pastebin sites
Captcha Solving Services
Proxy Server Marketplaces
Telegram Discussion Groups
- @grablab - talks in English
- @grablab_ru - talks in Russian
How to Contribute to This List
See the Contributing guide.
Credits
This list was initially based on data from these sources: awesome-python, awesome-php, awesome-ruby, ruby-nlp, and awesome-javascript.