2022-11-14 17:50:01 +02:00
# Awesome Web Scraping
2017-03-21 18:19:25 +02:00
2023-10-30 13:50:38 +02:00
Lists of packages, services and manuals related to web scraping.
2020-05-12 23:42:55 +02:00
2023-10-30 13:46:28 +02:00
## Topics
2019-06-21 12:17:30 +02:00
2023-10-30 13:46:28 +02:00
* [Python ](https://github.com/lorien/web-scraping/blob/master/python.md ) - Python packages
* [PHP ](https://github.com/lorien/web-scraping/blob/master/php.md ) - PHP packages
* [Ruby ](https://github.com/lorien/web-scraping/blob/master/ruby.md ) - Ruby packages
* [JavaScript ](https://github.com/lorien/web-scraping/blob/master/javascript.md ) - JavaScript packages
* [Go ](https://github.com/lorien/web-scraping/blob/master/golang.md ) - Go packages
2024-10-15 14:29:22 +02:00
* [Command Line Tools ](https://github.com/lorien/web-scraping/blob/master/cli.md ) - tools with a command line interface
2023-08-07 09:25:07 +02:00
* [Web Scraping Manuals ](https://github.com/lorien/awesome-web-scraping/blob/master/manuals.md ) - list of articles and books teaching web scraping
2023-10-30 13:46:28 +02:00
* [dhamaniasad / HeadlessBrowsers ](https://github.com/dhamaniasad/HeadlessBrowsers ) - list of (almost) all headless web browsers in existence
2020-06-28 12:05:07 +02:00
* [DNS over HTTPS providers ](https://github.com/curl/curl/wiki/DNS-over-HTTPS ) - list of DNS over HTTPs providers
2024-06-08 21:07:44 +02:00
* [Awesome Pastebins ](https://github.com/lorien/awesome-pastebins ) - list of pastebin sites
2019-11-23 22:17:11 +02:00
2020-05-12 23:41:45 +02:00
## Captcha Solving Services
2023-10-30 13:47:11 +02:00
* [https://2captcha.com ](https://2captcha.com/?from=3019071 )
2020-05-12 23:41:45 +02:00
2022-12-31 15:53:12 +02:00
## Proxy Server Marketplaces
2015-08-12 22:49:22 +02:00
2022-12-31 15:53:12 +02:00
* https://www.blackhatworld.com/forums/proxies-for-sale.112/
* https://forum.antichat.com/forums/147/
2023-10-30 15:19:22 +02:00
## Telegram Discussion Groups
2024-06-08 15:11:55 +02:00
* [@grablab ](https://t.me/grablab ) - talks in English
* [@grablab_ru ](https://t.me/grablab_ru ) - talks in Russian
2023-10-30 15:19:22 +02:00
2023-10-30 13:50:38 +02:00
## How to Contribute to This List
2022-12-31 15:53:12 +02:00
See [Contributing ](https://github.com/lorien/web-scraping/blob/master/CONTRIBUTING.md ) guide.
2020-09-11 19:40:30 +02:00
## Credits
2019-11-22 03:34:56 +02:00
2019-11-23 22:15:31 +02:00
The list is based initially on some data from these sources [awesome-python ](https://github.com/vinta/awesome-python ), [awesome-php ](https://github.com/ziadoz/awesome-php ), [awesome-ruby ](https://github.com/markets/awesome-ruby ), [ruby-nlp ](https://github.com/diasks2/ruby-nlp ), [awesome-javascript ](https://github.com/sorrycc/awesome-javascript )