1
0
mirror of https://github.com/lorien/awesome-web-scraping.git synced 2025-02-21 19:06:39 +02:00

446 Commits

Author SHA1 Message Date
Gregory Petukhov
a16c8e7288
Update README.md 2019-11-23 23:15:31 +03:00
Gregory Petukhov
6f85d2c401
Update README.md 2019-11-23 23:14:38 +03:00
Gregory Petukhov
ff17e5e20f
Update README.md 2019-11-23 23:13:27 +03:00
Gregory Petukhov
68402d69e3
Add dnspython 2019-11-23 00:19:54 +03:00
Gregory Petukhov
14df577fd4
Update httplib2 description and link 2019-11-22 04:54:32 +03:00
Gregory Petukhov
6c66ce8086
Add ioweb to python web scraping frameworks 2019-11-22 04:48:28 +03:00
Gregory Petukhov
82e31e8e8f
Update README.md 2019-11-22 04:40:12 +03:00
Gregory Petukhov
5f3d09533e
Update front page 2019-11-22 04:34:56 +03:00
Gregory Petukhov
b1aed659bf
Add javascript engine bindings to python list 2019-11-22 01:55:05 +03:00
Gregory Petukhov
390d725b49
Add cloudscraper to python page 2019-11-22 01:43:29 +03:00
Gregory Petukhov
122cfd30e1
Add item to Other Lists to Console Tools list 2019-11-12 00:55:55 +03:00
Gregory Petukhov
18c5f9a3ce
Fix pycrumbs link 2019-11-08 13:51:43 +03:00
Gregory Petukhov
314fe87505
Merge pull request #96 from andriyor/add-python-user-agnt
add uap-python to Python User-Agent parser
2019-10-25 16:45:03 +03:00
Gregory Petukhov
4fbf8d0a7b
Fix pull request 2019-10-25 16:44:33 +03:00
Gregory Petukhov
ff40ff0e3f
Merge pull request #95 from andriyor/add-python-site-specific
add site specific scraper to Python
2019-10-25 16:40:43 +03:00
Gregory Petukhov
7d82ce1f0b
Merge branch 'master' into add-python-site-specific 2019-10-25 16:40:21 +03:00
Gregory Petukhov
bfc496cd8d
Minor fix 2019-10-25 16:37:20 +03:00
Gregory Petukhov
499aec1e94
Merge pull request #94 from andriyor/add-python-bookmarks-parser
add bookmarks-parser to Python Structured Formats
2019-10-25 16:33:56 +03:00
Gregory Petukhov
c9e10f2af4
Minor fix 2019-10-25 16:33:45 +03:00
Gregory Petukhov
969de5b449
Merge pull request #93 from andriyor/add-javascript-node-bookmarks-parser
add node-bookmarks-parser to JavaScript Specific Formats Processing
2019-10-25 16:29:47 +03:00
Gregory Petukhov
acf45daba5
Fix indentation 2019-10-25 16:29:35 +03:00
Gregory Petukhov
0a2c17a886
Merge pull request #92 from BurnzZ/add-more-tools
add more tools from Scrapinghub
2019-10-25 16:27:26 +03:00
Gregory Petukhov
49ea262e03
Some refactoring 2019-10-25 16:26:49 +03:00
Gregory Petukhov
dba8dc29a0
Merge pull request #91 from musabgultekin/patch-2
Add geziyor scraper & crawler
2019-10-25 16:16:42 +03:00
Andriy Orehov
dad7df7886 add uap-python to Python User-Agent 2019-10-20 17:25:55 +03:00
Andriy Orehov
42c996d117 add site specific scraper to Python 2019-10-20 15:35:47 +03:00
Andriy Orehov
988713933c add bookmarks-parser to Python Structured Formats 2019-10-20 15:23:33 +03:00
Andriy Orehov
75a60d9dfb add node-bookmarks-parser to JavaScript Parses Firefox/Chrome HTML bookmarks files 2019-10-20 15:11:32 +03:00
Kevin Lloyd Bernal
753f802aa3 add more tools from Scrapinghub 2019-10-19 21:11:35 +08:00
Musab Gültekin
c758e4584c
Add geziyor scraper&crawler
Geziyor is full-featured fast web scraping framework that supports JS rendering.
2019-09-11 18:54:18 +03:00
Gregory Petukhov
5aa61f96db
Merge pull request #86 from zisismaras/master
add Ayakashi javascript framework
2019-07-10 23:10:51 +03:00
Gregory Petukhov
ff29101afa
Add mistletoe to python 2019-07-10 22:41:47 +03:00
Gregory Petukhov
2d1a24681d
Fix TOC in py 2019-07-09 03:20:42 +03:00
Gregory Petukhov
0a41356646
Add ruia 2019-07-09 03:19:59 +03:00
Gregory Petukhov
ba31013628 Fix errors in python TOC 2019-07-09 03:09:18 +03:00
Gregory Petukhov
631f174466 Refactor markup 2019-07-09 03:04:16 +03:00
Gregory Petukhov
7b4fd573b7
Add serialization section. Add ujson. 2019-07-07 17:43:26 +03:00
Gregory Petukhov
76fa0e06b3
Add python tlslite-ng 2019-07-07 17:33:37 +03:00
Gregory Petukhov
996fc16564
Update pyyaml link 2019-07-07 17:32:23 +03:00
Gregory Petukhov
03b46b6f9c
Rename Queue to Job Queue. Add new section Message Queue. Add kombu library. 2019-07-07 17:30:12 +03:00
Gregory Petukhov
d9e46d4566
Add pyOpenSSL 2019-07-07 17:21:21 +03:00
Gregory Petukhov
986ad7bd29
add python dpkt 2019-07-07 17:18:53 +03:00
Gregory Petukhov
234e7bafa9
Change section for pyppeteer 2019-07-07 17:15:41 +03:00
Gregory Petukhov
5db0d85f96
Add python dateutil 2019-07-07 17:12:40 +03:00
Gregory Petukhov
5d96ccbe4a
Add python-whois 2019-07-07 17:08:15 +03:00
Gregory Petukhov
291f215293
Update README.md 2019-06-21 13:17:30 +03:00
Gregory Petukhov
78e1051c18
Add proxychains to console tools 2019-06-21 01:03:26 +03:00
Gregory Petukhov
76285d6e97
Update proxy_services.md 2019-06-13 17:54:06 +03:00
Gregory Petukhov
8531daf9b9
Update proxy_services.md 2019-06-01 19:30:21 +06:00
Zisis Maras
d0a1aca404 add Ayakashi javascript framework 2019-05-24 20:08:25 +03:00