1
0
mirror of https://github.com/lorien/awesome-web-scraping.git synced 2025-02-21 19:06:39 +02:00

245 Commits

Author SHA1 Message Date
lorien
98061ab146
Merge pull request #38 from mypolat/patch-2
Fixed Browsers/Selenium URL
2017-12-13 16:40:33 +03:00
lorien
8c8e0fc19a
Merge pull request #39 from kolarski/patch-2
Fix CSS parse categogy
2017-12-13 16:39:49 +03:00
lorien
af2d34e22b
Merge pull request #40 from sp1thas/master
add grequests
2017-12-13 16:38:45 +03:00
lorien
e430a6e3c0
Merge pull request #42 from vladyslavstartsev/patch-2
Added new HTML parsing library
2017-12-13 16:38:16 +03:00
lorien
8c4f8cac7a
Merge pull request #43 from aecio/patch-2
Added Java language
2017-12-13 16:32:44 +03:00
lorien
7f868e691d
Merge pull request #44 from wipeover/master
add colly scraper framework
2017-12-13 16:32:11 +03:00
lorien
e4e2b74de6
Merge pull request #45 from rushter/new-lib
Add selectolax HTML parser
2017-12-06 16:06:11 +03:00
rushter
136e0230cf Add selectolax 2017-12-02 19:48:49 +03:00
wipeover
fa9b0b1ad8
add colly scraper framework 2017-11-15 19:58:47 +01:00
Aécio Santos
cfed3bd430 Added Java language 2017-10-19 13:27:33 -04:00
vladyslavstartsev
2da3297543 Added new HTML parsing library 2017-10-08 01:13:38 +03:00
Simakis Panagiotis
9852c8b184 add grequests 2017-10-01 18:12:38 +03:00
Alex Kolarski
b43667a9b1 Fix CSS parse categogy 2017-09-25 17:05:26 +03:00
lorien
d963bb28e9 Update README.md 2017-09-04 01:23:36 +03:00
Mehmet Yüksel POLAT
8c86fbf913 Fixed Browsers/Selenium URL
The previous link (http://selenium.googlecode.com/git/docs/api/py/api.html) is broken. 
I changed it with a new link (http://selenium-python.readthedocs.io/).
2017-06-06 18:07:36 +03:00
ahivert
c680d1fa0e add chopper tool 2017-06-01 20:00:04 +02:00
Gregory Petukhov
4b260857e2 Merge pull request #36 from faolisilva/patch-2
Adding Learning Scrapy Book to collection
2017-05-29 16:19:41 +03:00
Fabio Oliveira Silva
c2414c4b51 Adding Learning Scrapy Book to collection
Adding Learning Scrapy Book to collection
2017-05-25 15:43:44 -03:00
Gregory Petukhov
f3d17d1c6d Merge pull request #35 from gingerhot/master
add more packages to list by copy from github.com/avelino/awesome-go
2017-05-20 15:18:54 +06:00
B1nj0y
2f88bf3a59 add more packages to list by copy from github.com/avelino/awesome-go 2017-05-16 00:42:40 +08:00
Gregory Petukhov
5f4efe9147 Merge pull request #34 from gingerhot/master
add Golang to the list
2017-05-14 21:58:52 +06:00
Gregory Petukhov
3fc48cb680 Merge pull request #33 from sonicdes/patch-2
Indentation fixes in ruby.md
2017-05-14 21:58:20 +06:00
B1nj0y
56ab7d57e3 add golang.md 2017-05-07 13:34:47 +08:00
Denis Sadomowski
2b588f87ae Indentation fixes in ruby.md
A couple of Markdown indentation fixes.
2017-04-28 18:54:28 +03:00
Gregory Petukhov
3ae4696044 Update README.md 2017-03-21 23:19:25 +07:00
Gregory Petukhov
2f587fc4f3 Update README.md 2017-03-21 23:08:27 +07:00
Gregory Petukhov
6399d6909f Merge pull request #32 from cyriac/cyriac-patch-1
Added hodor to HTML/XML Parsing
2017-01-12 22:45:07 +07:00
Cyriac Thomas
faacd85957 moving to HTML/XML parsing - General section 2017-01-12 21:11:58 +05:30
Cyriac Thomas
b9688c8f79 Added hodor 2017-01-12 20:33:03 +05:30
Gregory Petukhov
70db5d3a00 user_agent lib in python section 2016-12-02 11:18:38 +07:00
Gregory Petukhov
d0cc71166e Merge pull request #31 from pcinkh/fake-useragent
Fake-useragent update.
2016-12-02 11:15:14 +07:00
pcinkh
2930bb3e7e Fake-useragent update. 2016-11-28 09:55:35 +02:00
Gregory Petukhov
cea30665a1 Update proxy_services.md 2016-11-17 22:37:01 +03:00
Gregory Petukhov
d0a979e135 Update proxy_services.md 2016-11-17 22:36:34 +03:00
Gregory Petukhov
5416c3e002 Update proxy_services.md 2016-11-13 15:08:32 +03:00
Gregory Petukhov
bf32bfa940 Update books.md 2016-11-13 00:39:19 +03:00
Gregory Petukhov
a48ad842b6 Merge pull request #28 from FarhadurFahim/patch-2
Update Scrapy
2016-11-03 02:12:53 +03:00
Gregory Petukhov
dd9f7a88a9 Merge pull request #27 from mutewinter/patch-2
fix parenthesis typos in markdown links
2016-11-03 02:12:39 +03:00
Gregory Petukhov
7a6f214ca6 Merge pull request #26 from dalleng/patch-2
Update scrapy entry
2016-11-03 02:12:13 +03:00
Gregory Petukhov
e29fef43c5 Merge pull request #22 from barrycarton/patch-2
Kimono has discontinued it's service
2016-11-03 02:11:56 +03:00
Gregory Petukhov
e2dfb60d1c Merge pull request #21 from olegykz/patch-2
Faraday duplication removed
2016-11-03 02:11:13 +03:00
Gregory Petukhov
3a5861e556 Merge pull request #29 from garrylachman/patch-2
Update proxy_services.md
2016-11-03 02:10:25 +03:00
Garry L
1dfc5db407 Update proxy_services.md 2016-10-29 02:42:11 +03:00
Fahim
e2378eaa66 Update Scrapy
I have removed "Does not support Python3" from Scrapy hints in Web-Scraping Frameworks.
Because latest scrapy 1.2 runs on Python 2.7 and Python 3.3 or above.
2016-10-23 20:22:02 +06:00
Jeremy Mack
04e33b991d fix duplicate bracket 2016-08-17 09:38:04 -04:00
Jeremy Mack
7a020dfa5a fix parenthesis typos in markdown links 2016-08-17 09:36:16 -04:00
Diego Allen
bd61c29aef Update scrapy entry
scrapy does support python 3 currently
2016-08-09 08:40:34 -04:00
Gregory Petukhov
89530093b4 Update python.md 2016-05-21 15:09:41 +06:00
Gregory Petukhov
45569ea7d6 Merge pull request #24 from APIs-guru/master
Add morph.io
2016-04-20 17:02:54 +03:00
Ivan Goncharov
7d47135d6a Add morph.io 2016-04-20 16:41:42 +03:00