mirror of
https://github.com/vimagick/dockerfiles.git
synced 2024-12-25 02:04:06 +02:00
.. | ||
Dockerfile | ||
Dockerfile.old | ||
README.md |
WHAT-IS
Scrapy
: An open source and collaborative framework for extracting the data
you need from websites. In a fast, simple, yet extensible way.
This image is based on debian:jessie
, and it only takes 278.6 MB.
You can create a scrapy (v0.24.6) project on top of this image.
HOW-TO
$ docker run --name scrapy -it vimagick/scrapy
>>> scrapy startproject demo
>>> cd demo
>>> scrapy genspider example example.com
>>> scrapy edit example
>>> scrapy crawl example
TODO-LIST
- build libxml2/libxslt from source
- add scrapy_bash_completion script