1
0
mirror of https://github.com/lorien/awesome-web-scraping.git synced 2024-11-28 08:48:58 +02:00
awesome-web-scraping/console_tools.md
2015-08-15 01:15:19 +05:00

1.3 KiB

Console Tools

Console tools related to web scraping and data processing

HTTP Clients

  • curl - supports SSL certificates, HTTP POST, HTTP PUT, FTP uploading, HTTP form based upload, proxies, HTTP/2, cookies, user+password authentication (Basic, Plain, Digest, CRAM-MD5, NTLM, Negotiate and Kerberos), file transfer resume, proxy tunneling and more.
  • httpie - sending arbitrary HTTP requests using a simple and natural syntax, and displays colorized output. HTTPie can be used for testing, debugging, and generally interacting with HTTP servers.
  • wget - package for retrieving files using HTTP, HTTPS and FTP, the most widely-used Internet protocols. It is a non-interactive commandline tool, so it may easily be called from scripts, cron jobs, terminals without X-Windows support, etc.

Specific Formats Processing

  • Office
    • unoconv - Convert between any document format supported by LibreOffice/OpenOffice.
  • CSV
    • csvkit - Utilities for converting to and working with CSV.