1
0
mirror of https://github.com/lorien/awesome-web-scraping.git synced 2024-11-28 08:48:58 +02:00
awesome-web-scraping/java.md
2017-10-19 13:27:33 -04:00

3.0 KiB

Java Web Scraping

This list contains Java libraries related to web scraping and data processing

Network

Web-Scraping Frameworks

HTML/XML Parsing

Text Processing

Libraries for parsing and manipulating plain texts.

Specific Formats Processing

Libraries for parsing and manipulating specific text formats.

Natural Language Processing

Libraries for working with human languages.

Browser automation and emulation

Multiprocessing

  • TODO

Asynchronous

Libraries for asynchronous networking programming.

  • TODO

Queue

  • TODO

Email

Libraries for parsing email.

  • TODO

URL and Network Address Manipulation

Libraries for parsing/modifying URLs and network addresses.

  • URL
    • TODO
  • Network Address
    • TODO

Web Content Extracting

Libraries for extracting web contents.

WebSocket

Libraries for working with WebSocket.

  • TODO

DNS Resolving

Computer Vision

  • TODO

Proxy Server

  • TODO

Other FooLanguage lists

  • TODO