mirror of
https://github.com/lorien/awesome-web-scraping.git
synced 2024-12-04 10:24:43 +02:00
2.9 KiB
2.9 KiB
Java Web Scraping
This list contains Java libraries related to web scraping and data processing
- FooLanguage Web Scraping
- Network
- Web-scraping Frameworks
- HTML/XML Parsing
- Text processing
- Specific Formats Processing
- Natural Language Processing
- Browser automation and emulation
- Multiprocessing
- Queue
- URL and Network Address Manipulation
- Web Content Extracting
- Asynchronous
- WebSocket
- DNS Resolving
- Computer Vision
- Proxy Server
- Other FooLanguage Lists
Network
- General
- Asynchronous
Web-Scraping Frameworks
-
Full Featured Crawlers
-
Other
HTML/XML Parsing
Text Processing
Libraries for parsing and manipulating plain texts.
- General
Specific Formats Processing
Libraries for parsing and manipulating specific text formats.
-
General
-
Something
- TODO
Natural Language Processing
Libraries for working with human languages.
Browser automation and emulation
Multiprocessing
- TODO
Asynchronous
Libraries for asynchronous networking programming.
- TODO
Queue
- TODO
Libraries for parsing email.
- TODO
URL and Network Address Manipulation
Libraries for parsing/modifying URLs and network addresses.
- URL
- TODO
- Network Address
- TODO
Web Content Extracting
Libraries for extracting web contents.
- Text and Meta Data from HTML pages
WebSocket
Libraries for working with WebSocket.
- TODO
DNS Resolving
Computer Vision
- TODO
Proxy Server
- TODO
Other FooLanguage lists
- TODO