Merge pull request #126 from ray-102/master

Added Extractnet an ML based crawler in Python
2025-02-21 19:06:39 +02:00 · 2021-07-24 21:29:06 +03:00 · 2021-07-24 21:29:06 +03:00 · 44e7083e3a
commit 44e7083e3a
parent 06162ffbb6 d36d03b647
1 changed files with 1 additions and 0 deletions
--- a/python.md
+++ b/python.md
@ -385,6 +385,7 @@ Libraries for extracting web contents.
 * [trafilatura](https://github.com/adbar/trafilatura) - Fast extraction of main text and comments along with structure, conversion to TXT, CSV & XML.
 * [advertools](https://github.com/eliasdabbas/advertools) - A customizable crawler to analyze SEO and content of pages and websites.
 * [photon](https://github.com/s0md3v/Photon) - Incredibly fast crawler designed for OSINT
+* [extractnet](https://github.com/currentsapi/extractnet) - Machine Learning based content and metadata extraction in Python 3

 ## WebSocket