From 9a37c1adfbe520a9699af20d12e18d868127b4a8 Mon Sep 17 00:00:00 2001 From: Prayson Wilfred Daniel Date: Wed, 9 Sep 2020 12:09:32 +0200 Subject: [PATCH] Added Advertools advertools is added in Web Content Extraction --- python.md | 1 + 1 file changed, 1 insertion(+) diff --git a/python.md b/python.md index b2f2155..e66a42a 100644 --- a/python.md +++ b/python.md @@ -370,6 +370,7 @@ Libraries for extracting web contents. * [linkchecker](https://github.com/wummel/linkchecker) - check links in web documents or full websites * [python-sitemap](https://github.com/c4software/python-sitemap) - Mini website crawler to make sitemap from a website. * [trafilatura](https://github.com/adbar/trafilatura) - Fast extraction of main text and comments along with structure, conversion to TXT, CSV & XML. +* [advertools](https://github.com/eliasdabbas/advertools) - A customizable crawler to analyze SEO and content of pages and websites. ## WebSocket