Mirror of https://github.com/MontFerret/ferret.git, synced 2025-08-15 20:02:56 +02:00
38625ad059f9b153d03a522a688fc104d1b351b3
Refactor `Collector` into `Transformer` for enhanced flexibility and modularity. Introduce `Sorter` and `MultiSorter` as specialized transformers to handle sorting operations. Streamline VM operations by replacing dataset-based methods with transformer logic. Add encoding/decoding utilities for multiple sorting directions. Optimize `Emit` logic and update related tests.
Ferret
What is it?
ferret is a web scraping system. It aims to simplify data extraction from the web for UI testing, machine learning, analytics, and more.
ferret allows users to focus on the data. It abstracts away the technical details and complexity of underlying technologies using its own declarative language.
It is extremely portable, extensible, and fast.
Read the introductory blog post about Ferret here!
Features
- Declarative language
- Support of both static and dynamic web pages
- Embeddable
- Extensible
Documentation is available at our website.
Different languages
- Ferret for Python: Pyfer
Languages
- Go: 83.9%
- HTML: 14.2%
- JavaScript: 1.4%
- ANTLR: 0.4%