mirror of https://github.com/MontFerret/ferret.git synced 2025-08-15 20:02:56 +02:00
Tim Voronov 38625ad059 Refactor collectors and sorters; introduce Transformers
Refactor `Collector` into `Transformer` for enhanced flexibility and modularity. Introduce `Sorter` and `MultiSorter` as specialized transformers to handle sorting operations. Streamline VM operations by replacing dataset-based methods with transformer logic. Add encoding/decoding utilities for multiple sorting directions. Optimize `Emit` logic and update related tests.
2025-06-06 16:34:14 -04:00

Ferret

Go Report Status Build Status Discord Chat Ferret release Apache-2.0 License

Try it! Docs CLI Test runner Web worker

What is it?

ferret is a web scraping system that aims to simplify data extraction from the web for UI testing, machine learning, analytics, and more.
ferret lets users focus on the data: its own declarative language abstracts away the technical details and complexity of the underlying technologies. It is designed to be portable, extensible, and fast.

Read the introductory blog post about Ferret here!

Features

  • Declarative language
  • Support for both static and dynamic web pages
  • Embeddable
  • Extensible

Documentation is available at our website.

Different languages

  • Ferret for Python: Pyfer