ripgrep

mirror of https://github.com/BurntSushi/ripgrep.git synced 2025-11-23 21:54:45 +02:00

Files

Andrew Gallant 6cdb99ea61 deps: drop bytecount in favor of memchr_iter(..).count()

As of the memchr 2.6 release, its Iterator::count method is specialized
to only count the number of occurrences instead of finding the offset of
each occurrence. This replaces ripgrep's use of the bytecount crate.
While micro-benchmarks suggest that memchr's method has better
throughput than bytecount, it turned out to be an illusion. Namely, on a
~13GB haystack prior to this change:

    $ time rg-bytecount 'You killed my friend, my best friend, my lifelong friend!' OpenSubtitles2018.raw.en --line-number
    441450441:- You killed my friend, my best friend, my lifelong friend!

    real    1.473
    user    1.186
    sys     0.286
    maxmem  12512 MB
    faults  0

And then after:

    $ time rg 'You killed my friend, my best friend, my lifelong friend!' OpenSubtitles2018.raw.en --line-number
    441450441:- You killed my friend, my best friend, my lifelong friend!

    real    1.532
    user    1.280
    sys     0.250
    maxmem  12512 MB
    faults  0

But perf is just about in the same ballpark. That's good enough for me
at the moment in order to drop the extra dependency.

I did this because the marginal cost of adding the Iterator::count()
specialization to memchr was extremely small.

2023-09-02 12:25:34 -04:00

examples

edition: run 'cargo fix --edition --edition-idioms --all'

2021-06-01 21:07:37 -04:00

src

deps: drop bytecount in favor of memchr_iter(..).count()

2023-09-02 12:25:34 -04:00

Cargo.toml

deps: drop bytecount in favor of memchr_iter(..).count()

2023-09-02 12:25:34 -04:00

LICENSE-MIT

repo: move all source code in crates directory

2020-02-17 19:24:53 -05:00

README.md

edition: manual changes

2021-06-01 21:07:37 -04:00

UNLICENSE

repo: move all source code in crates directory

2020-02-17 19:24:53 -05:00

README.md

grep-searcher

A high level library for executing fast line oriented searches. This handles things like reporting contextual lines, counting lines, inverting a search, detecting binary data, automatic UTF-16 transcoding and deciding whether or not to use memory maps.

Dual-licensed under MIT or the UNLICENSE.

Documentation

https://docs.rs/grep-searcher

NOTE: You probably don't want to use this crate directly. Instead, you should prefer the facade defined in the grep crate.

Usage

Add this to your Cargo.toml:

[dependencies]
grep-searcher = "0.1"