ripgrep

mirror of https://github.com/BurntSushi/ripgrep.git synced 2025-06-04 05:57:39 +02:00

Author	SHA1	Message	Date
Andrew Gallant	81341702af	regex: push more pattern handling to matcher construction Previously, ripgrep core was responsible for escaping regex patterns and implementing the --line-regexp flag. This commit moves that responsibility down into the matchers such that ripgrep just needs to hand the patterns it gets off to the matcher builder. The builder will then take care of escaping and all that. This was done to make pattern construction completely owned by the matcher builders. With the arrival regex-automata, this means we can move to the HIR very quickly and then never move back to the concrete syntax. We can then build our regex directly from the HIR. This overall can save quite a bit of time, especially when searching for large dictionaries. We still aren't quite as fast as GNU grep when searching something on the scale of /usr/share/dict/words, but we are basically within spitting distance. Prior to this, we were about an order of magnitude slower. This architecture in particular lets us write a pretty simple fast path that avoids AST parsing and HIR translation entirely: the case where one is just searching for a literal. In that case, we can hand construct the HIR directly.	2023-07-05 14:04:29 -04:00
Andrew Gallant	1035f6b1ff	deps: initial migration steps to regex 1.9 This leaves the grep-regex crate in tatters. Pretty much the entire thing needs to be re-worked. The upshot is that it should result in some big simplifications. I hope. The idea here is to drop down and actually use regex-automata 0.3 instead of the regex crate itself.	2023-07-05 14:04:29 -04:00
Andrew Gallant	251376597f	deps: update minimum version of grep crate Ref #2516	2023-05-16 13:13:34 -04:00
Andrew Gallant	d58e9353fc	deps: update to grep 0.2.11	2023-01-05 09:13:47 -05:00
Andrew Gallant	0f61f08eb1	deps: update to ignore 0.4.19	2023-01-05 08:57:05 -05:00
Andrew Gallant	2e207833bc	deps: upgrade to jemallocator 0.5	2023-01-05 08:33:43 -05:00
Andrew Gallant	ac8fecbbf2	deps: upgrade bstr to 1.1	2023-01-05 08:21:15 -05:00
Andrew Gallant	28bff84a0a	deps: remove 'num_cpus' Now that std:🧵:available_parallelism is a thing, we no longer need num_cpus.	2023-01-05 08:15:09 -05:00
Alex Touchet	61101289fa	cargo: set rust-version This should hopefully make compilation errors from using an older-than-supported compiler more helpful. PR #2373	2022-12-21 07:37:09 -05:00
Andrew Gallant	af6b6c543b	13.0.0	2021-06-12 08:12:24 -04:00
Andrew Gallant	c8d8ab8ded	deps/grep: update minimal versions	2021-06-12 08:08:58 -04:00
Andrew Gallant	9efdbf74a1	deps/ignore: update minimal versions	2021-06-12 08:01:13 -04:00
Marco Ieni	0f502a9439	cargo: remove "readme" field It is apparently no longer required since a README.md file is automatically detected: https://doc.rust-lang.org/cargo/reference/manifest.html#the-readme-field Closes #1770	2021-05-31 21:51:18 -04:00
Varik Valefor	beda5f70dc	doc: improve wording This tightens up the wording in ripgrep's opening description. It's used in several places, so we update all of them. Closes #1881	2021-05-31 21:51:18 -04:00
Josh Soref	def993bad1	spelling: fix various misspellings These were found by the check spelling action[1] and reported here[2]. PR #1685 [1] - https://github.com/marketplace/actions/check-spelling [2] - `6f02d05671 (commitcomment-42625778)`	2020-09-22 10:29:16 -04:00
Andrew Gallant	7cb211378a	12.1.1	2020-05-29 09:26:47 -04:00
Andrew Gallant	a2f90747c9	core: update minimal dependency versions	2020-05-29 09:18:59 -04:00
Andrew Gallant	2658bd4e46	12.1.0	2020-05-09 11:13:33 -04:00
Andrew Gallant	72807462e8	deps: update minimal versions for dependencies	2020-05-09 10:39:43 -04:00
Andrew Gallant	1d5b1011e5	12.0.1	2020-03-29 18:59:40 -04:00
Andrew Gallant	655e33219a	crates.io: remove badges ... and don't replace them with anything because crates.io does not support GitHub Actions yet. But it's almost there: https://github.com/rust-lang/crates.io/pull/1838 Thanks @atouchet for noticing this.	2020-03-17 17:50:37 -04:00
Andrew Gallant	92daa34eb3	ripgrep: release 12.0.0	2020-03-15 21:42:54 -04:00
Andrew Gallant	52ec68799c	ci: make script names consistent	2020-03-15 21:06:45 -04:00
Andrew Gallant	fab5c812f3	tests: add debugging output The transient failures appear to be persisting and they are quite difficult to debug. So include a full directory listing in the output of every test failure.	2020-02-20 16:07:51 -05:00
Andrew Gallant	0874aa115c	repo: make ripgrep build with the new organization	2020-02-17 19:24:53 -05:00
Andrew Gallant	01eeec56bb	deb: fix fish completion install location It looks like `completions` is owned by Fish itself. Third party completions should go in `vendor_completions.d`. Fixes #1485	2020-02-17 17:16:28 -05:00
Andrew Gallant	e402d6c260	ripgrep: release 11.0.2	2019-08-01 18:02:15 -04:00
Andrew Gallant	b93762ea7a	bstr: update everything to bstr 0.2	2019-06-26 16:47:33 -04:00
Andrew Gallant	5ce2d7351d	ci: use cross for musl x86_64 builds This is necessary because jemalloc + musl + Ubuntu 16.04 is apparently broken. Moreover, jemalloc doesn't support i686, so we accept the performance regression there. See also: https://github.com/gnzlbg/jemallocator/issues/124	2019-04-25 11:12:14 -04:00
Andrew Gallant	03bf37ff4a	alloc: use jemalloc when building with musl It turns out that musl's allocator is slow enough to cause a fairly noticeable performance regression when ripgrep is built as a static binary with musl. We fix this by using jemalloc when building with musl. We continue to use the default system allocator in all other scenarios. Namely, glibc's allocator doesn't noticeably regress performance compared to jemalloc. But we could add more targets to this logic if other system allocators (macOS, Windows) prove to be slow. This wasn't necessary before because rustc recently stopped using jemalloc by default. Fixes #1268	2019-04-24 17:21:38 -04:00
Andrew Gallant	5f8805a496	ripgrep: release 11.0.1	2019-04-16 13:10:29 -04:00
Andrew Gallant	d7f57d9aab	ripgrep: release 11.0.0	2019-04-15 18:09:40 -04:00
Andrew Gallant	1a2a24ea74	grep: release 0.2.4	2019-04-15 18:03:46 -04:00
Andrew Gallant	8e8215aa65	ignore: release 0.4.7	2019-04-15 17:50:37 -04:00
Andrew Gallant	9b8f5cbaba	config: switch to using bstrs This lets us implement correct Unicode trimming and also simplifies the parsing logic a bit. This also removes the last platform specific bits of code in ripgrep core.	2019-04-05 23:24:08 -04:00
Andrew Gallant	7a6a40bae1	edition: move core ripgrep to Rust 2018	2019-01-19 10:44:30 -05:00
Andrew Gallant	968491f8e9	deps: update to bytecount 0.5 bytecount now uses runtime dispatch for enabling SIMD, which means we can no longer need the avx-accel features. We remove it from ripgrep since the next release will be a minor version bump, but leave them as no-ops for the crates that previously used it.	2019-01-19 10:44:30 -05:00
ykgmfq	184ee4c328	deb: add section info Put it in the same section as https://packages.debian.org/stretch/grep PR #1051	2018-09-13 08:17:24 -04:00
Andrew Gallant	eb18da0450	pcre2: use jit_if_available This will allow PCRE2 to fall back to non-JIT matching when running on platforms without JIT support. ref https://github.com/BurntSushi/rust-pcre2/issues/3	2018-09-08 17:12:14 -04:00
Andrew Gallant	b7a456ae83	deb: add completions This commit adds Bash, zsh and fish completions to the Debian binary package. Fixes #1032	2018-09-07 14:00:22 -04:00
Andrew Gallant	d14f0b37d6	deps: update versions for all crates I don't think every change here is needed, but this ensures we're using the latest version of every direct dependency.	2018-09-07 14:00:22 -04:00
Andrew Gallant	003c3695f4	deps: update grep version	2018-09-04 23:29:05 -04:00
Andrew Gallant	4846d63539	grep-cli: introduce new grep-cli crate This commit moves a lot of "utility" code from ripgrep core into grep-cli. Any one of these things might not be worth creating a new crate, but combining everything together results in a fair number of a convenience routines that make up a decent sized crate. There is potentially more we could move into the crate, but much of what remains in ripgrep core is almost entirely dealing with the number of flags we support. In the course of doing moving things to the grep-cli crate, we clean up a lot of gunk and improve failure modes in a number of cases. In particular, we've fixed a bug where other processes could deadlock if they write too much to stderr. Fixes #990	2018-09-04 23:18:55 -04:00
Andrew Gallant	05a0389555	ripgrep: use winapi-util for stdin_is_readable	2018-08-25 00:30:15 -04:00
Andrew Gallant	033ad2b8e4	deps: update clap Update clap to the latest version. Also, drop the ansi_term dependency by disabling color output in clap's error messages.	2018-08-21 23:10:34 -04:00
Andrew Gallant	5c80e4adb6	release: better support for binary Debian package This commit beefs up the package metadata used by the 'cargo deb' tool to produce a binary dpkg. In particular, we now include ripgrep's man page. This commit includes a new script, 'ci/build_deb.sh', which will handle the build process for a dpkg, which has become a bit more nuanced than just running 'cargo deb'. We don't (yet) run this script in CI. Fixes #842	2018-08-21 23:05:52 -04:00
Andrew Gallant	eb184d7711	tests: re-tool integration tests This basically rewrites every integration test. We reduce the amount of magic involved here in terms of which arguments are being passed to ripgrep processes. To make up for the boiler plate saved by the magic, we make the Dir (formerly WorkDir) type a bit nicer to use, along with a new TestCommand that wraps a std::process::Command. In exchange, we get tests that are easier to read and write. We also run every test with the `--pcre2` flag to make sure that works, when PCRE2 is available.	2018-08-20 07:10:19 -04:00
Andrew Gallant	bb110c1ebe	ripgrep: migrate to libripgrep This commit does the work to delete the old `grep` crate and effectively rewrite most of ripgrep core to use the new libripgrep crates. The new `grep` crate is now a facade that collects the various crates that make up libripgrep. The most complex part of ripgrep core is now arguably the translation between command line parameters and the library options, which is ultimately where we want to be.	2018-08-20 07:10:19 -04:00
Andrew Gallant	d9ca529356	libripgrep: initial commit introducing libripgrep libripgrep is not any one library, but rather, a collection of libraries that roughly separate the following key distinct phases in a grep implementation: 1. Pattern matching (e.g., by a regex engine). 2. Searching a file using a pattern matcher. 3. Printing results. Ultimately, both (1) and (3) are defined by de-coupled interfaces, of which there may be multiple implementations. Namely, (1) is satisfied by the `Matcher` trait in the `grep-matcher` crate and (3) is satisfied by the `Sink` trait in the `grep2` crate. The searcher (2) ties everything together and finds results using a matcher and reports those results using a `Sink` implementation. Closes #162	2018-08-20 07:10:19 -04:00
llogiq	ad9befbc1d	deps: update bytecount to 0.3.2 PR #1003	2018-08-06 06:44:16 -04:00

1 2 3 4

196 Commits