ripgrep

mirror of https://github.com/BurntSushi/ripgrep.git synced 2025-08-04 21:52:54 +02:00

Author	SHA1	Message	Date
Colin Heffernan	2d763a9a1b	ignore/types: add `*.svelte.ts` to Svelte file type glob I was somewhat unsure about adding this, since `.svelte.ts` seems primarily like a TypeScript file and it could be surprising to show up in a search for Svelte files. In particular, ripgrep doesn't know how to only search the Svelte stuff inside of a `.svelte.ts` file, so you could end up with lots of false positives. However, I was swayed[1] by the argument that the extension does actually include `svelte` in it, so maybe this is fine. Please open an issue if this change ends up being too annoying for most users. Closes #2874, Closes #2909 [1]: https://github.com/BurntSushi/ripgrep/issues/2874#issuecomment-3126892931	2025-07-28 07:58:09 -04:00
Andrew Gallant	afc820c9e9	cli: make `rg -vf file` behave sensibly Previously, when `file` is empty (literally empty, as in, zero byte), `rg -f file` and `rg -vf file` would behave identically. This is odd and also doesn't match how GNU grep behaves. It's also not logically correct. An empty file means _zero_ patterns which is an empty set. An empty set matches nothing. Inverting the empty set should result in matching everything. This was because of an errant optimization that lets ripgrep quit early if it can statically detect that no matches are possible. Moreover, there was also a bug in how we constructed the PCRE2 pattern when there are zero patterns. PCRE2 doesn't have a concept of sets of patterns (unlike the `regex` crate), so we need to fake it with an empty character class. Fixes #1332, Fixes #3001, Closes #3041	2025-07-27 15:09:19 -04:00
squidfunk	fcfe98fe58	globset: compact Debug impl for `GlobSetBuilder` and `Glob` Ideally we'd have a compact impl for `GlobSet` too, but that's a lot more work. In particular, the constituent types don't all store the original pattern string, so that would need to be added. Closes #3026	2025-07-27 10:28:55 -04:00
Zach Ahn	0434b5034d	ignore/types: add `*.rake` extension to list of Ruby file types This PR adds the .rake extension to the Ruby type. It's a pretty common file extension in Rails apps—in my experience, the Rakefile is often pretty empty and only sets some stuff up while most of the code lives in various .rake files. See: https://ruby.github.io/rake/doc/rakefile_rdoc.html#label-Multiple+Rake+Files Closes #2921	2025-07-26 11:55:30 -04:00
f3rn0s	b004eda8c8	ignore/types: add typst Closes #2914	2025-07-26 11:54:01 -04:00
Hamir Mahal	f1b4b182f2	style: simplify string formatting Most of this code was written before this was supported by Rust. Closes #2912	2025-07-26 11:52:49 -04:00
Thayne McCombs	e169881a36	globset: add matches_all method This returns true if all globs in the set match the supplied file. Fixes #2869, Closes #2900	2025-07-26 11:44:51 -04:00
Aleksey Vasilenko	83a4af7cb8	deps: switch to tikv-jemallocator It is now a recommended crate for jemalloc and it contains an [important fix for compilation on riscv64gc-unknown-linux-musl][fix], I bumped into this when I was trying to [build ripgrep on OpenWrt][openwrt]. Closes #2889 [fix]: https://github.com/tikv/jemallocator/pull/67 [openwrt]: https://github.com/openwrt/packages/pull/24961	2025-07-26 11:41:17 -04:00
Stephan Badragan	cf91d6e67a	printer: support `-r/--replace` with `--json` This adds a `replacement` field to each submatch object in the JSON output. In effect, this extends the `-r/--replace` flag so that it works with `--json`. This adds a new field instead of replacing the match text (which is how the standard printer works) for maximum flexibility. This way, consumers of the JSON output can access the original match text (and always rely on it corresponding to the original match text) while also getting the replacement text without needing to do the replacement themselves. Closes #1872, Closes #2883	2025-07-26 11:35:08 -04:00
Melvin Wang	0904f55d3e	ignore/types: include msbuild solution filters Closes #2871	2025-07-26 10:50:59 -04:00
Lucas Trzesniewski	8ead46a3e5	printer: use std::path::absolute on Windows This specifically avoids touching the file system, which can lead to fairly dramatic speed-ups in large repositories with lots of matches. Closes #2865	2025-07-26 10:49:52 -04:00
Alex Povel	d9744f3b03	ignore: improve multithreading heuristic This copies the one found in ripgrep. See also: `71d71d2d98/crates/core/flags/hiargs.rs (L172)` Closes #2854, Closes #2856	2025-07-26 10:42:29 -04:00
Thomas Otto	a6275648b3	ignore: don't process command line arguments in reverse order When searching in parallel with many more arguments than threads, the first arguments are searched last -- unlike in the -j1 case. This is unexpected for users who know about the parallel nature of rg and think they can give the scheduler a hint by positioning larger input files (L1, L2, ..) before smaller ones (█, ██). Instead, this can result in sub-optimal thread usage and thus longer runtime (simplified example with 2 threads). [Diagram: threads T1 and T2 each work through a run of small files first and only then start the large files L1 and L2.] This is caused by assigning work to per-thread stacks in a round-robin manner, starting with the first argument, and then processing them bottom-up. [Diagram: stacks T1 to T4 filled row by row with L1 L2 L3 L4, then s5 s6 s7 s8, .., st su sv sw, and finally sx sy sz; processing starts from the last row.] This patch reverses the input order so the two reversals cancel each other out. [Diagram: the stacks now end with the rows s7 s6 s5 L4 and L3 L2 L1.] Now at least the first N arguments, N=number-of-threads, are processed before any others (then work-stealing may happen). [Diagram: T1 and T2 start with L1 and L2 and process the small files afterwards.] (With some more shuffling T1 could always be assigned L1 etc., but that would mostly be for optics). Closes #2849	2025-07-26 10:42:29 -04:00
Christoph Badura	7fc48961ed	ignore/types: add Makefile.* The BSD build systems make use of "Makefile.inc" a lot. Make the "make" type recognize this file by default. And more generally, `Makefile.` seems to be a convention, so just generalize it. Closes #2846	2025-07-26 10:42:28 -04:00
Matt Kulukundis	bd8a7ae793	ignore: support `.jj` as well as `.git` This makes it so the presence of `.jj` will cause ripgrep to treat it as a VCS directory, just as if `.git` were present. This is useful for ripgrep's default behavior when working with jj repositories that don't have a `.git` but do have `.gitignore`. Namely, ripgrep requires the presence of a VCS repository in order to respect `.gitignore`. We don't handle clone-specific exclude rules for jj repositories without `.git` though. It seems it isn't 100% set yet where we can find those[1]. Closes #2842 [1]: https://github.com/BurntSushi/ripgrep/pull/2842#discussion_r2020076722	2025-07-26 10:42:28 -04:00
Tor Shepherd	ff8afcf8aa	color: add italic to style attributes Closes #2841	2025-07-26 10:42:28 -04:00
robert-bryson	aebab44e3e	core: add "total" to --stats output This makes it a little clearer. Apologies to anyone who is regex matching on this output. Closes #2797	2025-07-26 10:42:28 -04:00
Stephen Albert-Moore	ca88b2fd95	ignore/gitignore: skip BOM at start of ignore file This matches Git's behavior. Fixes #2177, Closes #2782	2025-07-26 10:42:28 -04:00
Riccardo Attilio Galli	57e90533a0	searcher: add more tests for `replace_bytes` ... and add a comment explaining an optimization. Closes #2729	2025-07-26 10:42:28 -04:00
Keith Smiley	fe07bd7669	ignore/types: detect `WORKSPACE.bzlmod` for bazel file type This file came alongside MODULE.bazel and I should have added it here previously. Closes #2726	2025-07-26 10:42:28 -04:00
William Johnson	95979048c9	globset: add opt-in `Arbitrary` trait implementations This feature is mandatory when using `Glob` in fuzz testing. Closes #2720	2025-07-26 10:42:28 -04:00
ChristopherYoung	c2f1653ddd	ignore: fix filtering searching subdir or .ignore in parent dir The previous code deleted too many parts of the path when constructing the absolute path, resulting in a shortened final path. This patch creates the correct absolute path by only removing the necessary parts. Fixes #829, Fixes #2731, Fixes #2747, Fixes #2778, Fixes #2836, Fixes #2933 Closes #2933	2025-07-26 10:42:27 -04:00
Jan Verbeek	78803979c5	complete/fish: Take RIPGREP_CONFIG_PATH into account The fish completions now also pay attention to the configuration file to determine whether to suggest negation options and not just to the current command line. This doesn't cover all edge cases. For example the config file is cached, and so changes may not take effect until the next shell session. But the cases it doesn't cover are hopefully very rare. Closes #2708	2025-07-26 10:42:27 -04:00
wang384670111	85a86eba2b	impl: switch most atomic ops to `Relaxed` ordering These all seem pretty straight-forward. Compared with #2706, I dropped the changes to the atomic orderings used in `ignore` because I haven't had time to think through that carefully. But the ops in this PR seem fine. Closes #2706	2025-07-26 10:42:27 -04:00
dependabot[bot]	6dfaec03e8	deps: bump crossbeam-channel from 0.5.13 to 0.5.15 Bumps [crossbeam-channel](https://github.com/crossbeam-rs/crossbeam) from 0.5.13 to 0.5.15. - [Release notes](https://github.com/crossbeam-rs/crossbeam/releases) - [Changelog](https://github.com/crossbeam-rs/crossbeam/blob/master/CHANGELOG.md) - [Commits](https://github.com/crossbeam-rs/crossbeam/compare/crossbeam-channel-0.5.13...crossbeam-channel-0.5.15) --- updated-dependencies: - dependency-name: crossbeam-channel dependency-version: 0.5.15 dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2025-04-10 10:55:32 -04:00
Pierre Rouleau	5fbc4fee64	ignore/types: fix Seed7 file extension PR #3023	2025-04-07 10:53:32 -04:00
Pierre Rouleau	004370bd16	ignore/types: add support for Seed7 files For more info on the Seed7 programming Language see: - on Wikipedia: https://en.wikipedia.org/wiki/Seed7 - Seed7 home: https://seed7.sourceforge.net/ - Seed7 repo: https://github.com/ThomasMertes/seed7 PR #3022	2025-04-07 08:51:22 -04:00
Andrew Gallant	de4baa1002	globset-0.4.16	2025-02-27 12:46:58 -05:00
Andrew Gallant	163ac157d3	globset: escape `{` and `}` in `escape` This appears to be an oversight from when `escape` was implemented in #2061.	2025-02-27 12:46:48 -05:00
Andrew Gallant	e2362d4d51	searcher: add log message noting detected encoding This helps improve diagnostics. Otherwise it can be easy to miss that ripgrep is doing transcoding. Fixes #2979	2025-01-25 14:27:00 -05:00
Max Coplan	94305125ef	zsh: support sourcing zsh completion dynamically Previously, you needed to save the completion script to a file and then source it. Now, you can dynamically source completions in zsh by running $ source <(rg --generate complete-zsh) Before this commit, you would get an error after step 1. After this commit, it should work as expected. We also improve the FAQ item for zsh completions. Fixes #2956	2024-12-31 08:23:13 -05:00
Andrew Gallant	79cbe89deb	doc: tweak wording for stdin detection This makes it slightly more precise to cover weird cases like trying to pass a directory on stdin. Closes #2906	2024-09-30 07:38:05 -04:00
Thayne McCombs	bf63fe8f25	regex: add as_match method to Captures trait Ref https://github.com/rust-lang/regex/issues/1146 PR #2898	2024-09-19 09:30:31 -04:00
Andrew Gallant	a1960877cf	grep-0.3.2	2024-09-08 22:11:00 -04:00
Andrew Gallant	bb0925af91	deps: bump grep-printer to 0.2.2	2024-09-08 22:10:49 -04:00
Andrew Gallant	be117dbafa	grep-printer-0.2.2	2024-09-08 22:10:29 -04:00
Andrew Gallant	06dc13ad2d	deps: bump grep-searcher to 0.1.14	2024-09-08 22:09:55 -04:00
Andrew Gallant	c6c2e69b8f	grep-searcher-0.1.14	2024-09-08 22:09:27 -04:00
Andrew Gallant	e67c868ddd	deps: bump grep-pcre2 to 0.1.8	2024-09-08 22:09:23 -04:00
Andrew Gallant	d33f2e2f70	grep-pcre2-0.1.8	2024-09-08 22:08:41 -04:00
Andrew Gallant	082edafffa	deps: bump grep-regex to 0.1.13	2024-09-08 22:08:22 -04:00
Andrew Gallant	7c8dc332b3	grep-regex-0.1.13	2024-09-08 22:07:52 -04:00
Andrew Gallant	ea961915b5	deps: bump grep-cli to 0.1.11	2024-09-08 22:07:30 -04:00
Andrew Gallant	7943bdfe82	grep-cli-0.1.11	2024-09-08 22:06:59 -04:00
Andrew Gallant	ac02f54c89	ignore-0.4.23	2024-09-08 22:06:03 -04:00
Andrew Gallant	24b337b940	deps: bump globset to 0.4.15	2024-09-08 22:05:45 -04:00
Andrew Gallant	a5083f99ce	globset-0.4.15	2024-09-08 22:04:48 -04:00
Andrew Gallant	f89cdba5df	doc: update date in man page template	2024-09-08 22:04:11 -04:00
Andrew Gallant	9d738ad0c0	regex: fix inner literal extraction that resulted in false negatives In some rare cases, it was possible for ripgrep's inner literal detector to extract a set of literals that could produce a false negative. #2884 gives an example: `(?i:e.x|ex)`. In this case, the set extracted can be discovered by running `rg '(?i:e.x|ex)' --trace`: Seq[E("EX"), E("Ex"), E("eX"), E("ex")] This extraction leads to building a multi-substring matcher for `EX`, `Ex`, `eX` and `ex`. Searching the haystack `e-x` produces no match, and thus, ripgrep shows no matches. But the regex `(?i:e.x|ex)` matches `e-x`. The issue at play here was that when two extracted literal sequences were unioned, we were incorrectly unioning their "prefix" attribute. And this in turn leads to those literal sequences being combined incorrectly via cross product. This case in particular triggers it because two different optimizations combine to produce an incorrect result. Firstly, the regex has a common prefix extracted and is rewritten as `(?i:e(?:.x|x))`. Secondly, the `x` in the first branch of the alternation has its `prefix` attribute set to `false` (correctly), which means it can't be cross producted with another concatenation. But in this case, it is unioned with the `x` from the second branch, and this results in the union result having `prefix` set to `true`. This in turn pops up and lets it get cross producted with the `e` prefix, producing an incorrect literal sequence. We fix this by changing the implementation of `union` to return `prefix` set to `true` only when both literal sequences being unioned have `prefix` set to `true`. Doing this exposed a second bug that was present, but was purely cosmetic: the extracted literals in this case, after the fix, are `X` and `x`. They were considered "exact" (i.e., lead to a match), but of course they are not. Observing an `X` or an `x` does not mean there is a match. This was fixed by making `choose` always return an inexact literal sequence. This is perhaps too conservative in aggregate in some cases, but always correct. The idea here is that if one is choosing between two concatenations, then it is likely the case that the sequence returned should be considered inexact. The issue is that this can lead to avoiding cross products in some cases that would otherwise be correct. This is bad because it means extracting shorter literals in some cases. (In general, the longer the literal the better.) But we prioritize correctness for now and fix it. You can see a few tests where this shortens some extracted literals. Fixes #2884	2024-09-08 22:00:46 -04:00
Cort Spellman	af8c386d5e	doc: fix typo in --heading flag help PR #2864	2024-08-02 17:32:42 -04:00
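
The `union` rule described in commit 9d738ad0c0 above can be sketched roughly as follows. This is only an illustration of the rule as the commit message states it, not ripgrep's actual code: the `LiteralSeq` type, its fields, and the functions `union_seqs` and `cross_product` are hypothetical names invented here, and case-insensitivity is left out to keep the sketch small.

```rust
/// Hypothetical stand-in for an extracted literal sequence.
/// `prefix` records whether the sequence may still be combined
/// with a preceding concatenation via cross product.
#[derive(Clone, Debug, PartialEq)]
struct LiteralSeq {
    literals: Vec<String>,
    prefix: bool,
}

/// Union of the literal sequences of two alternation branches.
/// Per the commit message, the result has `prefix` set to `true`
/// only when both inputs have `prefix` set to `true`.
fn union_seqs(a: &LiteralSeq, b: &LiteralSeq) -> LiteralSeq {
    let mut literals = a.literals.clone();
    for lit in &b.literals {
        // Keep the union free of duplicate literals.
        if !literals.contains(lit) {
            literals.push(lit.clone());
        }
    }
    LiteralSeq { literals, prefix: a.prefix && b.prefix }
}

/// Cross product of `head` with `rest`. Returns `None` when `rest`
/// is not marked as a prefix, since combining would then produce
/// literals that the regex does not actually require.
fn cross_product(head: &LiteralSeq, rest: &LiteralSeq) -> Option<LiteralSeq> {
    if !rest.prefix {
        return None;
    }
    let mut literals = Vec::new();
    for h in &head.literals {
        for r in &rest.literals {
            literals.push(format!("{h}{r}"));
        }
    }
    Some(LiteralSeq { literals, prefix: head.prefix && rest.prefix })
}
```

In this sketch, unioning an `x` whose `prefix` is `false` with an `x` whose `prefix` is `true` yields `prefix == false`, so `cross_product` with the `e` prefix returns `None` rather than the incorrect literal `ex`.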


447 Commits
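
The ordering effect described in commit a6275648b3 above can be reproduced with a small model. This is a simplified sketch under stated assumptions, not the `ignore` crate's implementation: the functions `distribute`, `first_round`, `first_round_old`, and `first_round_new` are hypothetical, each thread is modeled as a plain `Vec` used as a stack, and work-stealing is ignored.

```rust
/// Assign `items` to `threads` per-thread stacks in round-robin order.
fn distribute<T: Clone>(items: &[T], threads: usize) -> Vec<Vec<T>> {
    assert!(threads > 0, "need at least one thread");
    let mut stacks = vec![Vec::new(); threads];
    for (i, item) in items.iter().enumerate() {
        stacks[i % threads].push(item.clone());
    }
    stacks
}

/// The item each thread starts with: the top (last pushed) of its stack.
fn first_round<T: Clone>(stacks: &[Vec<T>]) -> Vec<T> {
    stacks.iter().filter_map(|s| s.last().cloned()).collect()
}

/// Model of the old behavior: arguments are pushed in the order given,
/// so the arguments given last end up on top and are processed first.
fn first_round_old<T: Clone>(args: &[T], threads: usize) -> Vec<T> {
    first_round(&distribute(args, threads))
}

/// Model of the patched behavior: the input is reversed before it is
/// distributed, so the stack reversal and the input reversal cancel out.
fn first_round_new<T: Clone>(args: &[T], threads: usize) -> Vec<T> {
    let reversed: Vec<T> = args.iter().rev().cloned().collect();
    first_round(&distribute(&reversed, threads))
}
```

With the arguments `L1 L2 L3 L4 s5 s6 s7` and four threads, `first_round_old` starts the threads on `s5 s6 s7 L4`, while `first_round_new` starts them on `L3 L2 L1 L4`, so the first four arguments are processed before any others.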