ripgrep

mirror of https://github.com/BurntSushi/ripgrep.git synced 2025-07-11 14:30:24 +02:00

Author	SHA1	Message	Date
Andrew Gallant	3a1780d841	deps: replace memmap with memmap2 memmap is unmaintained at this point and it is being flagged as a RUSTSEC advisory in ripgrep. This doesn't seem like that big of a deal to me honestly, but memmap2 looks like a fine choice at this point. Fixes #1785, Closes #1786	2021-01-17 18:49:51 -05:00
Andrew Gallant	a6d05475fb	ignore-0.4.17	2020-11-23 10:25:33 -05:00
Roey Darwish Dror	020c5453a5	cli: fix stdin detection for Powershell on Unix It seems that PowerShell uses sockets instead of FIFOs to redirect the output between commands. So add `is_socket` to our `is_readable_stdin` check. This seems unlikely to cause problems and it probably more generally correct than what we had before. In theory, it could cause problems if it produces false positives, in which case, ripgrep will try to read stdin when it should search the current working directory. (And this usually winds up manifesting as ripgrep blocking forever.) But, if the stdin handle reports itself as a socket, then it seems like we should read it. Fixes #1741, Closes #1742	2020-11-23 10:23:34 -05:00
Ed Page	873abecbf1	ignore: provide underlying IO Error `ignore::Error` wraps `std::io::Error` with additional information (as well as expose non-IO errors). For people wanting to inspect what the error is, they have to recursively match the Enum. This provides `io_error` and `into_io_error` helpers to do this for the user. PR #1740	2020-11-23 10:19:31 -05:00
James Harr	44e69ba627	ignore/types: add yang file type YANG is described in RFC 6020 https://tools.ietf.org/html/rfc6020 PR #1736	2020-11-20 09:41:29 -05:00
Andrew Gallant	d97fb72d84	doc: update CI links in crate READMEs I switched to GitHub Actions long ago, which replaces both Travis and AppVeyor. Fixes #1732	2020-11-16 19:07:16 -05:00
Andrew Gallant	d6365117e2	doc: sync --help output with man page The man page had the correct usage hints, but the -h/--help output was using an older more incorrect version of the hints. Closes #1730 (again)	2020-11-15 15:27:23 -05:00
Alex Touchet	3ca324fda7	doc: update several links to use https PR #1724	2020-11-03 10:33:36 -05:00
Andrew Gallant	2819212f89	printer: tweak binary detection message format This roughly matches similar changes made in GNU grep recently.	2020-11-02 10:52:51 -05:00
Andrew Gallant	810be0b348	deps: update base64 to 0.13.0	2020-11-02 10:52:51 -05:00
Andrew Gallant	3ef63dacbe	deps: targeted update of some dependencies This updates encoding_rs, crossbeam-utils and crossbeam-channel. This serves two purposes. The encoding_rs update fixes a compilation failure on the latest nightly. The crossbeam updates are good sense and to reduce duplicate dependencies such as cfg-if. (Although, we note that the log crate still pulls in cfg-if 0.1, so ripgrep has a duplicate dependency there for now. But it's very small.) Fixes #1721, Closes #1705	2020-11-02 10:52:51 -05:00
Vanessa McHale	e1ac18ef06	ignore/types: add Futhark See: https://futhark-lang.org/ PR #1720	2020-10-31 12:10:15 -04:00
Brandon Adams	ba3f9673ad	ignore/types: generalize bazel type a bit Bazel supports `BUILD.bazel` as well as `WORKSPACE.bazel`. In addition, it is common to ship BUILD/WORKSPACE templates for external repositories suffixed with .bazel for easier tool recognition. Co-authored-by: Brandon Adams <brandon.adams@imc.com> PR #1716	2020-10-23 12:24:30 -04:00
Andrew Gallant	c777e2cd57	globset-0.4.6	2020-10-21 21:10:43 -04:00
Ajeet D'Souza	e5639cf22d	globset: remove regex unicode dependency Since the translation from a glob to a regex always disables Unicode in the regex, it follows that we shouldn't need regex's Unicode features enabled. Now, ripgrep enables Unicode features in its regex dependency and of course uses them, which will cause globset to have it enabled in the ripgrep build as well. So this doesn't actually change anything for ripgrep. But this does slim thing downs for folks using globset independently of ripgrep. PR #1712	2020-10-19 14:29:05 -04:00
Dương Đỗ Minh Châu	86c843a44b	ignore/types: add a type for minified files Fixes #1710, PR #1711	2020-10-19 09:10:54 -04:00
Andrew Gallant	2b1637d1db	doc: clarify how -S/--smart-case works Whether or not smart case kicks in can be a little subtle in some cases. So we document the specific conditions in which it applies. These conditions were taken directly from the public API docs of the `grep-regex` crate: https://docs.rs/grep-regex/0.1.8/grep_regex/struct.RegexMatcherBuilder.html#method.case_smart Fixes #1708	2020-10-17 18:55:44 -04:00
Andrew Pyatkov	6301e20ee4	ignore/types: add flatbuffers type See: https://google.github.io/flatbuffers/ PR #1707	2020-10-16 20:19:16 -04:00
dana	145cef2eff	doc: elaborate on the function of -u/--unrestricted Fixes #1703	2020-10-16 09:52:42 -04:00
Andy Freeland	fc2a99bb1f	ignore/types: add vcl (#1659 ) VCL is the Varnish Configuration Language used by Varnish and Fastly. https://varnish-cache.org/docs/trunk/users-guide/vcl.html PR #1659	2020-08-19 16:28:14 -04:00
Raimon Grau (rgrau)	ffd4c9ccba	ignore/types: add racket PR #1628	2020-06-25 08:51:32 -04:00
jtrakk	a16bfcb3d6	ignore/types: add dvc This provides support for DVC files (https://dvc.org/). PR #1608	2020-06-09 07:44:09 -04:00
Martin Michlmayr	1b2c1dc675	doc: fix typos PR #1605	2020-06-04 09:06:09 -04:00
Andrew Gallant	f97cc623f7	grep-0.2.7	2020-05-29 09:17:24 -04:00
Andrew Gallant	f35de5c523	grep: update minimal dependency versions	2020-05-29 09:17:08 -04:00
Andrew Gallant	c9bb78ceba	grep-cli-0.1.5	2020-05-29 09:14:18 -04:00
Andrew Gallant	72bdde6771	ignore-0.4.16	2020-05-29 09:13:02 -04:00
Andy Salerno	e8822ce97a	ignore/doc: update misleading documentation This likely originated from a bad copy/paste. PR #1596	2020-05-24 23:12:53 -04:00
Andrew Gallant	a700b75843	doc: clarify capture group indices And in particular, note the special $0 index, which corresponds to the entire match. Fixes #1591	2020-05-21 22:22:51 -04:00
Gerion Entrup	b72ad8f8aa	ignore/types: add meson filetype Closes #1586, PR #1587	2020-05-18 14:01:35 -04:00
Andrew Gallant	1980630f17	doc: fix egregious markup output We use '+++' syntax to output a literal '*' for a '--glob' example. This '+++' syntax is pretty ugly when rendered literally via --help. We fix this by hackily inserting the '+++' syntax for its one specific case that we need it during man page generation. Not ideal but it works. And --help still has some 'foo*' markup, but we live with that for now. Fixes #1581	2020-05-13 08:13:05 -04:00
Andrew Gallant	72807462e8	deps: update minimal versions for dependencies	2020-05-09 10:39:43 -04:00
Andrew Gallant	08dee094dd	grep-0.2.6	2020-05-09 10:37:29 -04:00
Andrew Gallant	caa53b7b09	grep: update minimal dependency versions	2020-05-09 10:37:08 -04:00
Andrew Gallant	c5d6141562	grep-printer-0.1.5	2020-05-09 10:33:02 -04:00
Andrew Gallant	c0f0492b98	grep-regex-0.1.8	2020-05-09 10:31:29 -04:00
Andrew Gallant	568018386b	ignore-0.4.15	2020-05-09 10:27:19 -04:00
Andrew Gallant	b458cf39f2	deps: update to base64 0.12 No code changes were necessary.	2020-05-09 10:25:37 -04:00
Casey Rodarmor	793c1179cc	ignore: allow filtering with predicate Adds `WalkBuilder::filter_entry` that takes a predicate to be applied to all entries. If the predicate returns `false` on a given entry, that entry and all children will be skipped. Fixes #1555, Closes #1557	2020-05-08 23:24:40 -04:00
Wieland Hoffmann	df7a3bfc7f	grep-cli: support files compressed by compress(1) While Linux distributions (at least Arch Linux, RHEL, Debian) do not support compressing files with compress(1), macOS & AIX do (the utility is part of POSIX). Additionally, gzip is able to uncompress such compressed files and provides an `uncompress` binary. Closes #1547	2020-05-08 23:24:40 -04:00
Andrew Gallant	28f2a93cae	doc: shorten -h/--help prelude It has grown quite long. It would be nice if we could shorten this only when -h is used and keep it long for --help, but it seems clap doesn't let this happen. (It does have `about` and `long_about` options, but they don't work, even when I disable the use of the template.) The longer prelude is now only available in the man page. This addresses #189.	2020-05-08 23:24:40 -04:00
Andrew Gallant	64a4dee495	cli: improve invalid UTF-8 pattern error message When a pattern with invalid UTF-8 is given, the error message suggests unqualified use of hex escape sequences to match arbitrary bytes. But you also need to disable Unicode mode. So include that in the error message. Fixes #1339	2020-05-08 23:24:40 -04:00
Andrew Gallant	50840ea43b	doc: note how to escape a '$' in --replace Fixes #1524	2020-05-08 23:24:40 -04:00
Andrew Gallant	17dcc2bf51	doc: clarify that files override gitignores This attempts to fix some mild confusion that came up as part of #1574. Specifically: https://github.com/BurntSushi/ripgrep/issues/1574#issuecomment-625780436	2020-05-08 23:24:40 -04:00
Andrew Gallant	9a858e4909	doc: add config file note for --type-{add,clear} This clarifies that persistence is possible via a configuration file. Fixes #1571	2020-05-08 23:24:40 -04:00
Andrew Gallant	7ed9a31819	printer: fix --count-matches output In order to implement --count-matches, we simply re-execute the regex on the spans reported by the searcher. The spans always correspond to the lines that participated in the match. This is the correct thing to do, except when the regex contains look-ahead (or look-behind). In particular, the look-around permits the regex's match success to depends on an arbitrary point before or after the lines actually reported as participating in the match. Since only the matched lines are reported to the printer, it is possible for subsequent searching on those lines to fail. A true fix for this would somehow make the total span available to the printer. But that seems tricky since it isn't always available. For PCRE2's case in multiline mode, it is available because we force it to be so for correctness. For now, we simply detect this corner case heuristically. If the match count is zero, then it necessarily means there is some kind of look-around that isn't matching. So we set the match count to 1. This is probably incorrect in some cases, although my brain can't quite come up with a concrete example. Nevertheless, this is strictly better than the status quo. Fixes #1573	2020-05-08 23:24:40 -04:00
Andrew Gallant	139f186e57	crates/ignore: switch to depth first traversal This replaces the use of channels in the parallel directory traversal with a simple stack. The primary motivation for this change is to reduce peak memory usage. In particular, when using a channel (which is a queue), we wind up visiting files in a breadth first fashion. Using a stack switches us to a depth first traversal. While there are no real intrinsic differences, depth first traversal generally tends to use less memory because directory trees are more commonly wide than they are deep. In particular, the queue/stack size itself is not the only concern. In one recent case documented in #1550, a user wanted to search all Rust crates. The directory structure was shallow but extremely wide, with a single directory containing all crates. This in turn results is in descending into each of those directories and building a gitignore matcher for each (since most crates have `.gitignore` files) before ever searching a single file. This means that ripgrep has all such matchers in memory simultaneously, which winds up using quite a bit of memory. In a depth first traversal, peak memory usage is much lower because gitignore matches are built and discarded more quickly. In the case of searching all crates, the peak memory usage decrease is dramatic. On my system, it shrinks by an order magnitude, from almost 1GB to 50MB. The decline in peak memory usage is consistent across other use cases as well, but is typically more modest. For example, searching the Linux repo has a 50% decrease in peak memory usage and searching the Chromium repo has a 25% decrease in peak memory usage. Search times generally remain unchanged, although some ad hoc benchmarks that I typically run have gotten a bit slower. As far as I can tell, this appears to be result of scheduling changes. Namely, the depth first traversal seems to result in searching some very large files towards the end of the search, which reduces the effectiveness of parallelism and makes the overall search take longer. This seems to suggest that a stack isn't optimal. It would instead perhaps be better to prioritize searching larger files first, but it's not quite clear how to do this without introducing more overhead (getting the file size for each file requires a stat call). Fixes #1550	2020-04-18 11:33:03 -04:00
Andrew Gallant	a75b4d122a	doc: fix newline escape Fixes #1551	2020-04-13 08:49:27 -04:00
Andrew Gallant	1c4b5adb7b	regex: fix another inner literal bug It looks like `is_simple` wasn't quite correct. I can't wait until this code is rewritten. It is still not quite clearly correct to me. Fixes #1537	2020-04-01 20:37:48 -04:00
Marius Schulz	3d6a58faff	doc: fix typo in help description PR #1536	2020-03-30 17:31:16 -04:00

1 2

76 Commits