ripgrep

mirror of https://github.com/BurntSushi/ripgrep.git synced 2024-12-12 19:18:24 +02:00

Author	SHA1	Message	Date
Andrew Gallant	be7d6dd9ce	regex: print out final regex in trace mode This is useful for debugging to see what regex is actually being run. We put this as a trace since the regex can be quite gnarly. (It is not pretty printed.)	2019-04-05 23:24:08 -04:00
Andrew Gallant	9f15e3b671	regex: fix a perf bug when using -w flag When looking for an inner literal to speed up searches, if only a prefix is found, then we generally give up doing inner literal optimizations since the regex engine will generally handle it for us. Unfortunately, this decision was being made before we wrap the regex in (^\|\W)...($\|\W) when using the -w/--word-regexp flag, which would then defeat the literal optimizations inside the regex engine. We fix this with a bit of a hack that says, "if we're doing a word regexp, then give me back any literal you find, even if it's a prefix."	2019-04-05 23:24:08 -04:00
Andrew Gallant	254b8b67bb	globset: small perf improvements This tweaks the path handling functions slightly to make them a hair faster. In particular, `file_name` is called on every path that ripgrep visits, and it was possible to remove a few branches without changing behavior.	2019-04-05 23:24:08 -04:00
Andrew Gallant	8a7f43b84d	globset: use bstr This simplifies the various path related functions and pushed more platform dependent code down into bstr. This likely also makes things a bit more efficient on Windows, since we now only do a single UTF-8 check for each file path.	2019-04-05 23:24:08 -04:00
Andrew Gallant	d968a27ed5	cli: use bstr This uses bstr in the unescaping logic. This lets us remove some platform specific code, and also lets us remove a hacked UTF-8 decoder on raw bytes.	2019-04-05 23:24:08 -04:00
Andrew Gallant	9b8f5cbaba	config: switch to using bstrs This lets us implement correct Unicode trimming and also simplifies the parsing logic a bit. This also removes the last platform specific bits of code in ripgrep core.	2019-04-05 23:24:08 -04:00
Andrew Gallant	c52da74ac3	printer: use bstr This starts the usage of bstr in the printer. We don't use it too much yet, but it comes in handy for implementing PrinterPath and lets us push down some platform specific code into bstr.	2019-04-05 23:24:08 -04:00
Andrew Gallant	7dcbff9a9b	searcher: partially migrate to bstr This commit causes grep-searcher to use byte strings internally for its line buffer support. We manage to remove a use of `unsafe` by doing this (by pushing it down into `bstr`). We stop short of using byte strings everywhere else because we rely heavily on the `impl ops::Index<[u8]> for grep_matcher::Match` impl, which isn't available for byte strings. (It is premature to make bstr a public dep of a core crate like grep-matcher, but maybe some day.)	2019-04-05 23:24:08 -04:00
Andrew Gallant	bef1f0e770	ci: switch to xenial (#1234 ) Rust is having problems with trusty, in particular, see this bug I filed: https://github.com/rust-lang/rust/issues/59411 This was purpotedly fixed in https://github.com/rust-lang/rust/pull/59468, but it appears the issue is still occurring. This commit tries to update to Ubuntu 16.04 in the hope that it will fix this problem.	2019-04-03 19:52:34 -04:00
Andrew Gallant	cd9815cb37	deps: update to aho-corasick 0.7 We do the simplest possible change to migrate to the new version. Fixes #1228	2019-04-03 13:51:26 -04:00
Andrew Gallant	3f22c3a658	deps: update everything This updates all dependencies to their latest versions. We tolerate a duplicative aho-corasick for now, which we will fix in the next commit.	2019-04-03 13:07:26 -04:00
Andrew Gallant	0913972104	deps: bump encoding_rs_io This brings in a new API for disabling BOM sniffing. This is part of the work toward completing https://github.com/BurntSushi/ripgrep/issues/1207	2019-03-03 16:36:34 -05:00
Andrew Gallant	f19b84fb23	regex: bump regex dep to fix match bug See * `661bf53d5b` * `edf45e6f5f` for details on the bug fix, which was in the regex engine. Fixes #1203	2019-02-27 17:42:14 -05:00
Andrew Gallant	59fc583aeb	readme: include details about filtering Despite the fact that we mention this in several places, people are still surprised by ripgrep's "smart" filtering.	2019-02-27 08:01:23 -05:00
Andrew Gallant	1c7c4e6640	deps: update tempfile	2019-02-21 16:32:17 -05:00
Andrew Gallant	69c5e3938d	deps: bump smallvec This gets rid of the unmaintained crates `unreachable` and `void`. Yay!	2019-02-21 16:31:48 -05:00
Andrew Gallant	d9cf05ad50	deps: update to aho-corasick 0.6.10 This brings in a fix for this bug: https://github.com/BurntSushi/aho-corasick/issues/37 Fixes #1079	2019-02-16 11:39:33 -05:00
Andrew Gallant	af8b6caebb	deps: update various dependencies	2019-02-16 09:39:42 -05:00
Andrew Gallant	c84cfb6756	grep-regex-0.1.2	2019-02-16 09:30:06 -05:00
Andrew Gallant	895e26a000	ci: don't do releases on all tags This attempts to make Appveyor more conservative in what tags it thinks are releases. I don't know for sure, but it looks like the previous regex could match anywhere, so we anchor it. Fixes #1195	2019-02-10 12:51:56 -05:00
Andrew Gallant	8c95290ff6	deps: miscellaneous updates	2019-02-10 07:45:08 -05:00
Andrew Gallant	d6feeb7ff2	grep-searcher-0.1.3	2019-02-10 07:42:37 -05:00
Andrew Gallant	626ed00c19	searcher: revert big-endian patch This undoes the patch to stop using bytecount on big-endian architectures. In particular, we bump our bytecount dependency to the latest release, which has a fix. This reverts commit `a4868b8835`. Fixes #1144 (again), Closes #1194	2019-02-10 07:40:32 -05:00
Andrew Gallant	332ad18401	tests: use const constructor for atomics We did this in `05411b2b` for core ripgrep, but didn't carry it over to tests.	2019-02-09 16:27:25 -05:00
Andrew Gallant	fc3cf41247	grep-searcher-0.1.2	2019-02-09 16:13:07 -05:00
Andrew Gallant	a4868b8835	searcher: use naive line counting on big-endian This patches out bytecount's "fast" vectorized algorithm on big-endian machines, where it has been observed to fail. Going forward, bytecount should probably fix this on their end, but for now, we take a small performance hit on big-endian machines. Fixes #1144	2019-02-09 16:13:07 -05:00
John Schmidt	f99b991117	ignore/types: add zig PR #1191	2019-02-08 08:12:40 -05:00
Andrew Gallant	de0bc78982	deps: bump encoding_rs to 0.8.16 This brings in an updated `encoding_rs` crate that uses `packed_simd`, which compiles on the latest nightly. Compilation times do appear to be impacted significantly though. Fixes #1175 (again)	2019-02-07 17:05:14 -05:00
Steffen Banhardt	147e96914c	ignore/types: .dtx and .ins added for tex PR #1182	2019-01-31 09:06:19 -05:00
Andrew Gallant	0abc40c23c	readme: bump MSRV We bumped it a while back in the CI configuration, but didn't update the README.	2019-01-29 13:10:43 -05:00
Andrew Gallant	f768796e4f	deps: update other deps	2019-01-29 13:08:56 -05:00
Andrew Gallant	da0c0c4705	deps: update to crossbeam-channel 0.3.8 This drops dependencies on parking_lot and rand from ripgrep. (rand is still used for tests.)	2019-01-29 13:07:37 -05:00
Andrew Gallant	05411b2b32	deprecated: remove use of ATOMIC_BOOL_INIT Our MSRV is high enough that we can use const functions now.	2019-01-29 13:05:16 -05:00
Andrew Gallant	cc93db3b18	cargo: include auto-generated message This is going to be annoying for a while if one switches between the latest nightly compiler and older compilers. Sigh.	2019-01-29 13:04:40 -05:00
Alex Macleod	049354b766	readme: remove EOL Fedora install instructions Fedora 27 and below are past their EOL, so it can now be said that it's supported regularly on Fedora. PR #1177	2019-01-28 08:15:36 -05:00
Andrew Gallant	386dd2806d	changelog: BUG #916 This was fixed by bumping the MSRV above Rust 1.28. Fixes #916	2019-01-27 13:15:17 -05:00
Andrew Gallant	5fe9a954e6	changelog: BUG #1154	2019-01-27 13:05:50 -05:00
Andrew Gallant	f158a42a71	ignore: correctly detect hidden files on Windows This commit fixes a bug where ripgrep only treated files beginning with a `.` as hidden. On Windows, we continue this tradition, but additionally check whether a file has the special Windows "hidden" attribute set. If so, we treat it as a hidden file. In order to make this work without an additional stat call, we had to rearrange some of the plumbing from the directory traverser. Fixes #1154	2019-01-27 12:11:52 -05:00
Andrew Gallant	5724391d39	doc: small updates to the FAQ and GUIDE Notably, ripgrep can do multiline search now. We also update the supported compression format list and replace deprecated flags like `--sort-files` with `--sort path`.	2019-01-26 16:19:09 -05:00
Andrew Gallant	0df71240ff	search: fix -F and -f interaction bug This fixes what appears to be a pretty egregious regression where the `-F/--fixed-strings` flag wasn't be applied to patterns supplied via the `-f/--file` flag. The same bug existed for the `-x/--line-regexp` flag as well, which we fix here. Fixes #1176	2019-01-26 16:01:52 -05:00
Andrew Gallant	f3164f2615	exit: tweak exit status logic This changes how ripgrep emit exit status codes. In particular, any error that occurs while searching will now cause ripgrep to emit a `2` exit code, where as it previously would emit either a `0` or a `1` code based on whether it matched or not. That is, ripgrep would only emit a `2` exit code for a catastrophic error. This tweak includes additional logic that GNU grep adheres to, which seems like good sense. Namely, if -q/--quiet is given, and an error occurs and a match occurs, then ripgrep will emit a `0` exit code. Closes #1159	2019-01-26 15:44:49 -05:00
Andrew Gallant	31d3e24130	args: prevent panicking in 'rg -h \| rg' Previously, we relied on clap to handle printing either an error message, or --help/--version output, in addition to setting the exit status code. Unfortunately, for --help/--version output, clap was panicking if the write failed, which can happen in fairly common scenarios via a broken pipe error. e.g., `rg -h \| head`. We fix this by using clap's "safe" API and doing the printing ourselves. We also set the exit code to `2` when an invalid command has been given. Fixes #1125 and partially addresses #1159	2019-01-26 14:39:40 -05:00
Andrew Gallant	bf842dbc7f	doc: add note about inverted flags Fixes #1091	2019-01-26 14:13:06 -05:00
Andrew Gallant	6d5dba85bd	doc: clarify automatic encoding detection Fixes #1103	2019-01-26 13:55:47 -05:00
Andrew Gallant	afb89bcdad	fmt: shorten --ignore-file-case-insensitive description	2019-01-26 13:45:02 -05:00
Andrew Gallant	332dc56372	changelog: BUG #1095	2019-01-26 13:40:59 -05:00
Andrew Gallant	12a6ca45f9	config: add --no-ignore-dot flag This flag causes ripgrep to ignore `.ignore` files. Closes #1138	2019-01-26 13:40:12 -05:00
Andrew Gallant	9d703110cf	regex: make CRLF hack more robust This commit improves the CRLF hack to be more robust. In particular, in addition to rewriting `$` as `(?:\r??$)`, we now strip `\r` from the end of a match if and only if the regex has an ending line anchor required for a match. This doesn't quite make the hack 100% correct, but should fix most use cases in practice. An example of a regex that will still be incorrect is `foo\|bar$`, since the analysis isn't quite sophisticated enough to determine that a `\r` can be safely stripped from any match. Even if we fix that, regexes like `foo\r\|bar$` still won't be handled correctly. Alas, more work on this front should really be focused on enabling this in the regex engine itself. The specific cause of this bug was that grep-searcher was sneakily stripping CRLF from matching lines when it really shouldn't have. We remove that code now, and instead rely on better match semantics provided at a lower level. Fixes #1095	2019-01-26 12:34:28 -05:00
Andrew Gallant	e99b6bda0e	deps: bump regex-syntax to 0.6.5 This is necessary for the use of the new is_line_anchored_{start,end} APIs.	2019-01-26 12:20:02 -05:00
Andrew Gallant	276e2c9b9a	searcher: always strip BOM This fixes a bug where a BOM prefix was included. While this was somewhat intentional in order to have a faithful "UTF8 passthru" option, in practice, this causes problems such as breaking patterns like `^` in a really non-obvious way. The actual fix was to add a new API to encoding_rs_io, which this commit brings in. Fixes #1163	2019-01-25 17:18:57 -05:00

... 4 5 6 7 8 ...

1453 Commits