ripgrep

mirror of https://github.com/BurntSushi/ripgrep.git synced 2024-12-12 19:18:24 +02:00

Author	SHA1	Message	Date
Fabian Würfl	d1e4d28f30	readme: remove outdated statement Issue #10 already states that "ripgrep is now in most or all of the major package repositories." PR #1280	2019-05-14 18:44:50 -04:00
Andrew Gallant	5ce2d7351d	ci: use cross for musl x86_64 builds This is necessary because jemalloc + musl + Ubuntu 16.04 is apparently broken. Moreover, jemalloc doesn't support i686, so we accept the performance regression there. See also: https://github.com/gnzlbg/jemallocator/issues/124	2019-04-25 11:12:14 -04:00
Andrew Gallant	9dcfd9a205	deps: bump pcre2-sys to 0.2.1 This brings in a bug fix that no longer tries to run `git` to update the submodule if the `git` command doesn't exist. This is useful is more restricted build contexts where `git` isn't installed. Such as in the docker image used for running `cross`.	2019-04-25 11:12:14 -04:00
Andrew Gallant	36b276c6d0	printer: remove unnecessary mut	2019-04-24 17:22:27 -04:00
Andrew Gallant	03bf37ff4a	alloc: use jemalloc when building with musl It turns out that musl's allocator is slow enough to cause a fairly noticeable performance regression when ripgrep is built as a static binary with musl. We fix this by using jemalloc when building with musl. We continue to use the default system allocator in all other scenarios. Namely, glibc's allocator doesn't noticeably regress performance compared to jemalloc. But we could add more targets to this logic if other system allocators (macOS, Windows) prove to be slow. This wasn't necessary before because rustc recently stopped using jemalloc by default. Fixes #1268	2019-04-24 17:21:38 -04:00
Andrew Gallant	e7829c05d3	cli: fix bug where last byte was stripped In an effort to strip line terminators, we assumed their existence. But a pattern file may not end with a line terminator, so we shouldn't unconditionally strip them. We fix this by moving to bstr's line handling, which does this for us automatically.	2019-04-19 07:11:44 -04:00
Rory O’Kane	a6222939f9	readme: mention --pcre2 as long form of -P This is for consistency with the short and long flags given in other bullet points. I originally assumed there was no long flag for `-P` because none was given here. PR #1254	2019-04-16 21:22:48 -04:00
Rory O’Kane	6ffd434232	readme: mention --auto-hybrid-regex in advantages This feature solves a major reason I was skeptical of using ripgrep, so I think it’s good to mention it in the section about why one should use it. I use backreferences a lot, so I had previously thought that ripgrep would provide no speed advantage over ag, since I would always have `-P` enabled. But when I saw `--auto-hybrid-regex` in the 11.0.0 changelog, I learned that ripgrep can use it to speed up simple queries while still allowing me to write backreferences. PR #1253	2019-04-16 17:21:40 -04:00
Andrew Gallant	1f1cd9b467	pkg: update brew tap to 11.0.1	2019-04-16 13:39:56 -04:00
Andrew Gallant	973de50c9e	ripgrep: release 11.0.1, take 2	2019-04-16 13:11:28 -04:00
Andrew Gallant	5f8805a496	ripgrep: release 11.0.1	2019-04-16 13:10:29 -04:00
Andrew Gallant	fdde2bcd38	deps: update regex to 1.1.6 This brings in a fix for a regression introduced in ripgrep 11. Fixes #1247	2019-04-16 08:34:30 -04:00
Gerard de Melo	7b3fe6b325	doc: fix typo in FAQ PR #1248	2019-04-16 08:32:30 -04:00
Max Horn	b3dd3ae203	ignore/types: add GAP Add support for file types used by the GAP language, a research system computational discrete algebra, see <https://www.gap-system.org> PR #1249	2019-04-16 08:31:58 -04:00
Andrew Gallant	f3083e4574	readme: remove brew tap instructions The brew tap isn't really needed any more, since SIMD is now automatically enabled in all binaries.	2019-04-15 18:32:33 -04:00
Andrew Gallant	d03e30707e	pkg: update brew tap to 11.0.0	2019-04-15 18:32:10 -04:00
Andrew Gallant	d7f57d9aab	ripgrep: release 11.0.0	2019-04-15 18:09:40 -04:00
Andrew Gallant	1a2a24ea74	grep: release 0.2.4	2019-04-15 18:03:46 -04:00
Andrew Gallant	d66610b295	grep-cli: release 0.1.2	2019-04-15 18:02:44 -04:00
Andrew Gallant	019ae1989b	grep-printer: release 0.1.2	2019-04-15 18:00:49 -04:00
Andrew Gallant	36d3f235dc	grep-searcher: release 0.1.4	2019-04-15 17:59:22 -04:00
Andrew Gallant	79018eb693	grep-pcre2: release 0.1.3	2019-04-15 17:57:03 -04:00
Andrew Gallant	44cd344438	grep-regex: release 0.1.3	2019-04-15 17:56:04 -04:00
Andrew Gallant	e493e54b9b	grep-matcher: release 0.1.2	2019-04-15 17:53:29 -04:00
Andrew Gallant	8e8215aa65	ignore: release 0.4.7	2019-04-15 17:50:37 -04:00
Andrew Gallant	3fe701498e	doc: add note about --pre-glob There was a performance warning in the --pre docs, but didn't mention --pre-glob as a possible mitigation to it.	2019-04-15 17:47:48 -04:00
Andrew Gallant	e79085e9e4	release: globset 0.4.3	2019-04-15 14:07:03 -04:00
Andrew Gallant	764c197022	complete: fix typo	2019-04-15 07:04:57 -04:00
Andrew Gallant	ef1611b5f5	ripgrep: max-column-preview --> max-columns-preview Credit to @okdana for catching this. This naming is a bit more consistent with the existing --max-columns flag.	2019-04-15 06:51:51 -04:00
Andrew Gallant	45d12abbc5	changelog: small fixups	2019-04-14 20:21:55 -04:00
Andrew Gallant	5fde8391f9	changelog: backfill it I went through every commit since the 0.10.0 release and added anything that I thought was missing.	2019-04-14 20:04:01 -04:00
Marco Herrn	3edb11c513	ignore/types: add additional java files - .jspx for XHTML JSP files - .properties for Java Properties files (resource bundles, etc.) Closes #1242	2019-04-14 19:38:24 -04:00
Andrew Gallant	ed144be775	ci: bump MSRV to 1.34.0	2019-04-14 19:29:27 -04:00
Andrew Gallant	967e7ad0de	ripgrep: add --auto-hybrid-regex flag This flag, when set, will automatically dispatch to PCRE2 if the given regex cannot be compiled by Rust's regex engine. If both engines fail to compile the regex, then both errors are surfaced. Closes #1155	2019-04-14 19:29:27 -04:00
Andrew Gallant	9952ba2068	deps: update glob dev-dependency	2019-04-14 19:29:27 -04:00
Andrew Gallant	b751758d60	deps: update everything	2019-04-14 19:29:27 -04:00
Andrew Gallant	8f14cb18a5	ripgrep: increase pcre2's default JIT stack size The default stack size is 32KB, and this increases it to 10MB. 32KB is pretty paltry in the environments in which ripgrep runs, and 10MB is easily afforded as a maximum size. (The size limit we set for Rust's regex engine is considerably larger.) This was motivated due to the fack that JIT stack limits have been observed to be hit in the wild: https://github.com/Microsoft/vscode/issues/64606	2019-04-14 19:29:27 -04:00
Andrew Gallant	da9d720431	ripgrep: add --pcre2-version flag This flag will output details about the version of PCRE2 that ripgrep is using (if any).	2019-04-14 19:29:27 -04:00
Andrew Gallant	a9d71a0368	pcre2: add a few re-exports This adds the top-level is_jit_available and version free functions from the underlying pcre2 crate, and also forwards the max_jit_stack_size option.	2019-04-14 19:29:27 -04:00
Andrew Gallant	f3646242cc	deps: use pcre2 0.2.0 This comes with PCRE 10.32 and a few new options we'll use in subsequent commits.	2019-04-14 19:29:27 -04:00
Andrew Gallant	601f212a0b	ripgrep: add -I as a short option for --no-filename This flag is commonly used in pipelines and it can be annoying to write it out every time you need it. Ideally, we would use -h for this to match GNU grep, but -h is used to print help output. Closes #1185	2019-04-14 19:29:27 -04:00
Andrew Gallant	5a565354f8	versioning: next version will be ripgrep 11 This sets up the release announcement and briefly describes the versioning change. The actual version change itself won't happen until the release. Closes #1172	2019-04-14 19:29:27 -04:00
Andrew Gallant	2a6532ae71	doc: note cases of exorbitant memory usage Fixes #1189	2019-04-14 19:29:27 -04:00
Andrew Gallant	ece1f50cfe	printer: support previews for long lines This commit adds support for showing a preview of long lines. While the default still remains as completely suppressing the entire line, this new functionality will show the first N graphemes of a matching line, including the number of matches that are suppressed. This was unfortunately a fairly invasive change to the printer that required a bit of refactoring. On the bright side, the single line and multi-line coloring are now more unified than they were before. Closes #1078	2019-04-14 19:29:27 -04:00
Andrew Gallant	a7d26c8f14	binary: rejigger ripgrep's handling of binary files This commit attempts to surface binary filtering in a slightly more user friendly way. Namely, before, ripgrep would silently stop searching a file if it detected a NUL byte, even if it had previously printed a match. This can lead to the user quite reasonably assuming that there are no more matches, since a partial search is fairly unintuitive. (ripgrep has this behavior by default because it really wants to NOT search binary files at all, just like it doesn't search gitignored or hidden files.) With this commit, if a match has already been printed and ripgrep detects a NUL byte, then it will print a warning message indicating that the search stopped prematurely. Moreover, this commit adds a new flag, --binary, which causes ripgrep to stop filtering binary files, but in a way that still avoids dumping binary data into terminals. That is, the --binary flag makes ripgrep behave more like grep's default behavior. For files explicitly specified in a search, e.g., `rg foo some-file`, then no binary filtering is applied (just like no gitignore and no hidden file filtering is applied). Instead, ripgrep behaves as if you gave the --binary flag for all explicitly given files. This was a fairly invasive change, and potentially increases the UX complexity of ripgrep around binary files. (Before, there were two binary modes, where as now there are three.) However, ripgrep is now a bit louder with warning messages when binary file detection might otherwise be hiding potential matches, so hopefully this is a net improvement. Finally, the `-uuu` convenience now maps to `--no-ignore --hidden --binary`, since this is closer to the actualy intent of the `--unrestricted` flag, i.e., to reduce ripgrep's smart filtering. As a consequence, `rg -uuu foo` should now search roughly the same number of bytes as `grep -r foo`, and `rg -uuua foo` should search roughly the same number of bytes as `grep -ra foo`. (The "roughly" weasel word is used because grep's and ripgrep's binary file detection might differ somewhat---perhaps based on buffer sizes---which can impact exactly what is and isn't searched.) See the numerous tests in tests/binary.rs for intended behavior. Fixes #306, Fixes #855	2019-04-14 19:29:27 -04:00
Andrew Gallant	bd222ae93f	regex: fix HIR analysis bug An alternate can be empty at this point, so we must handle it. We didn't before because the regex engine actually disallows empty alternates, however, this code runs before the regex compiler rejects the regex.	2019-04-14 19:29:27 -04:00
hupfdule	4359d8aac0	ignore/types: add more extensions for xml This includes: .dtd for Document Type Definitions .xsl and .xslt for XSL Transformation descriptions .xsd for XML Schema definitions .xjb for JAXB bindings .rng for Relax NG files *.sch for Schematron files PR #1243	2019-04-09 15:17:57 -04:00
tonypai	308819fb1f	ignore/types: add lock files Treat anything with a `.lock` extension as a lock file, with an extra rule or two for special cases, e.g., package-lock.json.	2019-04-09 10:24:48 -04:00
Andrew Gallant	09108b7fda	regex: make multi-literal searcher faster This makes the case of searching for a dictionary of a very large number of literals much much faster. (~10x or so.) In particular, we achieve this by short-circuiting the construction of a full regex when we know we have a simple alternation of literals. Building the regex for a large dictionary (>100,000 literals) turns out to be quite slow, even if it internally will dispatch to Aho-Corasick. Even that isn't quite enough. It turns out that even parsing such a regex is quite slow. So when the -F/--fixed-strings flag is set, we short circuit regex parsing completely and jump straight to Aho-Corasick. We aren't quite as fast as GNU grep here, but it's much closer (less than 2x slower). In general, this is somewhat of a hack. In particular, it seems plausible that this optimization could be implemented entirely in the regex engine. Unfortunately, the regex engine's internals are just not amenable to this at all, so it would require a larger refactoring effort. For now, it's good enough to add this fairly simple hack at a higher level. Unfortunately, if you don't pass -F/--fixed-strings, then ripgrep will be slower, because of the aforementioned missing optimization. Moreover, passing flags like `-i` or `-S` will cause ripgrep to abandon this optimization and fall back to something potentially much slower. Again, this fix really needs to happen inside the regex engine, although we might be able to special case -i when the input literals are pure ASCII via Aho-Corasick's `ascii_case_insensitive`. Fixes #497, Fixes #838	2019-04-07 19:11:03 -04:00
Andrew Gallant	743d64f2e4	deps: update to clap 2.33	2019-04-06 10:35:08 -04:00

1 2 3 4 5 ...

1308 Commits