ripgrep

mirror of https://github.com/BurntSushi/ripgrep.git synced 2025-06-30 22:23:44 +02:00

Author	SHA1	Message	Date
Balaji Sivaraman	d57fc58081	termcolor: add underline support This commit adds underline support to the termcolor crate, and exposes it through ripgrep. Fixes #798	2018-02-20 07:10:03 -05:00
Andrew Gallant	361698b90a	ignore: fix improper hidden filtering This commit fixes a bug where `rg --hidden .` would behave differently with respect to ignore filtering than `rg --hidden ./`. In particular, this was due to a bug where the directory name `.` caused the leading `.` in a hidden directory to get stripped, which in turn caused the ignore rules to fail. Fixes #807	2018-02-14 18:16:38 -05:00
Andrew Gallant	8cb5833ef9	argv: update clap to 2.29.4 We use the new AppSettings::AllArgsOverrideSelf to permit all flags to be specified multiple times. This removes the need for our previous work-around where we would enable `multiple` for every flag and then just extract the last value when consuming clap's matches. We also add a couple regression tests that ensure repeated switches and flags work as expected.	2018-02-06 12:07:59 -05:00
Andrew Gallant	c57d0fb4e8	config: add persistent configuration This commit adds support for reading configuration files that change ripgrep's default behavior. The format of the configuration file is an "rc" style and is very simple. It is defined by two rules: 1. Every line is a shell argument, after trimming ASCII whitespace. 2. Lines starting with '#' (optionally preceded by any amount of ASCII whitespace) are ignored. ripgrep will look for a single configuration file if and only if the RIPGREP_CONFIG_PATH environment variable is set and is non-empty. ripgrep will parse shell arguments from this file on startup and will behave as if the arguments in this file were prepended to any explicit arguments given to ripgrep on the command line. For example, if your ripgreprc file contained a single line: --smart-case then the following command RIPGREP_CONFIG_PATH=wherever/.ripgreprc rg foo would behave identically to the following command rg --smart-case foo This commit also adds a new flag, --no-config, that when present will suppress any and all support for configuration. This includes any future support for auto-loading configuration files from pre-determined paths (which this commit does not add). Conflicts between configuration files and explicit arguments are handled exactly like conflicts in the same command line invocation. That is, this command: RIPGREP_CONFIG_PATH=wherever/.ripgreprc rg foo --case-sensitive is exactly equivalent to rg --smart-case foo --case-sensitive in which case, the --case-sensitive flag would override the --smart-case flag. Closes #196	2018-02-04 10:40:20 -05:00
Balaji Sivaraman	f007f940c5	search: add support for searching compressed files This commit adds opt-in support for searching compressed files during recursive search. This behavior is only enabled when the `-z/--search-zip` flag is passed to ripgrep. When enabled, a limited set of common compression formats are recognized via file extension, and a new process is spawned to perform the decompression. ripgrep then searches the stdout of that spawned process. Closes #539	2018-01-30 09:13:53 -05:00
kennytm	8514d4fbb4	termcolor: tweak reset escape Write `Ansi::reset()` using `\x1b[0m` instead of `\x1b[m`. This works around an AppVeyor bug: https://github.com/appveyor/ci/issues/1824	2018-01-29 14:14:55 -05:00
dana	58bdc366ec	printer: add --passthru flag The --passthru flag causes ripgrep to print every line, even if the line does not contain a match. This is a response to the common pattern of `^\|foo` to match every line, while still highlighting things like `foo`. Fixes #740	2018-01-11 18:45:51 -05:00
Balaji Sivaraman	14779ed0ea	ux: suggest --fixed-strings flag If a regex syntax error occurs, then ripgrep will suggest using the --fixed-strings flag. Fixes #727	2018-01-01 11:24:46 -05:00
Balaji Sivaraman	ba1023e1e4	printer: add support for line number alignment Closes #544	2018-01-01 09:00:31 -05:00
Igor Gnatenko	a5855a5d73	couple of trivial fixes to make clippy a bit more happy (#704) clippy: fix a few lints The fixes are: * Use single quotes for single-character * Use ticks in documentation when necessary. * Just bow to clippy's wisdom.	2017-12-30 16:06:16 -05:00
dana	d73a75d6cd	Omit context separators when using a contextless option like -c or -l Fixes #693	2017-11-29 12:55:42 -05:00
Martin Lindhe	c794ef2f04	fix some typos	2017-11-01 07:10:54 -04:00
Andrew Gallant	2a14bf2249	printer: fix colors on empty matches This fixes a bug where a "match" color escape was erroneously emitted after the new line character. This is because `^` is actually allowed to match after the end of a trailing new line, which means `^$` matches both before and after the trailing new line when multiline mode is enabled. The trailing match was causing the phantom escape sequence to appear, which we don't want. Incidentally, this is the root cause of #441 as well, although this commit doesn't fix that issue, since the line itself is printed before we detect the phantom match. Fixes #599	2017-10-21 22:40:10 -04:00
Evgeny Kulikov	f887bc1f86	printer: --only-matching works with --replace When -o/--only-matching is used with -r/--replace, the replacement works as expected. This is not a breaking change because the flags were previously set to conflict.	2017-10-20 20:58:27 -04:00
Sebastian Nowicki	712311fdc6	Don't create command until we know we can test it For regression 210 we may not actually need to test anything if the file system doesn't support creating files with invalid UTF-8 bytes. Don't create the command until we know there will be an assertion.	2017-10-20 20:51:12 -04:00
Sebastian Nowicki	8dc513b5d2	Skip regression 210 test on APFS APFS does not support creating filenames with invalid UTF-8 byte codes, thus this test doesn't make sense. Skip it on file systems where this shouldn't be possible. Fixes #559	2017-10-20 20:51:12 -04:00
Andrew Gallant	73c9ac4da5	integration tests: ignore regression_428 on Windows The test is severely constrained to the specific ANSI formatting of ripgrep in accordance with its default color scheme. The default color scheme on Windows changed, which caused the test to fail. For now, just disable the test on Windows.	2017-08-23 17:49:40 -04:00
dana	40bacbcd7c	Add -x/--line-regexp (#520) add -x/--line-regexp flag	2017-08-09 06:53:35 -04:00
dana	b7c3cf314d	Add test for option-arguments with leading hyphens	2017-07-30 17:55:24 -04:00
dana	6dce04963d	Allow options with non-numeric arguments to accept leading hyphens in arguments (fixes #568)	2017-07-30 17:55:24 -04:00
Peter S Panov	4047d9db71	add --iglob flag Working with Chris Stadler, implemented https://github.com/BurntSushi/ripgrep/issues/163#issuecomment-300012592	2017-07-03 06:52:52 -04:00
Evan.Mattiza	06393f888c	fix word boundary w/ capture group fixes BurntSushi/ripgrep#506. Word boundary search as arg had unexpected behavior. added capture group to regex to encapsulate 'or' option search and prevent expansion and partial boundary finds. Signed-off-by: Evan.Mattiza <emattiza@gmail.com>	2017-06-15 06:55:55 -04:00
Andrew Gallant	112b3c5e0a	Fix another bug in -o/--only-matching. The handling of the -o/--only-matching was incorrect. We cannot ever re-run regexes on a subset of a matched line, because it doesn't take into account zero width assertions on the edges of the regex. This occurs whenever an end user uses an assertion explicitly, but also occurs when one is used implicitly, e.g., with the `-w` flag. This instead reuses the initial matched range from the first regex match. We also apply this fix to coloring. Fixes #493	2017-05-29 09:51:58 -04:00
Marc Tiehuis	229b8e3b33	Make --quiet flag apply when using --files option Fixes #483.	2017-05-19 20:00:47 -04:00
Roman Proskuryakov	362abed44a	Fix reiteration of the first found match with --only-matching flag Fixes #451	2017-04-21 08:11:55 -04:00
Andrew Gallant	7ad23e5565	Use for_label_no_replacement. This will cause certain unsupported legacy encodings to act as if they don't exist, in order to avoid using an unhelpful (in the context of file searching) "replacement" encoding. Kudos to @hsivonen for chirping about this!	2017-04-12 18:14:23 -04:00
Marc Tiehuis	66efbad871	Add dfa-size-limit and regex-size-limit arguments Fixes #362.	2017-04-12 18:14:23 -04:00
Roman Proskuryakov	90a11dec5e	Add `-o/--only-matching` flag. Currently, the `--only-matching` flag conflicts with the `--replace` flag. In the future, this restriction may be relaxed. Fixes #34	2017-04-09 08:47:35 -04:00
Roman Proskuryakov	aed3ccb9c7	Improves Printer, fixes some bugs	2017-03-31 14:44:13 -04:00
Roman Proskuryakov	01deac9427	Add -0 shortcut for --null Fixes #419	2017-03-28 18:37:40 -04:00
Ralf Jung	d352b79294	Add new -M/--max-columns option. This permits setting the maximum line width with respect to the number of bytes in a line. Omitted lines (whether part of a match, replacement or context) are replaced with a message stating that the line was elided. Fixes #129	2017-03-12 21:21:28 -04:00
Andrew Gallant	8bbe58d623	Add support for additional text encodings. This includes, but is not limited to, UTF-16, latin-1, GBK, EUC-JP and Shift_JIS. (Courtesy of the `encoding_rs` crate.) Specifically, this feature enables ripgrep to search files that are encoded in an encoding other than UTF-8. The list of available encodings is tied directly to what the `encoding_rs` crate supports, which is in turn tied to the Encoding Standard. The full list of available encodings can be found here: https://encoding.spec.whatwg.org/#concept-encoding-get This pull request also introduces the notion that text encodings can be automatically detected on a best effort basis. Currently, the only support for this is checking for a UTF-16 BOM. In all other cases, a text encoding of `auto` (the default) implies a UTF-8 or ASCII compatible source encoding. When a text encoding is otherwise specified, it is unconditionally used for all files searched. Since ripgrep's regex engine is fundamentally built on top of UTF-8, this feature works by transcoding the files to be searched from their source encoding to UTF-8. This transcoding only happens when: 1. `auto` is specified and a non-UTF-8 encoding is detected. 2. A specific encoding is given by end users (including UTF-8). When transcoding occurs, errors are handled by automatically inserting the Unicode replacement character. In this case, ripgrep's output is guaranteed to be valid UTF-8 (excluding non-UTF-8 file paths, if they are printed). In all other cases, the source text is searched directly, which implies an assumption that it is at least ASCII compatible, but where UTF-8 is most useful. In this scenario, encoding errors are not detected. In this case, ripgrep's output will match the input exactly, byte-for-byte. This design may not be optimal in all cases, but it has some advantages: 1. The happy path ("UTF-8 everywhere") remains happy. I have not been able to witness any performance regressions. 2. In the non-UTF-8 path, implementation complexity is kept relatively low. The cost here is transcoding itself. A potentially superior implementation might build decoding of any encoding into the regex engine itself. In particular, the fundamental problem with transcoding everything first is that literal optimizations are nearly negated. Future work should entail improving the user experience. For example, we might want to auto-detect more text encodings. A more elaborate UX might permit end users to specify multiple text encodings, although this seems hard to pull off in an ergonomic way. Fixes #1	2017-03-12 19:54:48 -04:00
Andrew Gallant	6ecffec537	Fix test on Windows. (This is what I get for directly pushing to master.)	2017-03-12 16:07:31 -04:00
Andrew Gallant	80e91a1f1d	Fix leading slash bug when used with `!`. When writing paths like `!/foo` in gitignore files (or when using the -g/--glob flag), the presence of `!` would prevent the gitignore builder from noticing the leading slash, which causes absolute path matching to fail. Fixes #405	2017-03-12 15:51:17 -04:00
Marc Tiehuis	adff43fbb4	Remove clap validator + add max-filesize integration tests	2017-03-08 10:17:18 -05:00
tiehuis	714ae82241	Add `--max-filesize` option to cli The --max-filesize option allows filtering files which are larger than the specified limit. This is potentially useful if one is attempting to search a number of large files without common file-types/suffixes. See #369.	2017-03-08 10:17:18 -05:00
Marc Tiehuis	066f97d855	Add enclosing group to alternations in globs Fixes #391.	2017-03-08 10:13:28 -05:00
Andrew Gallant	7a951f103a	Make --column imply --line-number. Closes #243	2017-01-11 18:53:35 -05:00
Andrew Gallant	8751e55706	Add --path-separator flag. This flag permits setting the path separator used for all file paths printed by ripgrep in normal operation. Fixes #275	2017-01-10 18:16:15 -05:00
Andrew Gallant	97e6873b38	Fix type compose test.	2017-01-07 22:50:38 -05:00
Ian Kerins	ed01e80a79	Provide a mechanism to compose type definitions This extends the syntax of the --type-add flag to allow including the globs of other already defined types. Fixes #83.	2017-01-07 18:14:24 -05:00
Andrew Gallant	b65a8c353b	Add --sort-files flag. When used, parallelism is disabled but the results are sorted by file path. Closes #263	2017-01-06 22:43:59 -05:00
Andrew Gallant	bb70f96743	Fix a non-termination bug. This was a very silly bug. Instead of creating a particular atomic once and cloning it, we created a new value for each worker. Fixes #279	2016-12-12 06:55:49 -05:00
Andrew Gallant	d66812102b	Fix leading hyphen bug by updating clap. Fixes #270	2016-12-06 17:29:34 -05:00
Andrew Gallant	7282706b42	Fix bug reading root symlink. When given an explicit file path on the command line like `foo` where `foo` is a symlink, ripgrep should follow it even if `-L` isn't set. This is consistent with the behavior of `foo/`. Fixes #256	2016-12-05 20:05:57 -05:00
Andrew Gallant	0473df1ef5	Disable Unicode mode for literal regex. When ripgrep detects a literal, it emits them as raw hex escaped byte sequences to Regex::new. This permits literal optimizations for arbitrary byte sequences (i.e., possibly invalid UTF-8). The problem is that Regex::new interprets hex escaped byte sequences as Unicode codepoints by default, but we want them to actually stand for their raw byte values. Therefore, disable Unicode mode. This is OK, since the regex is composed entirely of literals and literal extraction does Unicode case folding. Fixes #251	2016-11-28 18:31:58 -05:00
Andrew Gallant	301a3fd71d	Detect more uppercase literals for --smart-case. This changes the uppercase literal detection for the "smart case" functionality. In particular, a character class is considered to have an uppercase literal if at least one of its ranges starts or stops with an uppercase literal. Fixes #229	2016-11-28 17:57:26 -05:00
Andrew Gallant	03f7605322	Rename --files-without-matches to --files-without-match. This is to be consistent with grep.	2016-11-19 20:15:41 -05:00
Daniel Luz	bd3e7eedb1	Add --files-without-matches flag. Performs the opposite of --files-with-matches: only shows paths of files that contain zero matches. Closes #138	2016-11-19 21:48:59 -02:00
Andrew Gallant	e37f783fc0	Fix issue number mixup. Thanks @bluss!	2016-11-17 20:30:18 -05:00
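
The two rc-file rules described in commit c57d0fb4e8 (one shell argument per line after trimming ASCII whitespace; lines starting with '#' ignored) can be illustrated with a minimal sketch. This is not ripgrep's actual implementation: the function names `parse_rc_args` and `effective_args` are hypothetical, and skipping blank lines is an assumption the commit message does not state.

```rust
/// Hypothetical helper (not ripgrep's code): turn the text of an rc-style
/// config file into a list of arguments.
fn parse_rc_args(contents: &str) -> Vec<String> {
    contents
        .lines()
        // Rule 1: every line is one argument, after trimming ASCII whitespace.
        .map(|line| line.trim_matches(|c: char| c.is_ascii_whitespace()))
        // Rule 2: lines starting with '#' are ignored.
        // Assumption: blank lines are skipped as well.
        .filter(|line| !line.is_empty() && !line.starts_with('#'))
        .map(str::to_string)
        .collect()
}

/// Hypothetical helper: prepend the config arguments to the explicit
/// command-line arguments, so explicit flags come later in the list and
/// can override the configured ones.
fn effective_args(config: &str, cli: &[&str]) -> Vec<String> {
    let mut args = parse_rc_args(config);
    args.extend(cli.iter().map(|s| s.to_string()));
    args
}
```

With a config containing the single line `--smart-case`, `effective_args(config, &["foo", "--case-sensitive"])` yields `--smart-case foo --case-sensitive`, matching the equivalence given in the commit message.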


97 Commits