krak/zstd - zstd - Gitea: Git with a cup of tea

krak/zstd

mirror of https://github.com/facebook/zstd.git synced 2025-07-06 15:45:37 +02:00

Author	SHA1	Message	Date
Nick Terrell	a3c3a38b9b	[lazy] Skip over incompressible data Every 256 bytes the lazy match finders process without finding a match, they will increase their step size by 1. So for bytes [0, 256) they search every position, for bytes [256, 512) they search every other position, and so on. However, they currently still insert every position into their hash tables. This is different from fast & dfast, which only insert the positions they search. This PR changes that, so now after we've searched 2KB without finding any matches, at which point we'll only be searching one in 9 positions, we'll stop inserting every position, and only insert the positions we search. The exact cutoff of 2KB isn't terribly important, I've just selected a cutoff that is reasonably large, to minimize the impact on "normal" data. This PR only adds skipping to greedy, lazy, and lazy2, but does not touch btlazy2. \| Dataset \| Level \| Compiler \| CSize ∆ \| Speed ∆ \| \|---------\|-------\|--------------\|---------\|---------\| \| Random \| 5 \| clang-14.0.6 \| 0.0% \| +704% \| \| Random \| 5 \| gcc-12.2.0 \| 0.0% \| +670% \| \| Random \| 7 \| clang-14.0.6 \| 0.0% \| +679% \| \| Random \| 7 \| gcc-12.2.0 \| 0.0% \| +657% \| \| Random \| 12 \| clang-14.0.6 \| 0.0% \| +1355% \| \| Random \| 12 \| gcc-12.2.0 \| 0.0% \| +1331% \| \| Silesia \| 5 \| clang-14.0.6 \| +0.002% \| +0.35% \| \| Silesia \| 5 \| gcc-12.2.0 \| +0.002% \| +2.45% \| \| Silesia \| 7 \| clang-14.0.6 \| +0.001% \| -1.40% \| \| Silesia \| 7 \| gcc-12.2.0 \| +0.007% \| +0.13% \| \| Silesia \| 12 \| clang-14.0.6 \| +0.011% \| +22.70% \| \| Silesia \| 12 \| gcc-12.2.0 \| +0.011% \| -6.68% \| \| Enwik8 \| 5 \| clang-14.0.6 \| 0.0% \| -1.02% \| \| Enwik8 \| 5 \| gcc-12.2.0 \| 0.0% \| +0.34% \| \| Enwik8 \| 7 \| clang-14.0.6 \| 0.0% \| -1.22% \| \| Enwik8 \| 7 \| gcc-12.2.0 \| 0.0% \| -0.72% \| \| Enwik8 \| 12 \| clang-14.0.6 \| 0.0% \| +26.19% \| \| Enwik8 \| 12 \| gcc-12.2.0 \| 0.0% \| -5.70% \| The speed difference for clang at level 12 is real, but is probably caused by some sort of alignment or codegen issues. clang is significantly slower than gcc before this PR, but gets up to parity with it. I also measured the ratio difference for the HC match finder, and it looks basically the same as the row-based match finder. The speedup on random data looks similar. And performance is about neutral, without the big difference at level 12 for either clang or gcc.	2023-03-20 11:18:29 -07:00
daniellerozenblit	53bad103ce	patch-from speed optimization (#3545 ) * patch-from speed optimization: only load portion of dictionary into normal matchfinders * test regression for x8 multiplier * fix off-by-one error for bit shift bound * restrict patchfrom speed optimization to strategy < ZSTD_btultra * update results.csv * update regression test	2023-03-14 20:36:56 -04:00
Yonatan Komornik	a91e91d614	[Bugfix] row hash tries to match position 0 (#3548 ) #3543 decreases the size of the tagTable by a factor of 2, which requires using the first tag position in each row for head position instead of a tag. Although position 0 stopped being a valid match, it still persisted in mask calculation resulting in the matches loops possibly terminating before it should have. The fix skips position 0 to solve this problem.	2023-03-13 10:00:03 -07:00
Yonatan Komornik	33e39094e7	Reduce RowHash's tag space size by x2 (#3543 ) Allocate half the memory for tag space, which means that we get one less slot for an actual tag (needs to be used for next position index). The results is a slight loss in compression ratio (up to 0.2%) and some regressions/improvements to speed depending on level and sample. In turn, we get to save 16% of the hash table's space (5 bytes per entry instead of 6 bytes per entry).	2023-03-10 14:15:04 -08:00
Yann Collet	ebba9ff425	update regression results	2023-01-03 14:04:23 -08:00
daniellerozenblit	1c818e3a0a	Merge pull request #3302 from daniellerozenblit/optimal-huff-depth-speed Optimal huff depth speed improvements	2023-01-03 12:51:51 -05:00
Danielle Rozenblit	87becc567d	update regression results.csv	2023-01-03 08:41:40 -08:00
Danielle Rozenblit	c26f348dc8	fix CI errors	2022-12-20 12:43:46 -08:00
Danielle Rozenblit	482689b995	huf log speed optimization: unidirectional scan of logs + break when regressing	2022-12-20 12:27:38 -08:00
W. Felix Handte	8927f985ff	Update Copyright Headers 'Facebook' -> 'Meta Platforms' ``` for f in $(find . $ -path ./.git -o -path ./tests/fuzz/corpora $ -prune -o -type f); do sed -i 's/Facebook, Inc\./Meta Platforms, Inc. and affiliates./' $f; done ```	2022-12-20 12:37:57 -05:00
Elliot Gorokhovsky	bb3c01c853	Migrate other test usages of boolean LDM flag to paramSwitch enum	2022-11-21 16:20:38 -05:00
Danielle Rozenblit	75cd42afd7	Update regression results and better variable naming for HUF_cardinality	2022-10-14 13:37:19 -07:00
Danielle Rozenblit	117fddcd81	Update regression results	2022-10-12 13:49:44 -07:00
W. Felix Handte	1bc8019e10	Update results.csv	2022-05-11 10:27:35 -07:00
Elliot Gorokhovsky	3be9a81e46	Update results.csv	2022-05-04 16:05:37 -04:00
Yann Collet	cc7d23bcec	Merge pull request #2965 from facebook/offbase Converge sumtype (offset \| repcode) numeric representation towards offBase	2022-01-24 15:47:42 -08:00
Yann Collet	5595aec629	updated regression results	2022-01-07 15:08:06 -08:00
Yann Collet	03903f5701	fixed minor compression difference in btlazy2 subtle dependency on sumtype numeric representation	2021-12-29 18:51:03 -08:00
Yann Collet	01adddc3e0	update regression results	2021-12-16 20:43:23 -08:00
Yann Collet	7c7b9244d6	update regression results	2021-12-16 16:07:54 -08:00
Yann Collet	9a32492730	updated regression results.csv	2021-12-16 14:39:30 -08:00
W. Felix Handte	450fca9704	Update Regression Tests w/ New Sizes	2021-12-13 17:29:32 -05:00
Felix Handte	23c1a2d260	Merge pull request #2774 from felixhandte/zstd-dfast-pipelined-single Pipelined Implementation of ZSTD_dfast	2021-10-13 16:38:43 -04:00
Nick Terrell	c6c482fe07	[binary-tree] Fix underflow of nbCompares Fix underflow of `nbCompares` by switching to an `int` and comparing `nbCompares > 0`. This is a minimal fix, because I don't want to change the logic. These loops seem to be doing `nbCompares + 1` comparisons. The bug was reported by Dan Carpenter and found by Smatch static checker. https://lore.kernel.org/all/20211008063704.GA5370@kili/	2021-10-08 13:22:55 -07:00
W. Felix Handte	c2c32839dc	Update results.csv	2021-10-05 16:18:00 -04:00
Sen Huang	9360367371	Update regression test	2021-09-28 08:29:11 -07:00
Sen Huang	b8fd6bf30c	Skip most long matches in lazy hash table update	2021-09-28 08:19:39 -07:00
senhuang42	b5c35d7ea3	Use new paramSwitch enum for LCM, row matchfinder, and block splitter	2021-09-21 14:22:02 -04:00
Nick Terrell	8bf699aa59	[build] Add support for ASM files in Make + CMake * Extract out common portion of `lib/Makefile` into `lib/libzstd.mk`. Most relevantly, the way we find library files. * Use `lib/libzstd.mk` in the other Makefiles instead of repeating the same code. * Add a test `tests/test-variants.sh` that checks that the builds of `make -C programs allVariants` are correct, and run it in Actions. * Adds support for ASM files in the CMake build. The Meson build is not updated because it lists every file in zstd, and supports ASM off the bat, so the Huffman ASM commit will just add the ASM file to the list. The Visual Studios build is not updated because I'm not adding ASM support to Visual Studios yet.	2021-09-17 14:13:53 -07:00
Yann Collet	fd94b9d1c9	Merge branch 'dev' into opt_investigation	2021-09-14 01:15:51 -07:00
Sen Huang	d45d0ad9d8	Update regression test	2021-09-13 12:41:02 -04:00
Yann Collet	b6b2855b80	updated regression tests	2021-09-12 10:22:35 -07:00
Yann Collet	f58e63bee7	Merge branch 'dev' into opt_investigation	2021-09-12 01:42:49 -07:00
Felix Handte	d68aa19a2f	Merge pull request #2749 from felixhandte/zstd-fast-pipelined Pipelined Implementation of ZSTD_fast (~+5% Speed)	2021-09-09 17:05:30 -04:00
Yann Collet	4f0b1b9ee5	update regression tests	2021-09-08 14:37:42 -07:00
Yann Collet	b096a5c626	updated regression tests	2021-09-07 09:55:14 -07:00
Yann Collet	27a8bbe265	new initializer for ll price	2021-09-03 16:07:31 -07:00
Yann Collet	40e44bd56d	updated regression tests	2021-09-01 13:26:39 -07:00
W. Felix Handte	b0977e4ed2	Update results.csv	2021-09-01 14:45:00 -04:00
Sen Huang	539b3aab9b	Optimize 32-bit VecMask_next()	2021-08-04 17:14:58 -04:00
senhuang42	e411040ea1	Add 64 row entry support for lazy	2021-08-04 16:19:12 -04:00
senhuang42	aa1957477b	Improve Huffman sorting algorithm	2021-08-04 12:43:34 -04:00
senhuang42	939276cd0c	Add ldm and block splitter auto-enable to old api	2021-05-24 13:09:32 -04:00
Nick Terrell	c468e1b9cb	[test][regression] Update results.csv Changing the repcode search slightly, and changing when zstd is in ext-dict vs. prefix mode in edge cases, slightly changes the compressed results.	2021-05-18 19:46:37 -07:00
Nick Terrell	ce615d7fba	[test][regression] Update results.csv The LDM change in PR #2602 changed the algorithm slightly. The compressed size is generally positive, and when it is worse, it is only a few bytes.	2021-05-05 19:00:36 -07:00
senhuang42	33abda4400	Update results.csv	2021-04-26 15:55:23 -04:00
Sen Huang	4d63d6e8aa	Update results.csv, add Row hash to regression test	2021-04-07 10:31:41 -07:00
sen	e38124555e	Fix dictionary force reloading clevel selection (#2570 ) * Move cdict clevel override to before localdict init * Update results.csv after dict load changes	2021-04-06 15:35:09 -04:00
Nick Terrell	a494308ae9	[copyright][license] Switch to yearless copyright and some cleanup in the linux-kernel files * Switch to yearless copyright per FB policy * Fix up SPDX-License-Identifier lines in `contrib/linux-kernel` sources * Add zstd copyright/license header to the `contrib/linux-kernel` sources * Update the `tests/test-license.py` to check for yearless copyright * Improvements to `tests/test-license.py` * Check `contrib/linux-kernel` in `tests/test-license.py`	2021-03-30 10:30:43 -07:00
Sen Huang	bbbd578f45	Update results.csv	2021-03-25 11:16:37 -07:00

1 2

85 Commits