krak/zstd - zstd - Gitea: Git with a cup of tea

krak/zstd

mirror of https://github.com/facebook/zstd.git synced 2025-03-06 16:56:49 +02:00

Author	SHA1	Message	Date
Yann Collet	d23b95d21d	minor refactor for clarity since we can ensure that nbSubBlocks>0	2024-02-26 14:06:34 -08:00
Yann Collet	86db60752d	optimization: bail out faster in presence of incompressible data	2024-02-26 13:27:59 -08:00
Yann Collet	ef82b214ad	nit: comment indentation as reported by @terrelln	2024-02-26 13:23:59 -08:00
Yann Collet	aa8592c532	minor: reformulate nbSubBlocks assignment	2024-02-26 13:21:14 -08:00
Yann Collet	e0412c2062	fix extraneous semicolon ';' as reported by @terrelln	2024-02-26 12:26:54 -08:00
Yann Collet	1fafd0c4ae	fix minor visual static analyzer warning it's a false positive, but change the code nonetheless to make it more obvious to the static analyzer.	2024-02-25 19:45:32 -08:00
Yann Collet	038a8a906b	targetCBlockSize: modified splitting strategy to generate blocks of more regular size notably avoiding to feature a larger first block	2024-02-25 17:39:29 -08:00
Yann Collet	f8372191f5	reduced minimum compressed block size with the intention to match the transport layer size, such as Ethernet and 4G mobile networks.	2024-02-24 01:59:16 -08:00
Yann Collet	f77f634d41	update API documentation	2024-02-24 01:28:17 -08:00
Yann Collet	4b51526412	fix partial block uncompressed	2024-02-24 01:24:58 -08:00
Yann Collet	6719794379	fixed some regressionTests but not all	2024-02-23 18:48:29 -08:00
Yann Collet	0591e7eea1	minor: fix overly cautious conversion warning	2024-02-23 16:05:09 -08:00
Yann Collet	3b40100058	fix long sequences (> 64 KB)	2024-02-23 15:35:12 -08:00
Yann Collet	6b11fc436c	fix issue with incompressible sections	2024-02-23 14:53:56 -08:00
Yann Collet	cc4530924b	speed optimized version of targetCBlockSize note that the size of individual compressed blocks will vary more wildly with this modification. But it seems good enough for a first test, and fix the speed regression issue. Further refinements can be attempted later.	2024-02-23 14:03:26 -08:00
Yann Collet	621a263fb2	Merge pull request #3903 from gruenich/feature/reduce-scope-of-variables Reduce scope of variables	2024-02-22 09:42:02 -08:00
Yann Collet	e62e15df19	fix clangbuild notably -Wconversion and -Wdocumentation	2024-02-20 22:43:22 -08:00
Christoph Grüninger	b921f1aad6	Reduce scope of variables This improves readability, keeps variables local, and prevents the unintended use (e.g. typo) later on. Found by Cppcheck (variableScope)	2024-02-11 22:00:03 +01:00
Yann Collet	b0e8580dc7	fix fuzz issue 5131069967892480	2024-02-08 16:38:20 -08:00
Yann Collet	22574d848d	fix issue 5921623844651008 ossfuzz managed to create a scenario which triggers an `assert`. This fixes it, by giving +1 more space for the backward search pass.	2024-02-06 13:01:14 -08:00
Yann Collet	b88c593d8f	added or updated code comments as suggested by @terrelln, to make the code of the optimal parser a bit more understandable.	2024-02-05 18:32:25 -08:00
Yann Collet	6c35fb2e8c	fix msan warnings	2024-02-05 01:21:06 -08:00
Yann Collet	641749fc09	fix uasan dictionary_stream_round_trip fuzz test	2024-02-05 00:36:10 -08:00
Yann Collet	fe2e2ad36d	use ZSTD_memcpy() which can be redirected in Linux kernel mode	2024-02-03 19:57:38 -08:00
Yann Collet	0ae21d8c31	removed trace control	2024-02-03 19:32:59 -08:00
Yann Collet	5474edbe60	fixed wrong assert by introducing ZSTD_OPT_SIZE	2024-02-03 19:31:53 -08:00
Yann Collet	e5af24c5fa	fixed wrong assert	2024-02-03 17:48:29 -08:00
Yann Collet	8168a451e5	minor optimization, mostly for clarity	2024-02-03 17:26:47 -08:00
Yann Collet	d31018e223	finally, a version that generalizes well While it's not always strictly a win, it's a win for files that see a noticeably compression ratio increase, while it's a very small noise for other files. Downside is, this patch is less efficient for 32-bit arrays of integer than the previous patch which was introducing losses for other files, but it's still a net improvement on this scenario.	2024-02-03 14:26:18 -08:00
Yann Collet	0166b2ba80	modification: differentiate literal update at pos+1 helps when litlen==1 is cheaper than litlen==0 works great on pathological arr[u32] examples but doesn't generalize well on other files. silesia/x-ray is amoung the most negatively affected ones.	2024-01-31 11:20:43 -08:00
Yann Collet	4683667785	refactor optimal parser store stretches as intermediate solution instead of sequences. makes it possible to link a solution to a predecessor.	2024-01-31 02:51:46 -08:00
Yann Collet	de10f56be2	improve high compression ratio for file like #3793 this works great for 32-bit arrays, notably the synthetic ones, with extreme regularity, unfortunately, it's not universal, and in some cases, it's a loss. Crucially, on average, it's a loss on silesia. The most negatively impacted file is x-ray. It deserves an investigation before suggesting it as an evolution.	2024-01-29 23:25:24 -08:00
Yann Collet	e6f4b46493	playTests.sh does no longer needs grep -E it makes the test script more portable across posix systems because `grep -E` is not guaranteed while `grep` is fairly common.	2024-01-15 11:16:46 -08:00
Yann Collet	a07cae3976	Merge pull request #3847 from michoecho/fix_nullptr_deref_in_createCDict Fix a nullptr dereference in ZSTD_createCDict_advanced2()	2023-12-30 13:23:39 -08:00
Elliot Gorokhovsky	c6cabf9441	Make offload API compatible with static CCtx (#3854 ) * Add ZSTD_CCtxParams_registerSequenceProducer() to public API * add unit test * add docs to zstd.h * nits * Add ZSTDLIB_STATIC_API prefix * Add asserts	2023-12-28 14:48:46 -05:00
Michał Chojnowski	9a3b17c4d6	Fix a nullptr dereference in ZSTD_createCDict_advanced2() If the relevant allocation returns NULL, ZSTD_createCDict_advanced_internal() will return NULL. But ZSTD_createCDict_advanced2() doesn't check for this and attempts to use the returned pointer anyway, which leads to a segfault.	2023-12-16 13:02:18 +01:00
Elliot Gorokhovsky	d151a4880b	Move offload API params into ZSTD_CCtx_params	2023-11-27 08:11:01 -08:00
Elliot Gorokhovsky	809c7eb6bf	Refactor ZSTD_sequenceProducer_F typedef to ZSTD_sequenceProducer_F*	2023-11-27 06:56:37 -08:00
Nick Terrell	8193250615	Modernize macros to use `do { } while (0)` This PR introduces no functional changes. It attempts to change all macros currently using `{ }` or some variant of that to to `do { } while (0)`, and introduces trailing `;` where necessary. There were no bugs found during this migration. The bug in Visual Studios warning on this has been fixed since VS2015. Additionally, we have several instances of `do { } while (0)` which have been present for several releases, so we don't have to worry about breaking peoples builds. Fixes Issue #3830.	2023-11-21 20:05:17 -05:00
Yann Collet	6b3d12fe54	Merge pull request #3820 from facebook/xxh082 update xxhash library to v0.8.2	2023-11-21 09:11:40 -08:00
Nick Terrell	dd4de1dd7a	[huf] Fix null pointer addition `HUF_DecompressFastArgs_init()` was adding 0 to NULL. Fix it by exiting early for empty outputs. This is no change in behavior, because the function was already exiting 0 in this case, just slightly later.	2023-11-20 17:13:01 -05:00
Nick Terrell	5ab78c0418	[huf] Improve fast C & ASM performance on small data * Rename `ilimit` to `ilowest` and set it equal to `src` instead of `src + 6 + 8`. This is safe because the fast decoding loops guarantee to never read below `ilowest` already. This allows the fast decoder to run for at least two more iterations, because it consumes at most 7 bytes per iteration. * Continue the fast loop all the way until the number of safe iterations is 0. Initially, I thought that when it got towards the end, the computation of how many iterations of safe might become expensive. But it ends up being slower to have to decode each of the 4 streams individually, which makes sense. This drastically speeds up the Huffman decoder on the `github` dataset for the issue raised in #3762, measured with `zstd -b1e1r github/`. \| Decoder \| Speed before \| Speed after \| \|----------\|--------------\|-------------\| \| Fallback \| 477 MB/s \| 477 MB/s \| \| Fast C \| 384 MB/s \| 492 MB/s \| \| Assembly \| 385 MB/s \| 501 MB/s \| We can also look at the speed delta for different block sizes of silesia using `zstd -b1e1r silesia.tar -B#`. \| Decoder \| -B1K ∆ \| -B2K ∆ \| -B4K ∆ \| -B8K ∆ \| -B16K ∆ \| -B32K ∆ \| -B64K ∆ \| -B128K ∆ \| \|----------\|--------\|--------\|--------\|--------\|---------\|---------\|---------\|----------\| \| Fast C \| +11.2% \| +8.2% \| +6.1% \| +4.4% \| +2.7% \| +1.5% \| +0.6% \| +0.2% \| \| Assembly \| +12.5% \| +9.0% \| +6.2% \| +3.6% \| +1.5% \| +0.7% \| +0.2% \| +0.03% \|	2023-11-20 17:13:01 -05:00
Nick Terrell	c7269add7e	[huf] Improve fast huffman decoding speed in linux kernel gcc in the linux kernel was not unrolling the inner loops of the Huffman decoder, which was destroying decoding performance. The compiler was generating crazy code with all sorts of branches. I suspect because of Spectre mitigations, but I'm not certain. Once the loops were manually unrolled, performance was restored. Additionally, when gcc couldn't prove that the variable left shift in the 4X2 decode loop wasn't greater than 63, it inserted checks to verify it. To fix this, mask `entry.nbBits & 0x3F`, which allows gcc to eliete this check. This is a no op, because `entry.nbBits` is guaranteed to be less than 64. Lastly, introduce the `HUF_DISABLE_FAST_DECODE` macro to disable the fast C loops for Issue #3762. So if even after this change, there is a performance regression, users can opt-out at compile time.	2023-11-20 14:56:46 -05:00
Nick Terrell	e122fcbf58	[debug] Don't define g_debuglevel in the kernel We only use this constant when `DEBUGLEVEL>=2`, but we get -Werror=pedantic errors for empty translation units, so still define it except in kernel environments. Backport from the kernel: https://lore.kernel.org/lkml/20230616144400.172683-1-ben.dooks@codethink.co.uk/	2023-11-17 09:54:10 -08:00
Yann Collet	59dcc47579	update license text	2023-11-16 16:19:25 -08:00
Yann Collet	3fd5f9f52d	fix the copyright linter	2023-11-13 15:50:42 -08:00
Yann Collet	592b1acb18	update xxhash to v0.8.2 List of updates : https://github.com/Cyan4973/xxHash/releases/tag/v0.8.2 This is also a preparation task before taking care of #3819	2023-11-13 15:42:07 -08:00
Yann Collet	24dabde507	revert to manually defining DTable thus avoiding the analyzer and ubsan to associate DTable to a size of 1.	2023-10-18 22:45:57 -07:00
Yann Collet	d988e00a7f	baby-step towards solving flexArray issue #3785 the flexArray in structure FSE_DecompressWksp is just a way to derive a pointer easily, without risk/complexity of calculating it manually. Not sure if this change is good enough to avoid ubsan warnings though.	2023-10-18 16:21:39 -07:00
Yann Collet	6bb1688c1a	extended the fix to ZSTDMT's Buffer Pool	2023-10-08 00:25:17 -07:00

1 2 3 4 5 ...

4522 Commits