krak/zstd - zstd - Gitea: Git with a cup of tea

krak/zstd

mirror of https://github.com/facebook/zstd.git synced 2025-03-07 01:10:04 +02:00

Author	SHA1	Message	Date
Yann Collet	39ceef27f9	bump version number to v1.5.4 start preparation for release	2023-01-30 19:06:39 -08:00
Nick Terrell	2f74507bbd	Simplify 32-bit long offsets decoding logic The previous code had an issue when `bitsConsumed == 32` it would read 0 bits for the `ofBits` read, which violates the precondition of `BIT_readBitsFast()`. This can happen when the stream is corrupted. Fix thie issue by always reading the maximum possible number of extra bits. I've measured neutral decoding performance, likely because this branch is unlikely, but this should be faster anyways. And if not, it is only 32-bit decoding, so performance isn't as critical. Credit to OSS-Fuzz	2023-01-30 12:21:42 -08:00
daniellerozenblit	00176638e3	Merge pull request #3460 from daniellerozenblit/fix-long-offsets-resolution-pointer fix long offset resolution	2023-01-30 14:02:51 -05:00
Nick Terrell	b3b43f2893	Fix invalid assert in 32-bit decoding The assert is only correct for valid sequences, so disable it for everything execpt round trip fuzzers.	2023-01-27 14:40:38 -08:00
daniellerozenblit	2bde9fbf85	Update lib/compress/zstd_compress.c Co-authored-by: Nick Terrell <nickrterrell@gmail.com>	2023-01-27 16:58:53 -05:00
Nick Terrell	423a74986f	[fse] Delete unused functions Delete all unused FSE functions, now that we are no longer syncing to/from upstream. This avoids confusion about Zstd's stack usage like in Issue #3453. It also removes dead code, which is always a plus.	2023-01-27 13:15:07 -08:00
Danielle Rozenblit	9e4c66b9e9	record long offsets in ZSTD_symbolEncodingTypeStats_t + add test case	2023-01-27 12:04:29 -08:00
Danielle Rozenblit	814f4bfb99	fix long offset resolution	2023-01-27 08:21:47 -08:00
Nick Terrell	bda947e17a	[huf] Fix bug in fast C decoders The input bounds checks were buggy because they were only breaking from the inner loop, not the outer loop. The fuzzers found this immediately. The fix is to use `goto _out` instead of `break`. This condition can happen on corrupted inputs. I've benchmarked before and after on x86-64 and there were small changes in performance, some positive, and some negative, and they end up about balacing out. Credit to OSS-Fuzz	2023-01-26 14:39:13 -08:00
Yann Collet	efc9ae3480	Merge pull request #3455 from facebook/fix3454 Provide more accurate error codes for busy-loop scenarios	2023-01-25 15:22:51 -08:00
Nick Terrell	8957fef554	[huf] Add generic C versions of the fast decoding loops Add generic C versions of the fast decoding loops to serve architectures that don't have an assembly implementation. Also allow selecting the C decoding loop over the assembly decoding loop through a zstd decompression parameter `ZSTD_d_disableHuffmanAssembly`. I benchmarked on my Intel i9-9900K and my Macbook Air with an M1 processor. The benchmark command forces zstd to compress without any matches, using only literals compression, and measures only Huffman decompression speed: ``` zstd -b1e1 --compress-literals --zstd=tlen=131072 silesia.tar ``` The new fast decoding loops outperform the previous implementation uniformly, but don't beat the x86-64 assembly. Additionally, the fast C decoding loops suffer from the same stability problems that we've seen in the past, where the assembly version doesn't. So even though clang gets close to assembly on x86-64, it still has stability issues. \| Arch \| Function \| Compiler \| Default (MB/s) \| Assembly (MB/s) \| Fast (MB/s) \| \|---------\|----------------\|--------------\|----------------\|-----------------\|-------------\| \| x86-64 \| decompress 4X1 \| gcc-12.2.0 \| 1029.6 \| 1308.1 \| 1208.1 \| \| x86-64 \| decompress 4X1 \| clang-14.0.6 \| 1019.3 \| 1305.6 \| 1276.3 \| \| x86-64 \| decompress 4X2 \| gcc-12.2.0 \| 1348.5 \| 1657.0 \| 1374.1 \| \| x86-64 \| decompress 4X2 \| clang-14.0.6 \| 1027.6 \| 1659.9 \| 1468.1 \| \| aarch64 \| decompress 4X1 \| clang-12.0.5 \| 1081.0 \| N/A \| 1234.9 \| \| aarch64 \| decompress 4X2 \| clang-12.0.5 \| 1270.0 \| N/A \| 1516.6 \|	2023-01-25 13:47:51 -08:00
Yann Collet	db18a62f89	Provide more accurate error codes for busy-loop scenarios fixes #3454	2023-01-25 13:07:53 -08:00
daniellerozenblit	f3255bfeff	Merge pull request #3447 from daniellerozenblit/fuzz-sequence-compression Fuzz large offsets through sequence compression api	2023-01-25 09:27:34 -05:00
Yonatan Komornik	1d636b4ba0	Bug fix redzones by unpoisoning only the intended buffer and not the followup redzone.	2023-01-24 12:54:43 -08:00
Danielle Rozenblit	7d600c628a	fix bound check for ZSTD_copySequencesToSeqStoreNoBlockDelim()	2023-01-24 06:40:40 -08:00
Elliot Gorokhovsky	41682e6293	Merge pull request #3448 from facebook/embg-doc-fix Fix ZSTD_estimate* and ZSTD_initCStream() docs	2023-01-23 15:04:53 -05:00
daniellerozenblit	9116000be6	Merge pull request #3439 from daniellerozenblit/sequence-validation-bug-fix Fix sequence validation and seqStore bounds check	2023-01-23 13:50:37 -05:00
Elliot Gorokhovsky	3bfd3be5fb	Fix ZSTD_estimate* and ZSTD_initCStream() docs Fix the following documentation bugs: * Note that `ZSTD_estimate` functions are not compatible with the external matchfinder API Note that `ZSTD_estimateCStreamSize_usingCCtxParams()` is not compatible with `nbWorkers >= 1` * Remove incorrect warning that the legacy streaming API is incompatible with advanced parameters and/or dictionary compression * Note that `ZSTD_initCStream()` is incompatible with dictionary compression * Warn that	2023-01-23 13:28:36 -05:00
Nick Terrell	dc2b3e8876	Fix -Wstringop-overflow warning Backported from kernel patch [0]. I wasn't able to reproduce the warning locally, but could repro it in the kernel. [0] https://lore.kernel.org/lkml/20220330193352.GA119296@embeddedor/	2023-01-23 10:12:25 -08:00
Danielle Rozenblit	815d1d4eda	update external sequence error to fit error naming scheme	2023-01-23 09:58:34 -08:00
Danielle Rozenblit	1b65727e74	fix nits and add new error code for invalid external sequences	2023-01-23 07:59:02 -08:00
Yann Collet	d9280afb7d	fixed minor c89 warning introduced due to parallel merges	2023-01-20 18:04:20 -08:00
Nick Terrell	b4467c1061	Fix bufferless API with attached dictionary Fixes #3102.	2023-01-20 16:15:16 -08:00
Nick Terrell	329169189c	Replace Huffman boolean args with flags bit set	2023-01-20 14:12:53 -08:00
Nick Terrell	0cc1b0cb22	Delete unused Huffman functions Remove all Huffman functions that aren't used by zstd.	2023-01-20 14:12:53 -08:00
Yann Collet	6742f20a7f	Merge pull request #3435 from facebook/c89build added c89 build test to CI	2023-01-20 14:07:12 -08:00
Nick Terrell	666944fbe6	Cap hashLog & chainLog to ensure that we only use 32 bits of hash * Cap shortCache chainLog to 24 * Cap row match finder hashLog so that rowLog <= 24 * Add unit tests to expose all cases. The row match finder unit tests are only run in 64-bit mode, because they allocate ~1GB. Fixes #3336	2023-01-20 14:05:26 -08:00
Danielle Rozenblit	aa385ece13	fix sequence validation and bounds check in ZSTD_copySequencesToSeqStore()	2023-01-20 10:32:35 -08:00
Yann Collet	ea684c335a	added c89 build test to CI	2023-01-19 14:59:30 -08:00
Elliot Gorokhovsky	bce0382c82	Bugfixes for the External Matchfinder API (#3433 ) * external matchfinder bugfixes + tests * small doc fix	2023-01-19 10:41:24 -05:00
daniellerozenblit	dc1c6cc5df	Merge pull request #3418 from daniellerozenblit/fuzz-max-block-size Fuzz on maxBlockSize	2023-01-19 08:18:04 -05:00
Danielle Rozenblit	8353a4b095	fix maxBlockSize resolution + add test cases	2023-01-17 12:24:18 -08:00
Felix Handte	23a356cdde	Merge pull request #3424 from felixhandte/disable-asan-msan-poison-mingw Disable Custom ASAN/MSAN Poisoning on MinGW Builds	2023-01-17 12:41:41 -05:00
Elliot Gorokhovsky	5d8cfa6b96	Deprecate advanced streaming functions (#3408 ) * deprecate advanced streaming functions * remove internal usage of the deprecated functions * nit * suppress warnings in tests/zstreamtest.c * purge ZSTD_initDStream_usingDict * nits * c90 compat * zstreamtest.c already disables deprecation warnings! * fix initDStream() return value * fix typo * wasn't able to import private symbol properly, this commit works around that * new strategy for zbuff * undo zbuff deprecation warning changes * move ZSTD_DISABLE_DEPRECATE_WARNINGS from .h to .c	2023-01-13 14:51:47 -05:00
W. Felix Handte	d78fbedd96	Don't Even Declare Poisoning Functions if Poisoning is Disabled This guarantees that we won't accidentally forget to check the macro somewhere where we use these functions.	2023-01-13 11:56:48 -05:00
W. Felix Handte	f10922a8fa	Disable Custom ASAN/MSAN Poisoning on MinGW Builds Addresses #3240.	2023-01-13 11:53:09 -05:00
Danielle Rozenblit	14b8defb86	move ZSTD_BLOCKSIZE_MAX_MIN to static linking only section	2023-01-13 07:00:50 -08:00
Yann Collet	d5509080bc	Merge pull request #3419 from facebook/fix3416 fix root cause of #3416	2023-01-13 00:21:08 -08:00
Nick Terrell	5b266196a4	Add support for in-place decompression * Add a function and macro ZSTD_decompressionMargin() that computes the decompression margin for in-place decompression. The function computes a tight margin that works in all cases, and the macro computes an upper bound that will only work if flush isn't used. * When doing in-place decompression, make sure that our output buffer doesn't overlap with the input buffer. This ensures that we don't decide to use the portion of the output buffer that overlaps the input buffer for temporary memory, like for literals. * Add a simple unit test. * Add in-place decompression to the simple_round_trip and stream_round_trip fuzzers. This should help verify that our margin stays correct.	2023-01-12 16:28:08 -08:00
Yann Collet	ac45e078a5	add explanation about new test as requested by @terrelln	2023-01-12 15:49:01 -08:00
Yann Collet	796699c0bc	fix root cause of #3416 A minor change in 5434de0 changed a `<=` into a `<`, and as an indirect consequence allowed compression attempt of literals when there are only 6 literals to compress (previous limit was effectively 7 literals). This is not in itself a problem, as the threshold is merely an heuristic, but it emerged a bug that has always been there, and was just never triggered so far due to the previous limit. This bug would make the literal compressor believes that all literals are the same symbol, but for the exact case where nbLiterals==6, plus a pretty wild combination of other limit conditions, this outcome could be false, resulting in data corruption. Replaced the blind heuristic by an actual test for all limit cases, so that even if the threshold is changed again in the future, the detection of RLE mode will remain reliable.	2023-01-12 15:41:08 -08:00
Danielle Rozenblit	06b096db47	additional tests and documentation updates + allow maxBlockSize to be set to 0 (goes to default)	2023-01-12 13:41:50 -08:00
Danielle Rozenblit	53eb5a758c	add simple test for maxBlockSize expected functionality	2023-01-12 08:55:39 -08:00
Danielle Rozenblit	1fffcfe01d	update minimum threshold for max block size	2023-01-11 11:09:57 -08:00
Danielle Rozenblit	fe08137d9a	resolve max block value in cctx and use when calculating the max block size	2023-01-09 07:53:53 -08:00
Yann Collet	71dbe8f9d4	minor: fix conversion warnings	2023-01-04 20:00:04 -08:00
daniellerozenblit	d913417f72	Merge branch 'dev' into fuzz-max-block-size	2023-01-04 16:34:07 -05:00
Danielle Rozenblit	908e812733	initial commit	2023-01-04 13:01:54 -08:00
Yann Collet	ebba9ff425	update regression results	2023-01-03 14:04:23 -08:00
Yann Collet	5434de01e2	improve compression ratio of small alphabets fix #3328 In situations where the alphabet size is very small, the evaluation of literal costs from the Optimal Parser is initially incorrect. It takes some time to converge, during which compression is less efficient. This is especially important for small files, because there will not be enough data to converge, so most of the parsing is selected based on incorrect metrics. After this patch, the scenario ##3328 gets fixed, delivering the expected 29 bytes compressed size (smallest known compressed size).	2023-01-03 12:22:37 -08:00

1 2 3 4 5 ...

4422 Commits